Grünwald P.D. The Minimum Description Length Principle

The MIT Press, Adaptive Computation and Machine Learning series, 2007.
703 p. ISBN: 0262072815, 978-0262072816.

The minimum description length (MDL) principle is a powerful method of inductive inference, the basis of statistical modeling, pattern recognition, and machine learning. It holds that the best explanation, given a limited set of observed data, is the one that permits the greatest compression of the data. MDL methods are particularly well suited to model selection, prediction, and estimation problems in situations where the models under consideration can be arbitrarily complex and overfitting the data is a serious concern.
This extensive, step-by-step introduction to the MDL principle provides a comprehensive reference (with an emphasis on conceptual issues) that is accessible to graduate students and researchers in statistics, pattern classification, machine learning, and data mining, to philosophers interested in the foundations of statistics, and to researchers in other applied sciences that involve model selection, including biology, econometrics, and experimental psychology.
Part I provides a basic introduction to MDL and an overview of the concepts in statistics and information theory needed to understand MDL. Part II treats universal coding, the information-theoretic notion on which MDL is built, and Part III gives a formal treatment of MDL theory as a theory of inductive inference based on universal coding. Part IV provides a comprehensive overview of the statistical theory of exponential families with an emphasis on their information-theoretic properties. The text includes a number of summaries, paragraphs offering the reader a "fast track" through the material, and boxes highlighting the most important concepts.

Part I: Introductory Material
  Learning, Regularity, and Compression
  Probabilistic and Statistical Preliminaries
  Information-Theoretic Preliminaries
  Information-Theoretic Properties of Statistical Models
  Crude Two-Part Code MDL
Part II: Universal Coding
  Universal Coding with Countable Models
  Parametric Models: Normalized Maximum Likelihood
  Parametric Models: Bayes
  Parametric Models: Prequential Plug-in
  Parametric Models: Two-Part
  NML with Infinite Complexity
  Linear Regression
  Beyond Parametrics
Part III: Refined MDL
  MDL Model Selection
  MDL Prediction and Estimation
  MDL Consistency and Convergence
  MDL in Context
Part IV: Additional Background
  The Exponential or Maximum Entropy Families
  Information-Theoretic Properties of Exponential Families