3rd Edition. — Abingdon, UK: Chapman and Hall/CRC, 2017. — 625 p. — (Computer Science and Data Analysis Series) — ISBN13: 978-1-4987-7606-6.
Exploratory Data Analysis (EDA) is an important part of the data analysis process. The methods presented in this text are ones that should be in the toolkit of every data scientist. As computational sophistication has increased and data sets have grown in size and complexity, EDA has become an even more important process for visualizing and summarizing data before making assumptions to generate hypotheses and models.
Exploratory Data Analysis with MatLAB, Third Edition presents EDA methods from a computational perspective and uses numerous examples and applications to show how the methods are used in practice. The authors use MatLAB code, pseudo-code, and algorithm descriptions to illustrate the concepts. The MatLAB code for examples, data sets, and the EDA Toolbox are available for download on the book’s website.
New to the Third EditionRandom projections and estimating local intrinsic dimensionality
Deep learning autoencoders and stochastic neighbor embedding
Minimum spanning tree and additional cluster validity indices
Kernel density estimation
Plots for visualizing data distributions, such as beanplots and violin plots
A chapter on visualizing categorical data
Introduction to Exploratory Data AnalysisIntroduction to Exploratory Data Analysis
EDA as Pattern DiscoveryDimensionality Reduction — Linear Methods
Dimensionality Reduction — Nonlinear Methods
Data Tours
Finding Clusters
Model-Based Clustering
Smoothing Scatterplots
Graphical Methods for EDAVisualizing Clusters
Distribution Shapes
Multivariate Visualization
Visualizing Categorical Data
Proximity Measures
Software Resources for EDA
Description of Data Sets
MatLAB Basics