CRC, 2017. — 342 p. — ISBN: 978-1138088634. — (Chapman & Hall/CRC The R Series).
Building on over thirty years’ experience in teaching and practicing data science, the author encourages a programming-by-example approach to ensure students and practitioners attune to the practice of data science while building their data skills. Proven frameworks are provided as reusable templates. Real world case studies then provide insight for the data scientist to swiftly adapt the templates to new tasks and datasets.
The book begins by introducing data science. It then reviews R’s capabilities for analysing data by writing computer programs. These programs are developed and explained step by step. From analysing and visualising data, the framework moves on to tried and tested machine learning techniques for predictive modeling and knowledge discovery. Literate programming and a consistent style are a focus throughout the book.