CRC Press, 2015. — 544 p. — (The R Series). — ISBN: 1482234815, 9781482234817
Data Science in R: A Case Studies Approach to Computational Reasoning and Problem Solving illustrates the details involved in solving real computational problems encountered in data analysis. It reveals the dynamic and iterative process by which data analysts approach a problem and reason about different ways of implementing solutions.
The book’s collection of projects, comprehensive sample solutions, and follow-up exercises encompass practical topics pertaining to data processing, including:
Non-standard, complex data formats, such as robot logs and email messages
Text processing and regular expressions
Newer technologies, such as Web scraping, Web services, Keyhole Markup Language (KML), and Google Earth
Statistical methods, such as classification trees, k-nearest neighbors, and naive Bayes
Visualization and exploratory data analysis
Relational databases and Structured Query Language (SQL)
Simulation
Algorithm implementation
Large data and efficiency