Sign up
Forgot password?
FAQ: Login

Apeltsin Leonard. Data Science Bookcamp: Five real-world Python projects

  • zip file
  • size 18,90 MB
  • contains epub document(s)
Apeltsin Leonard. Data Science Bookcamp: Five real-world Python projects
Manning Publications, 2021. — 706 p. — ISBN: 978-1617296253.
Learn data science with Python by building five real-world projects! Experiment with card game predictions, tracking disease outbreaks, and more, as you build a flexible and intuitive understanding of data science.
In Data Science Bookcamp you will learn:

Techniques for computing and plotting probabilities.
Statistical analysis using Scipy.
How to organize datasets with clustering algorithms.
How to visualize complex multi-variable datasets.
How to train a decision tree machine learning algorithm.
In Data Science Bookcamp you’ll test and build your knowledge of Python with the kind of open-ended problems that professional data scientists work on every day. Downloadable data sets and thoroughly-explained solutions help you lock in what you’ve learned, building your confidence and making you ready for an exciting new data science career.
Purchase of the print book includes a free eBook in PDF, Kindle, and EPUB formats from Manning Publications.
About the technology
A data science project has a lot of moving parts, and it takes practice and skill to get all the code, algorithms, datasets, formats, and visualizations working together harmoniously. This unique book guides you through five realistic projects, including tracking disease outbreaks from news headlines, analyzing social networks, and finding relevant patterns in ad click data.
About the book
Data Science Bookcamp doesn’t stop with surface-level theory and toy examples. As you work through each project, you’ll learn how to troubleshoot common problems like missing data, messy data, and algorithms that don’t quite fit the model you’re building. You’ll appreciate the detailed setup instructions and the fully explained solutions that highlight common failure points. In the end, you’ll be confident in your skills because you can see the results.
What's inside
Web scraping.
Organize datasets with clustering algorithms.
Visualize complex multi-variable datasets.
Train a decision tree machine learning algorithm.
For readers who know the basics of Python. No prior data science or machine learning skills required.
Leonard Apeltsin is the Head of Data Science at Anomaly, where his team applies advanced analytics to uncover healthcare fraud, waste, and abuse.
Case Study 1 Finding the winning strategi in a card game
Computing probabilities using Python.
Plotting probabilities using Matplotlib.
Running random simulations in NumPy.
Case study 1 solution.
Case Study 2 Assessing online ad clicks for significance
Basic probability and statistical analysis using SciPy.
Making predictions using the central limit theorem and SciPy.
Statistical hypothesis testing.
Analyzing tables using Pandas.
Case study 2 solution.
Case Study 3 Tracking disease outbreaks using news headlines
Clustering data into groups.
Geographic location visualization and analysis.
Case study 3 solution.
Case Study 4 Using online job posting to improve your data science resume
Measuring text similarities.
Dimension reduction of matrix data.
NLP analysis of large text datasets.
Extracting text from web pages.
Case study 4 solution.
Case Study 5 Predicting future friendships from social network data
An introduction to graph theory and network analysis.
Dynamic graph theory techniques for node ranking and social network analysis.
Network-driven supervised machine learning.
Training linear classifiers with logistic regression.
Training nonlinear classifiers with decision tree techniques.
Case study 5 solution.
  • Sign up or login using form at top of the page to download this file.
  • Sign up
Up