Applied Statistics for Data Science: A Hands-On Approach!
Build An Intuitive Understanding Using Python code: Histograms, CLT, Testing, Distributions, Correlation and much more!
Description
Welcome to the course on Statistics For Data Scientists!
Learn about the key concepts in statistics, and how to apply them to your data analysis.
A highly practical and hands-on approach.
A focus on building an intuitive understanding of each topic.
Learn to use Python code to simulate various scenarios in a plug-and-play manner.
What is included in the course:
Detailed Course Notes (100 page textbook with 50+ illustrative figures)
Deck of 360 slides
Lectures with 10h+ content spread over 40+ videos
All of the code in Jupyter Notebooks (7 notebooks, 2000+ lines of code)
Bonus Chapter: Introduction to Machine Learning
Topics that the course covers:
The Histogram
Generating artificial Data sets
The central tenet of Statistics
The Central Limit Theorem
Distribution functions
Percentiles
Data Ranges
Cumulative Distribution Function
Different Distribution types:
Normal Distribution
Uniform Distribution
Exponential Distribution
Poisson Distribution
Bernoulli Distribution
Rayleigh Distribution
Statistical Testing
Reasoning behind statistical testing
P-value
Statistical Significance
Different Statistical Tests:
Shapiro-Wilk test
Levene's test
Student T-test/ Welsh T-test
ANOVA test
Kolmogorov Smirnov test
Non-parametric tests
Two real-life examples
Detect a biased coin with 95% certainty
Real-life A/B testing
Correlation
Linear correlation - Pearson correlation coefficient + alternatives
Categorical correlation - Chi-Squared test + contingency tables
EXTRA: Regression and intro to Machine Learning
Linear Regression
Logistic Regression + ML pipeline
Who is this course for:
Students on a data science track, or any other technical field.
Professionals that want to pivot into a data science career.
Managers that want to be able to make data driven decisions.
Practicing Data Scientists that want to add this value skill to their tool belt.
What You Will Learn!
- Perform elaborate and involved Data Analysis on any dataset.
- Build an intuitive understanding of concept in Statistics: Sample, Population, Correlation, P-value, Significance, and others.
- Be able to write Python code that generates elaborate and beautiful Visuals.
- Make Simulations using Python code that showcase various Statistical Concepts.
- Be able to perform various Statistical Tests using Python (Student T-test, Welsh's Test, Levene's Test, Shapiro-Wilk test, ...)
- Be able to build a Machine Learning model to predict outcomes based on linear and logistic regression.
Who Should Attend!
- Students on a Data Science track or other technical field.
- Professionals that want to pivot towards a data science career.
- Active Data Scientists that want to add statistical knowledge and intuition to their tool belt.
- Managerial Roles in technical fields that want to up their skill to make better decisions about data.