Applied Statistics and Data Preparation with Python
Applied Statistics with Python
Description
Why learn Data Analysis and Data Science?
According to SAS, the five reasons are
1. Gain problem solving skills
The ability to think analytically and approach problems in the right way is a skill that is very useful in the professional world and everyday life.
2. High demand
Data Analysts and Data Scientists are valuable. With a looming skill shortage as more and more businesses and sectors work on data, the value is going to increase.
3. Analytics is everywhere
Data is everywhere. All company has data and need to get insights from the data. Many organizations want to capitalize on data to improve their processes. It's a hugely exciting time to start a career in analytics.
4. It's only becoming more important
With the abundance of data available for all of us today, the opportunity to find and get insights from data for companies to make decisions has never been greater. The value of data analysts will go up, creating even better job opportunities.
5. A range of related skills
The great thing about being an analyst is that the field encompasses many fields such as computer science, business, and maths. Data analysts and Data Scientists also need to know how to communicate complex information to those without expertise.
The Internet of Things is Data Science + Engineering. By learning data science, you can also go into the Internet of Things and Smart Cities.
This is the bite-size course to learn Python Programming for Applied Statistics. In CRISP-DM data mining process, Applied Statistics is at the Data Understanding stage. This course also covers Data processing, which is at the Data Preparation Stage.
You will need to know some Python programming, and you can learn Python programming from my "Create Your Calculator: Learn Python Programming Basics Fast" course. You will learn Python Programming for applied statistics.
You can take the course as follows, and you can take an exam at EMHAcademy to get SVBook Certified Data Miner using Python certificate :
- Create Your Calculator: Learn Python Programming Basics Fast (R Basics)
- Applied Statistics using Python with Data Processing (Data Understanding and Data Preparation)
- Advanced Data Visualizations using Python with Data Processing (Data Understanding and Data Preparation, in the future)
- Machine Learning with Python (Modeling and Evaluation)
Content
Getting Started
Getting Started 2
Getting Started 3
Data Mining Process
Download Data set
Read Data set
Mode
Median
Mean
Range
Range One Column
Quantile
Variance
Standard Deviation
Histogram
QQPLot
Shapiro Test
Skewness and Kurtosis
Describe()
Correlation
Covariance
One Sample T Test
Two Sample TTest
Chi-Square Test
One Way ANOVA
Simple Linear Regression
Multiple Linear Regression
Data Processing: DF.head()
Data Processing: DF.tail()
Data Processing: DF.describe()
Data Processing: Select Variables
Data Processing: Select Rows
Data Processing: Select Variables and Rows
Data Processing: Remove Variables
Data Processing: Append Rows
Data Processing: Sort Variables
Data Processing: Rename Variables
Data Processing: GroupBY
Data Processing: Remove Missing Values
Data Processing: Is THere Missing Values
Data Processing: Replace Missing Values
Data Processing: Remove Duplicates
What You Will Learn!
- Applied Statistics using Python
Who Should Attend!
- Beginner Data Scientist or Analyst interested in Python programming