Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Applied Statistics and Data Preparation with Python
Rating: 1.4 out of 5(2 ratings)
526 students

Applied Statistics and Data Preparation with Python

Applied Statistics with Python
Created byGoh Ming Hui
Last updated 2/2019
English

What you'll learn

  • Applied Statistics using Python

Course content

1 section47 lectures1h 46m total length
  • Getting Started10:48
  • Getting Started 22:04
  • Getting Started 32:52
  • Getting Started 45:40
  • Data Mining Process5:37
  • Download Dataset1:11
  • Read CSV2:03
  • Mode5:23
  • Median1:17

    Explore computing medians for columns in a dataset using the median function in Python, and interpret the medians for different variables.

  • Mean0:59
  • Range1:38

    Identify range as the difference between the largest and smallest values, compute with max and min functions, and print and compare the min and max to understand data spread.

  • Range One Column1:28
  • Quantile3:11
  • Variance1:11
  • Standard Deviation1:10
  • Histogram3:43
  • QQ Plot3:00
  • Shapiro Test3:24
  • Skewness1:00
  • Kurtosis1:04
  • Describe Function1:06
  • Correlation2:23
  • Covariance0:41

    Explore covariance, a measure of variability between two variables, and learn to compute it using R, with notes on variances.

  • One Sample T Test6:18
  • Two Sample TTest3:31
  • Two Sample TTest1:35
  • Two Sample TTest1:58
  • Chi Square Test2:44
  • ANOVA3:23
  • Regression Analysis6:21
  • Multiple Regression Analysis3:21
  • Data Processing: DF.Head()1:13

    Learn date preparation and data processing in Python after importing data, including selecting data, handling missing values, and inspecting data. Use df.head(10) to view the first 10 rows.

  • Data Processing: DF.Tail()0:16
  • Data Processing: DF.Describe()0:21
  • Data Processing: Select Variable or Column0:24
  • Data Processing: Select Variable or Column0:25
  • Data Processing: Select Rows0:41
  • Data Processing: Select Rows and Variables0:51
  • Data Processing: Remove Variables0:27
  • Data Processing: Append Rows1:26
  • Data Processing: Sort Variables and Columns1:10
  • Data Processing: Rename Variables2:39
  • Data Processing: GroupBy1:58
  • Data Processing: Remove Missing Values0:37
  • Data Processing: Is there Missing Values0:32
  • Data Processing: Replace Missing Values0:22
  • Data Processing: Remove Duplicates0:39

Requirements

  • Fundamentals Python programming

Description

Why learn Data Analysis and Data Science?


According to SAS, the five reasons are


1. Gain problem solving skills

The ability to think analytically and approach problems in the right way is a skill that is very useful in the professional world and everyday life.


2. High demand

Data Analysts and Data Scientists are valuable. With a looming skill shortage as more and more businesses and sectors work on data, the value is going to increase.


3. Analytics is everywhere

Data is everywhere. All company has data and need to get insights from the data. Many organizations want to capitalize on data to improve their processes. It's a hugely exciting time to start a career in analytics.


4. It's only becoming more important

With the abundance of data available for all of us today, the opportunity to find and get insights from data for companies to make decisions has never been greater. The value of data analysts will go up, creating even better job opportunities.


5. A range of related skills

The great thing about being an analyst is that the field encompasses many fields such as computer science, business, and maths.  Data analysts and Data Scientists also need to know how to communicate complex information to those without expertise.


The Internet of Things is Data Science + Engineering. By learning data science, you can also go into the Internet of Things and Smart Cities.


This is the bite-size course to learn Python Programming for Applied Statistics. In CRISP-DM data mining process, Applied Statistics is at the Data Understanding stage. This course also covers Data processing, which is at the Data Preparation Stage. 

You will need to know some Python programming, and you can learn Python programming from my "Create Your Calculator: Learn Python Programming Basics Fast" course.  You will learn Python Programming for applied statistics.


You can take the course as follows, and you can take an exam at EMHAcademy to get SVBook Certified Data Miner using Python certificate : 

- Create Your Calculator: Learn Python Programming Basics Fast (R Basics)

- Applied Statistics using Python with Data Processing (Data Understanding and Data Preparation)

- Advanced Data Visualizations using Python with Data Processing (Data Understanding and Data Preparation, in the future)

- Machine Learning with Python (Modeling and Evaluation)


Content

  1. Getting Started

  2. Getting Started 2

  3. Getting Started 3

  4. Data Mining Process

  5. Download Data set

  6. Read Data set

  7. Mode

  8. Median

  9. Mean

  10. Range

  11. Range One Column

  12. Quantile

  13. Variance

  14. Standard Deviation

  15. Histogram

  16. QQPLot

  17. Shapiro Test

  18. Skewness and Kurtosis

  19. Describe()

  20. Correlation

  21. Covariance

  22. One Sample T Test

  23. Two Sample TTest

  24. Chi-Square Test

  25. One Way ANOVA

  26. Simple Linear Regression

  27. Multiple Linear Regression

  28. Data Processing: DF.head()

  29. Data Processing: DF.tail()

  30. Data Processing: DF.describe()

  31. Data Processing: Select Variables

  32. Data Processing: Select Rows

  33. Data Processing: Select Variables and Rows

  34. Data Processing: Remove Variables

  35. Data Processing: Append Rows

  36. Data Processing: Sort Variables

  37. Data Processing: Rename Variables

  38. Data Processing: GroupBY

  39. Data Processing: Remove Missing Values

  40. Data Processing: Is THere Missing Values

  41. Data Processing: Replace Missing Values

  42. Data Processing: Remove Duplicates

Who this course is for:

  • Beginner Data Scientist or Analyst interested in Python programming