
This course includes our updated coding exercises so you can practice your skills as you learn.
See a demo
Launch Jupyter notebook via Anaconda Navigator, create and save notebooks in a Python folder, and master code and markdown cells plus shortcuts like shift-enter, a/b, and y/m.
Master Python lists by reading, adding, and removing items, counting length, and testing membership with in. Build lists from ranges using start, stop, and step to generate larger sequences.
Explore how Python uses logical operators to compare values and produce booleans, distinguishing assignment from equality and covering not equal, and order comparisons, including case sensitivity in strings.
Explore descriptive statistics by examining dispersion through quartiles, median, and interquartile range, and learn to visualize with box and whisker plots in Python.
Explore mutually exclusive, independent, and complementary events, and apply the rule of multiplication and addition to calculate probabilities using coin flips, jar draws, and dice.
Use NumPy's random to simulate probability concepts from coin flips to die rolls, with seeds for reproducibility, and count outcomes. Visualize results with Matplotlib and Seaborn count plots.
Explore the binomial distribution for n independent trials with two outcomes and constant p. Use P(X=x)=C(n,x)p^x(1-p)^{n-x} to compute probabilities and the mean np and variance np(1-p).
Demonstrate binomial distribution with numpy by simulating coin flips (1, 2, and 10 coins), visualize results with seaborn, and compare simulated versus theoretical outcomes for 20-sample defects.
Explore the Poisson distribution, where possibilities are infinite, as a counterpart to binomial for discrete data, emphasizing mu as the average number of successes and the probability of X successes.
Learn to convert any normal distribution to the standard normal with z-scores, then read the z-table for probabilities and the 68/95/99.7 percent areas.
Examine descriptive statistics in Python, covering central tendency and dispersion with statistics, NumPy, and SciPy.stats. Learn axis-based means on 2d arrays and shape, with binomial, Poisson, and normal distributions.
Learn to manage pandas DataFrame columns: select single or multiple columns, create a volume column from length, width, height, and drop columns with axis.
Learn to create bar plots with matplotlib.pyplot in Python from discrete data, using simple and advanced examples to count categories and visualize with plt.bar and plt.show.
Explore the Titanic dataset with Seaborn in Python to load data, inspect structure, and analyze age and survival outcomes using histograms, KDE plots, and categorical visuals.
Understand a one sample z test through a perfume bottle example: test if the mean volume differs from 150 cc with n=100 and sigma=2 at 95% confidence, concluding no change.
practice performing a one-sample z test on bottle volume data using Python, pandas to read CSV files, and weightstats from statsmodels to obtain z- and p-values.
Explore conducting a one proportion test in Python using the binomial distribution with scipy.stats binom_test. Analyze smoking data, compare observed smokers to 0.21 expectation, and interpret p-values without normal approximation.
Learn a two-tailed one-variance test using chi-square to assess if the population variance changes from an established value, with df=50, sample sd 2.35, 90% confidence, and interpreting critical values.
Perform a two proportions test in python, comparing 30/200 vs 10/100 defectives, using statsmodels proportions with score method to reflect the p-value result.
Use contingency tables and chi-square to test relationships between two discrete variables, with null and alternative hypotheses, observed versus expected counts, and practical examples like gender and smoking.
Perform simple or complex statistical calculations using Python! - You don't need to be a programmer for this :)
You are not expected to have any prior knowledge of Python. I will start with the basics. Coding exercises are provided to test your learnings.
The course not only explains, how to conduct statistical tests using Python but also explains in detail, how to perform these using a calculator (as if, it was the 1960s). This will help you in gaining the real intuition behind these tests.
Learn statistics, and apply these concepts in your workplace using Python.
The course will teach you the basic concepts related to Statistics and Data Analysis, and help you in applying these concepts. Various examples and data-sets are used to explain the application.
I will explain the basic theory first, and then I will show you how to use Python to perform these calculations.
The following areas of statistics are covered:
Descriptive Statistics - Mean, Mode, Median, Quartile, Range, Inter Quartile Range, Standard Deviation.
Data Visualization - Commonly used plots such as Histogram, Box and Whisker Plot and Scatter Plot, using the Matplotlib.pyplot and Seaborn libraries.
Probability - Basic Concepts, Permutations, Combinations
Population and Sampling - Basic concepts
Probability Distributions - Normal, Binomial and Poisson Distributions
Hypothesis Testing - One Sample and Two Samples - z Test, t-Test, F Test and Chi-Square Test
ANOVA - Perform Analysis of Variance (ANOVA) step by step doing the manual calculation and by using Python.
The Goodness of Fit and the Contingency Tables.