
Learn how to manage, manipulate, and analyze data with Stata, covering basics to advanced data visualization across its three sub courses: Essentials, tips for data, and data visualization in Stata.
Explore the Stata interface across Mac and PC, navigate the main window with results, command, and variable panes, and customize preferences for specialized editors.
Master loading and importing data in Stata with clear, use, system use (cissus), input, and import, including Excel imports with first row as variable names and sheet options.
Learn to view and edit raw data in Stata using the browse and edit commands, filter with if and in, and insert a new variable.
Explore basic descriptive statistics in Stata using describe and summarize to understand observations, variables, and data features, then learn how the codebook command generates a data codebook.
Explore converting string dates to numeric in Stata, using the date function with mdy or ymd, extracting year, month, and day, and formatting for analysis.
Create bar graphs and dot charts in Stata, comparing category frequencies and summary statistics like price and standard deviation, with graphical user interface and do-file options.
Explore how to graph distributions in Stata, using histogram, kernel density plots, quantile plots, box plots, and glider to assess skewness and compare groups.
Explore pie charts to visualize categorical data with Stata, first via the graphical user interface, then by writing do-file code, including customization and group comparisons like domestic vs foreign.
Visualize three-dimensional data in two dimensions with contour plots in Stata, using a z variable to define contours and explore continuous by continuous interactions via margins.
Apply the jitter option to scatterplots to add random noise, revealing data distribution while protecting confidentiality. See how jitter clarifies crowded or binary data and enables visualization of sensitive relationships.
Learn how sunflower plots in Stata reveal data density in large bivariate datasets using sunflowers, hexagons, and petals, with bin width adjustments and optional two-way plots.
Learn to use sunflower plots in Stata to visualize density distributions of bivariate data with hexagons, petals, and color cues for high, medium, and low density.
Explore diagnostic statistics and interpretation of logit and probit regression, including goodness-of-fit measures, pseudo r-squared, classification tables, odds ratios, and marginal effects via margins.
Learn how to model fractional data bounded between 0 and 1 with beta regression, fractional logit/probit, and zero inflated beta regression in Stata, ensuring valid predictions.
Explore generating random numbers in Stata for Monte Carlo simulations, using a data generating process to control the environment, including normal and uniform draws with seeds.
Learn to perform Monte Carlo simulation in Stata by repeatedly generating random data, running regressions, and evaluating estimator bias through summaries and density plots.
[Updated with new Audio in 2025]
The Complete Guide to Stata
Are you ready to harness the full power of Stata for data analysis? Whether you’re a complete beginner or an experienced analyst looking to sharpen your skills, this course provides a thorough, hands-on introduction to Stata’s most useful features. You’ll learn to manipulate, explore, visualize, and model complex datasets—all while developing “good practice” habits that will help you code efficiently, interpret results correctly, and present your findings with confidence.
This course is split into three main sub-courses to guide your learning:
Stata Fundamentals – A step-by-step introduction to the essentials of Stata, from data loading and cleaning to basic descriptive statistics and graphing.
Stata Tips and Tricks – Explore 125 short, standalone tips to help you solve common challenges, speed up your workflow, and uncover hidden capabilities within Stata.
Advanced Data Visualization – Master a wide variety of visualization techniques, learning how (and when) to use each one effectively.
I'll consistently focus on practical application - rather than diving into lengthy statistical theory - and show you how to implement and interpret commonly used statistical methods with real-world data.
What You’ll Learn
Getting Started: Install and navigate Stata with ease.
Data Exploration and Management: Load, view, clean, and manipulate datasets for proper analysis.
Basic and Advanced Visualizations: Create histograms, box plots, scatter plots, violin plots, spike plots, line charts, and more—while understanding the pros and cons of each.
Statistical Analysis:
Correlation and ANOVA
Regression (including diagnostics and model building)
Hypothesis Testing
Binary Outcome Models (Logit/Probit)
Fractional Response Models
Categorical Choice Models (Ordered Logit/Multinomial Logit)
Simulation Techniques
Count Data Models (Poisson/Negative Binomial)
Survival Data Analysis (Parametric, Cox-Proportional Hazard, Parametric Survival Regression)
Panel Data Analysis (including Lags, Leads, Fixed/Random Effects, Hausman Tests)
Difference-in-Differences Analysis
Instrumental Variable Regression (Endogenous Variables, Sample Selection, Non-Linear Endogenous Models)
Epidemiological Tables: Cohort studies, case-control studies, and matched case-control studies.
Power Analysis: Determine sample size, power size, and effect size.
Matrix Operations: Operators, functions, and subscripting.
You’ll also get 125 “Tips and Tricks” to help you become a Stata power user, covering topics like data management, graphing, statistics, and programming. Each standalone tip takes just a couple of minutes to learn and immediately apply.
Finally, the Advanced Data Visualization portion of the course will guide you through a diverse range of graphing techniques—from histograms and rootograms to bubble plots and mosaic plots—giving you the skills to present data in clear, compelling ways.
Prerequisites and Target Audience
Prerequisites: No prior experience with Stata is required. Familiarity with basic quantitative concepts is helpful but not mandatory.
Who Should Enroll: Anyone who wants to enhance their data analytics skill set with professional Stata techniques, including students, researchers, data scientists, and analysts in business, academia, and the public sector.
Suggested Learning Paths
Depending on your goals, you may wish to focus on certain sections:
Basic Stata Fundamentals: Sections 2, 3, 4, 5, 6, 7, 8
Advanced Stata Concepts: Sections 8, 9, 10, 11, 12, 13, 14, 15, 16, 17
Quick Tips and Tricks: Sections 18, 19, 20, 21
Data Visualization: Sections 5, 21, 22, 23, 24, 25, 26
Data Management: Sections 3, 4, 18
Take this course to experience Stata at its finest; learn the coding, analytic, and visualization skills you need to excel in modern data analysis, and gain the confidence to tackle real-life projects with competence and clarity.