Name: Stata Programming-Mastering Household Survey Data Processing
Rating: 4.6 (73 reviews)

Created by Themba Chirwa, PhD

Last updated 7/2024

English

What you'll learn

Learn the Stata programming language to turn large raw survey data into processed data that can be used to draw inferences about the population
Develop and gain hands-on skills in applying insights from large survey data sets to inform policy, conduct program evaluations, and research studies
Equip researchers with the specialized technical skills required to navigate, process, and analyze raw databases effectively
Provide practical guidance on data linkage techniques, data cleaning processes, and methods for resolving inconsistencies in integrated datasets
Educate researchers on the importance of standardizing data formats, variables, and coding schemes for improved data analysis and comparison
Outline best practices for data standardization and provide guidelines on harmonizing datasets from different sources to ensure consistency
Offer strategies and tools for integrating data from various sources or surveys to create comprehensive datasets for research purposes

Course content

11 sections • 140 lectures • 25h 18m total length

Prelude
Course Objectives (13:39)

Introduction
Introduction (3:33)
Download World Bank household survey data, set up folders, and save data in Stata; explore module descriptions, observations, and variables, including MP editions and the agricultural household model.
Downloading Household Survey Data (17:26)
Exploring Household Survey Data (16:50)
Ordering Stata and Stata Graphic User Interface (17:54)
Agricultural Household Models - A Conceptual Framework (26:28)
Apply a conceptual framework from Deaton and Zaidi linking household, enterprise, and institution dynamics to consumption aggregates via money-metric utility in survey data processing.

Part A - Setting Up Stata and Generating Covariates
Introduction (3:56)
Set up Stata, define file paths and log files, and load and save data with a do-file for verification. Learn long-to-wide reshaping, covariates, weights, duplicates, and poverty metrics with graphing.
Creating File Paths and Structure (13:26)
Create a do-file, save with a name and date, and configure global data paths and separate data folders (household, consumption, agricultural, fisheries, community) plus output, log, and graph directories.
Setting Up Stata and Log File Paths (11:50)
Creating Section Headers (9:12)
Create and verify a log file, then save data as you define household variables for the Malawi IHS5 poverty assessment using the household questionnaire data.
Loading Data into Stata (11:17)
The Split Command (13:28)
Use the split command to separate interview dates into year, month, and day. Drop extraneous variables, rename, and label new date components for clear survey data analysis.
Creating Covariates through Merging Datasets (17:32)
Creating a Population Weight Variable (11:08)
Creating Covariates through Merging Datasets (12:30)
Dealing with Duplicate Observations (12:06)
Identify and drop duplicate observations in a merged Stata file using household ID and case ID, then remove extraneous variables and save the cleaned dataset.
Creating Covariates through Merging Datasets (11:31)
Creating Covariates Related to the Household Head (14:39)
Create covariates related to the household head by extracting and merging head-specific variables with the variables data file for 11,434 observations, then drop extraneous variables to prepare for consumption aggregates.
Creating Covariates to use in Subsequent Analyses (7:47)
Reshaping Data from Long to Wide (19:39)
Creating an Adult Equivalence Variable (16:05)
Creating Dependency Ratio Variables (10:57)
Creating Dependency Ratio Variables (17:09)
Create and label dependency ratio variables in Stata, including child, elder, and youth dependency ratios, compare them with adult equivalents, and apply 99th percentile rules when workers are zero.
Creating a Land Hectarage Variable (16:36)
Collapsing Observations by Household and Graphing (16:32)
Inspect the land hectare variable, collapse observations by household, and compute summary statistics; then graph distributions with box plots, histograms, spike plots, and kdensity, and winsorize the data.
Creating Land-Related Covariates (12:22)
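
The dependency-ratio lectures in this part describe a simple per-household calculation. The course itself works in Stata; the Python sketch below only illustrates the arithmetic. The age bands (children under 15, elders 65 and over, workers in between) and the function name are assumptions for illustration, not taken from the course. Households with no working-age members have an undefined ratio, so the sketch returns None for them and leaves any substitution rule (the course mentions a 99th percentile rule) to a later step.

```python
def dependency_ratios(ages):
    """Return child, elder, and total dependency ratios for one household.

    Assumed age bands (hypothetical): children < 15, elders >= 65,
    working-age members in between.
    """
    children = sum(1 for age in ages if age < 15)
    elders = sum(1 for age in ages if age >= 65)
    workers = len(ages) - children - elders

    if workers == 0:
        # Ratio is undefined without working-age members; flag it as missing.
        return {"child": None, "elder": None, "total": None}

    return {
        "child": children / workers,
        "elder": elders / workers,
        "total": (children + elders) / workers,
    }


# Hypothetical household: two adults, two children, one elder.
example = dependency_ratios([40, 38, 10, 5, 70])
```

Here `example` holds a child ratio of 1.0 (two children per two workers), an elder ratio of 0.5, and a total ratio of 1.5.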

Part B - Estimating Food Consumption Expenditures
Introduction (3:40)
Learn how to estimate household food consumption expenditures from survey data by analyzing food items, separating consumed, purchased, own-production, and gifts, and building conversion factors, calories, prices, and COICOP classifications.
Describing Food Expenditure Data (10:06)
Estimate household food expenditure by constructing consumption aggregates from module G1 data, merging with conversion factors, and tracing item codes like G02, G03A, and G05.
Viewing Food Item Labels (9:40)
Descriptive Statistics of Food Items (13:48)
Creating a Total Food Consumed Data File (12:34)
Creating a Total Food Consumed Data File (15:32)
Learn to create a total food consumption data file in Stata by defining units and subunits with photo aids, recoding labels, and handling missing values.
Creating a Purchased Food Consumed Data File (9:26)
Creating an Own-Produced Food Consumed Data File (18:50)
Creating a Gifts & Other Sources Food Consumed Data File (11:40)
Creating New Conversion Factors (12:46)
Creating New Conversion Factors (10:30)
Merging and Labeling Conversion Factors (15:01)
Merging and Labeling Conversion Factors (16:52)
Merging Conversion Factors with Total Food Consumed (19:06)
Merging Conversion Factors with Other Food Consumed (15:15)
Collapsing Total Food Grams by Household (12:26)
Collapsing Total Food Grams by Household (11:17)
Collapsing Other Food Grams by Household (10:07)
Merging Food Consumption Files (10:26)
Consolidating Food Consumption Expenditures (8:15)
Revising Calories of Food Items (9:27)
Merging Aggregated Food Consumption with Calories (9:13)
Creating Food Calorie Intake Variables (11:43)
Collapsing Food Calorie Intake for Each Household (11:45)
Collapse food calorie intake by household to sum calories and grams by case ID, relabel variables, and merge with case and variables data to create a food data file.
Generating Food Unit Prices (9:02)
Replacing Missing Food Unit Prices (10:22)
Creating a Paasche Price Index Variable (17:12)
Computing Aggregated Household Food Expenditures (14:08)
Consolidating Aggregated Food Expenditures (18:30)
Consolidate seven-day household food expenditures in Stata by creating a total expenditure from components, handling missing values, identifying outliers, and saving the aggregated dataset.
Generating Food Expenditure Categories based on COICOP (18:00)
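
The Paasche price index built in this part can be illustrated with one common household-level formulation from Deaton and Zaidi (2002): the inverse of the budget-share-weighted sum of reference-to-local price ratios, P_h = 1 / sum_k(w_kh * p0_k / p_kh). The Python sketch below is only an illustration of that formula, not the course's Stata code; the function name, item names, and prices are hypothetical.

```python
def paasche_index(shares, prices, ref_prices):
    """Household-level Paasche price index.

    shares     -- budget share of each item for this household (should sum to 1)
    prices     -- unit price the household faces for each item
    ref_prices -- reference unit price for each item (for example a national median)
    """
    weighted = sum(
        share * ref_prices[item] / prices[item]
        for item, share in shares.items()
    )
    return 1.0 / weighted


# Hypothetical household that spends half its food budget on each of two items
# and pays 20 percent above the reference price for maize.
index = paasche_index(
    shares={"maize": 0.5, "beans": 0.5},
    prices={"maize": 120.0, "beans": 100.0},
    ref_prices={"maize": 100.0, "beans": 100.0},
)
```

An index above 1 means the household faces prices above the reference level, so dividing its nominal expenditure by the index lowers its real expenditure.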

Part C - Estimating Nonfood Consumption Expenditures
Introduction (2:00)
Computing Household Education Expenditures (14:32)
Compute household education expenditures from survey data using Stata, constructing the education expenditure variable from tuition, after-school programs, and boarding costs, and compare with total expenditures to adjust estimates.
Computing Household Education Expenditures (12:28)
Computing Household Education Expenditures (12:08)
Computing Household Health Expenditures (16:32)
Computing Household Housing Utility Expenditures (9:25)
Analyze household utility expenditures by processing rent, fuelwood, electricity, telephone, and cell phone costs using Stata; clean data, select variables, and generate aggregate and annual expense measures.
Computing Household Housing Utility Expenditures (6:55)
Master Stata programming to compute household electricity expenditure, create consolidated cost variables, and analyze daily, weekly, and monthly estimates for utilities and related costs.
Computing Household Housing Utility Expenditures (14:39)
Computing Household Nonfood Expenditures - One Week Recall (10:21)
Computing Household Nonfood Expenditures - One Month Recall (8:59)
Computing Household Nonfood Expenditures - Three Month Recall (10:55)
Computing Household Nonfood Expenditures - One Year Recall (8:55)
Analyze the one-year recall of household nonfood expenditures (module K) by recoding and labeling items into expenditure categories, then save the results as module K1 and proceed to K2.
Computing Household Nonfood Expenditures - One Year Recall (10:47)
Computing Household Housing Rental Expenditures (11:47)
Computing Household Housing Rental Expenditures (9:45)
Merging Nonfood Expenditure Data Files (4:53)

Part D - Estimating Agriculture Consumption Expenditures
Introduction (0:51)
Understanding the Household Agriculture Questionnaire (7:11)
Explore Part D of the agricultural questionnaire, detailing expenditures, rainy and dry seasons, land ownership, and household production in Malawi, with data preparation steps including variable renaming and merging.
Creating Agriculture Covariates (9:48)
Merge agricultural covariates from module B with the dta file, label and describe garden-related variables (B06, B214), and prepare covariates for regression analysis.
Viewing Household Agriculture Expenditures (4:16)
Creating Agricultural Covariates and Expenditures (10:37)
Creating Agricultural Covariates and Expenditures (7:43)
Learn to generate agricultural expenditure aggregates and covariates in Stata by computing transport, coupon, and bribe costs, labeling and merging datasets across modules E and F.
Creating Agricultural Covariates and Expenditures (11:39)
Generate agricultural covariates and expenditures from module F inputs and costs during the rainy season, then label input types and merge with the dta file to estimate costs.
Creating Agricultural Covariates and Expenditures (11:12)
Creating Agricultural Covariates and Expenditures (5:18)
Generate agricultural expenditure covariates by calculating transport and input costs from module H (H09, H10, H40), collapse by case, and merge into the expenditure file to obtain seven variables.
Creating Agricultural Covariates and Expenditures (11:14)
Apply a consistent Stata workflow to generate covariates and agricultural expenditures from household survey data, including reshaping, handling missing values, and merging covariates into the agricultural data file.
Creating Agricultural Covariates and Expenditures (10:35)
Creating Agricultural Covariates and Expenditures (9:52)
Creating Agricultural Covariates and Expenditures (9:11)
Creating Agricultural Covariates and Expenditures (8:21)
Create agricultural covariates and expenditures by reshaping from long to wide, renaming variables, and handling missing values, then merge to yield 628 covariates across 1,954 observations, including transport costs (O10).
Creating Agricultural Covariates and Expenditures (9:23)
Create agricultural covariates and expenditures from module Q, including crop sales and transport costs. Reshape long to wide, rename and replace variables, then merge with the dta file.
Creating Agricultural Covariates and Expenditures (10:21)
Creating Agricultural Covariates and Expenditures (7:48)
Creating Agricultural Covariates and Expenditures (11:08)
Create agricultural covariates and expenditures in Stata by computing input costs, generating expenditure variables from modules S and T, and merging data for robust household covariate analysis.
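
Several lectures in this part reshape item-level records from long to wide so that each household ends up on a single row (in Stata this is what the `reshape wide` command does). The Python sketch below illustrates only the idea; the helper name, the key names (`case_id`, `crop`, `cost`), and the values are hypothetical and not taken from the survey.

```python
from collections import defaultdict


def reshape_wide(rows, id_key, j_key, value_key, stub):
    """Turn long rows (one per household-item pair) into one wide record per household.

    Each wide column is named stub + item code, mirroring the naming
    that a long-to-wide reshape produces.
    """
    wide = defaultdict(dict)
    for row in rows:
        column = f"{stub}{row[j_key]}"
        wide[row[id_key]][column] = row[value_key]
    return dict(wide)


# Hypothetical long data: input cost by household and crop code.
long_rows = [
    {"case_id": "A1", "crop": 1, "cost": 500},
    {"case_id": "A1", "crop": 2, "cost": 300},
    {"case_id": "B2", "crop": 1, "cost": 150},
]
wide_rows = reshape_wide(long_rows, "case_id", "crop", "cost", "cost")
```

Household B2 has no record for crop 2, so its wide record simply lacks a `cost2` entry. That is the kind of missing value that has to be handled explicitly before the wide file is merged with other covariates.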

Part E - Estimating Fisheries Consumption Expenditures
Introduction (0:39)
Creating Fisheries Covariates (8:15)
Creating Fisheries Covariates (9:07)
Creating Fisheries Covariates and Expenditures (12:25)
Creating Fisheries Covariates and Expenditures (9:53)
Creating Fisheries Covariates and Expenditures (12:07)
Creating Fisheries Covariates and Expenditures (9:06)
Creating Fisheries Covariates and Expenditures (8:37)
Generate covariates and fisheries expenditures for low-season fishing households in Stata; clean, rename, and label variables, compute expenditure categories 144 and 144B, and merge with the fisheries data file.
Creating Fisheries Covariates and Expenditures (9:33)

Part F - Constructing Community Related Variables
Introduction (0:53)
Creating Community-Related Covariates (9:49)
Aggregate and merge community covariates from the district questionnaire, combining modules CA, CD, CF1, CB, and CE for a 710-observation data file ready for regression analysis.
Creating Community-Related Covariates (10:29)
Creating Community-Related Covariates (10:04)
Identify and process community-related covariates from household survey data in Stata, including droughts, floods, price changes, and access to services; drop missing values, remove duplicates, collapse to unique observations, and save the dta file.
Creating Community-Related Covariates (10:18)
Create community-related covariates from the community questionnaire by generating group and resource IDs, reshaping data into variables of interest, and merging with the dta file for analysis.
Creating Community-Related Covariates (11:11)

Part G - Combining Consumption Aggregates
Introduction (2:27)
Combine consumption aggregates to compute 14 real consumption aggregates for poverty analysis, generate area-specific non-food price deflators and adult equivalence scales, then winsorize and graph real consumption for regression.
Linking Theory with Practice (8:11)
Merging All Household Expenditures (10:14)
Labeling Aggregated Consumption Aggregates (9:32)
Generating Price Deflators - Nonfood Price Index (9:33)
Construct a non-food price deflator using Malawi's non-food CPI from April 2019 to April 2020, generate a non-food price index for deflating expenditures, and derive the adult-equivalents denominator.
Generating Adult Equivalence Scales by Area (12:38)
Generate the adult equivalence denominator and factor using Deaton and Zaidi 2002, merge by area, and derive a Paasche price index deflator for real consumption.
Generating Paasche Price Index by Area (4:24)
Merging Price Deflators with Real Consumption Aggregates (5:32)
Generating Real Consumption Aggregates (13:05)
Generate real consumption aggregates by deflating food and non-food expenditures with Paasche and adult-equivalence indices, create per-capita measures, label variables, and save 14 aggregates for poverty analysis.
Generating 14 COICOP Real Expenditure Aggregates (7:45)
Generate fourteen real expenditure aggregates from the expenditure categories using a Paasche price index and adult-equivalence scaling, then label and document variables for poverty analysis and data normalization in Stata.
Winsorizing Real Consumption Aggregates (8:01)
Viewing Graphs of Winsorized and Logs of Real Consumption Aggregates (14:24)
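
The deflation step in this part combines a price index with an adult-equivalence scale. The Python sketch below illustrates the arithmetic using the scale form given in Deaton and Zaidi (2002), AE = (A + alpha * K) ** theta, where A is the number of adults and K the number of children. The parameter values, function names, and household figures are assumptions chosen for illustration; the values the course uses for Malawi are not stated on this page.

```python
def adult_equivalents(adults, children, alpha, theta):
    """Adult-equivalence scale of the form (A + alpha * K) ** theta.

    alpha -- cost of a child relative to an adult (assumed value)
    theta -- economies-of-scale parameter (assumed value; 1.0 means none)
    """
    return (adults + alpha * children) ** theta


def real_consumption(nominal, price_index, equivalents):
    """Deflate nominal household expenditure by prices and household size."""
    return nominal / (price_index * equivalents)


# Hypothetical household: 2 adults, 3 children, nominal expenditure of 7000,
# facing prices equal to the reference level (index of 1.0).
ae = adult_equivalents(adults=2, children=3, alpha=0.5, theta=1.0)
real = real_consumption(nominal=7000.0, price_index=1.0, equivalents=ae)
```

With these assumed parameters the household counts as 3.5 adult equivalents, giving real consumption of 2000 per adult equivalent.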

Part H - Generating Poverty-Related Variables
Introduction (2:43)
Understanding Average Food Calorie Requirements (9:24)
Generating Food Calorie Requirements Variables (13:26)
Computing Food Calorie Consumption per Household (9:46)
Compute calorie consumption per household by merging the food expenditure data with the calories data, then sum carbohydrate, protein, and fat calories for weekly per-household totals.
Constructing Food and Nonfood Poverty Lines (11:40)
Constructing Food and Nonfood Poverty Lines (8:28)
Constructing Food and Nonfood Poverty Lines (10:31)
Learn to construct the food component of the poverty line in Stata by using median calories, Paasche price index, and adult equivalence scales, including labeling and data cleaning.
Constructing Food and Nonfood Poverty Lines (9:36)
Constructing Poor and Ultrapoor Variables (12:44)
Rebasing Real Consumption Aggregates (10:46)
Constructing Other Poverty-Related Variables (7:00)
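
The poor and ultrapoor variables named in this part are indicator variables built by comparing real consumption with poverty lines. The Python sketch below assumes a common convention: a household is poor if its real consumption per adult equivalent is below the total (food plus nonfood) poverty line, and ultrapoor if it is below the food poverty line alone. That convention, the function name, and the line values are assumptions for illustration; the course's exact definitions and line values are not given on this page.

```python
def poverty_status(consumption, food_line, total_line):
    """Return 0/1 indicators for poor and ultrapoor status.

    consumption -- real consumption per adult equivalent
    food_line   -- food poverty line (assumed threshold for ultrapoor)
    total_line  -- food plus nonfood poverty line (assumed threshold for poor)
    """
    return {
        "poor": int(consumption < total_line),
        "ultrapoor": int(consumption < food_line),
    }


# Hypothetical poverty lines in the same units as consumption.
FOOD_LINE = 100_000
TOTAL_LINE = 165_000

statuses = [
    poverty_status(c, FOOD_LINE, TOTAL_LINE)
    for c in (90_000, 120_000, 200_000)
]
```

Under this convention every ultrapoor household is also poor, so the ultrapoor group is a subset of the poor group.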

Requirements

Students and professionals should have a basic understanding of data manipulation and analysis. Familiarity with the Stata programming language is an advantage for engaging fully with the course content, but it is not strictly necessary, as the course is itself a tutorial on programming in Stata.
The course is also geared toward university students and professionals in various disciplines interested in gaining proficiency in Stata to enhance their research and data analysis skills and prepare them for future academic and professional opportunities.

Description

Course Overview:

Embark on a comprehensive online course tailored to elevate researchers' technical proficiency in processing and analyzing raw household survey data using Stata programming.

Explore the significance of data standardization, integration techniques, and best practices for harmonizing datasets from diverse sources exclusively with Stata.

Acquire practical skills through hands-on examples and exercises to excel in data manipulation, cleaning, and analysis using Stata programming.

Course Duration:

Delve into 25.5 hours of content spread across nine online modules, encompassing a wide range of topics from agricultural household models to setting up complex survey designs—all taught through the lens of Stata programming.

Key Learning Objectives:

Enhance technical skills in data processing and analysis through Stata programming.

Advocate for data standardization to enhance analysis and comparability.

Facilitate seamless data integration from various sources for robust research datasets.

Learn to calculate poverty estimates using consumption aggregates derived from the Paasche Price Index and Adult Equivalent Factors with Stata programming.

Who Should Enroll:

This course is ideal for researchers, students pursuing degrees in Economics, Statistics, Public Health, and Social Sciences (Sociology, Psychology, and Political Science focusing on human behavior), as well as academicians and professionals in social and applied sciences. Those seeking to advance their research skills and leverage insights from large survey datasets for policy-making, program evaluations, and decision-making processes will benefit significantly from this Stata-focused course.

Embark on this educational journey to unlock the full potential of advanced integrated household survey data processing with Stata programming at the forefront!

Who this course is for:

University Students: Both undergraduate and graduate students pursuing degrees (e.g., Bachelors, Masters, and PhD) in fields such as economics, statistics, public health, social sciences, and other related disciplines.
Researchers: Professionals involved in research projects who need to work with large household survey data to inform policy decisions and achieve development goals.
Economists: Individuals working in economic research, policy analysis, and development projects who utilize Stata for data analysis and modelling.
Statisticians: Professionals specializing in statistical analysis and data interpretation who seek to enhance their skills in handling complex survey data using Stata.
Public Health Professionals: Those working in the field of public health research, epidemiology, and health policy analysis who rely on Stata for analyzing health-related data from household surveys.
Social Scientists: Researchers in social sciences such as sociology, anthropology, and political science who use Stata for data analysis in their studies on human behavior, society, and politics.

Course content

Prelude • 1 lecture • 14min

Introduction • 5 lectures • 1hr 22min

Part A - Setting Up Stata and Generating Covariates • 20 lectures • 4hr 20min

Part B - Estimating Food Consumption Expenditures • 30 lectures • 6hr 17min

Part C - Estimating Nonfood Consumption Expenditures • 16 lectures • 2hr 45min

Part D - Estimating Agriculture Consumption Expenditures • 18 lectures • 2hr 36min

Part E - Estimating Fisheries Consumption Expenditures • 9 lectures • 1hr 20min

Part F - Constructing Community Related Variables • 6 lectures • 53min

Part G - Combining Consumption Aggregates • 12 lectures • 1hr 46min

Part H - Generating Poverty-Related Variables • 11 lectures • 1hr 46min