Causal Data Science with Directed Acyclic Graphs

Name: Causal Data Science with Directed Acyclic Graphs
Rating: 4.6 (583 reviews)

Get to know the modern tools for causal inference from machine learning and AI, with many practical examples in R

Created by Paul Hünermund

Last updated 9/2020

English

What you'll learn

Causal inference in data science and machine learning
How to work with directed acyclic graphs (DAGs)
Newest developments in causal AI

Course content

7 sections • 27 lectures • 4h 57m total length

Welcome • 15:41
Explore why causal data science matters, introduce directed acyclic graphs, and preview core concepts like Simpson's paradox, confounding, and transportability.

Directed Acyclic Graphs • 5:21
Represent causal structures with graphs, where nodes are variables and edges link them; understand directed versus undirected graphs, paths and cycles, and how directed acyclic graphs relate to causal interpretation.
Structural Causal Models • 4:18
D-Separation • 16:31
Explore how directed acyclic graphs reveal conditional independence through d-separation, showing how chains, forks, and colliders block or open paths between variables for causal inference.
Interventions • 12:31
Explore causal inference by modeling interventions with the do-operator in structural causal models. Trace post-intervention distributions of Y and use counterfactual reasoning to predict the outcomes of actions.
R Examples • 15:05
Explore practical causal inference with a simulated DAG in R, using tidyverse and ggdag to show how intervening on X shifts Y and clarifies causation.
Appendix • 6:49
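
As a rough illustration of what the Interventions and R Examples lectures describe, the sketch below simulates a toy structural causal model and compares the observational association between X and Y with the effect of setting X by intervention. It uses Python with only the standard library rather than the R, tidyverse, and ggdag setup named in the course, and the graph (Z -> X, Z -> Y, X -> Y), the coefficients, and the function names are assumptions made for this example, not material from the course.

```python
import random
import statistics


def simulate(n, do_x=None, seed=0):
    """Draw n units from an assumed toy SCM: Z -> X, Z -> Y, X -> Y.

    If do_x is given, X is fixed to that value for every unit, which
    mimics do(X = do_x) by cutting the Z -> X edge.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        z = rng.gauss(0, 1)
        x = do_x if do_x is not None else 0.8 * z + rng.gauss(0, 1)
        y = 1.5 * x + 2.0 * z + rng.gauss(0, 1)  # assumed causal effect of X: 1.5
        rows.append((z, x, y))
    return rows


def slope(xs, ys):
    """Least-squares slope of ys on xs (simple linear regression)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    var = sum((a - mx) ** 2 for a in xs)
    return cov / var


# Observational data: the X-Y association mixes the causal effect with confounding by Z.
obs = simulate(20000)
naive = slope([r[1] for r in obs], [r[2] for r in obs])

# Interventional data: mean of Y under do(X = 1) minus mean of Y under do(X = 0).
y_do1 = statistics.fmean([r[2] for r in simulate(20000, do_x=1.0, seed=1)])
y_do0 = statistics.fmean([r[2] for r in simulate(20000, do_x=0.0, seed=2)])
effect = y_do1 - y_do0

print(f"observational slope: {naive:.2f}")
print(f"interventional contrast: {effect:.2f}")
```

Under these assumed coefficients the population regression slope is about 2.48, while the interventional contrast is 1.5, so the simulated estimates should land near those two values and show how confounding inflates the observational association.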

Testable Implications of DAGs • 4:19
Explore how directed acyclic graphs yield testable d-separation implications for causal inference. Use conditional independence tests to assess data, discard incompatible graphs, and iteratively refine models.
R Interlude • 2:36
Learn how to use dagitty for causal discovery by building a graph, deriving d-separation implications, and testing conditional independencies with partial correlations to refine the model.
Causal Discovery • 5:58
The PC Algorithm • 17:28
Practical Considerations • 4:31
Explore practical considerations in causal discovery with directed acyclic graphs, including equivalence class limitations, PC algorithm complexity, and conditional independence tests for different data types, with FCI as an alternative when causal sufficiency fails.
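
The idea of checking d-separation implications with partial correlations, as described in the lectures above, can be sketched in a few lines. The example below is a hedged illustration in standard-library Python, not the dagitty workflow from the course: the two toy graphs (a chain X -> M -> Y and a collider A -> C <- B), the linear Gaussian data, and the helper names `pearson` and `partial_corr` are all assumptions made here.

```python
import math
import random


def pearson(a, b):
    """Sample Pearson correlation between two equal-length lists."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    va = sum((u - ma) ** 2 for u in a)
    vb = sum((v - mb) ** 2 for v in b)
    return cov / math.sqrt(va * vb)


def partial_corr(x, y, z):
    """First-order partial correlation of x and y given a single variable z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


rng = random.Random(42)
n = 5000

# Chain X -> M -> Y: X and Y are dependent, but d-separated given M.
x = [rng.gauss(0, 1) for _ in range(n)]
m = [xi + rng.gauss(0, 1) for xi in x]
y = [mi + rng.gauss(0, 1) for mi in m]
chain_marginal = pearson(x, y)
chain_partial = partial_corr(x, y, m)

# Collider A -> C <- B: A and B are independent, but conditioning on C opens the path.
a = [rng.gauss(0, 1) for _ in range(n)]
b = [rng.gauss(0, 1) for _ in range(n)]
c = [ai + bi + rng.gauss(0, 1) for ai, bi in zip(a, b)]
collider_marginal = pearson(a, b)
collider_partial = partial_corr(a, b, c)

print(f"chain:    r(X,Y) = {chain_marginal:.2f}, r(X,Y | M) = {chain_partial:.2f}")
print(f"collider: r(A,B) = {collider_marginal:.2f}, r(A,B | C) = {collider_partial:.2f}")
```

In the chain, the partial correlation given M should be close to zero, which is the implication a graph with that structure predicts; in the collider, the marginal correlation should be close to zero and becomes clearly negative once C is conditioned on. A graph whose implied independencies fail such checks would be a candidate for revision.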

Confounding Bias • 3:39
Explore confounding bias in causal data science with directed acyclic graphs, using back-door and front-door criteria, do-calculus, and identification tasks to recover causal effects from observational data.
Backdoor Adjustment • 10:20
Apply the backdoor criterion to identify valid adjustment sets that block spurious paths from X to Y, enabling identification and estimation of causal effects from observational data.
Frontdoor Adjustment • 3:53
Explain front-door adjustment in causal graphs, showing how a set Z intercepts all directed paths from X to Y, has no unblocked back-door path from X to Z, and has all back-door paths from Z to Y blocked by X.
Do-Calculus • 15:24
R Examples 1 • 29:39
Demonstrate practical causal inference in R using dagitty and ggdag to perform backdoor and front-door adjustment, do-calculus, and propensity-score weighting on DAGs with observed and unobserved factors.
Z-Identification • 15:12
Explore how z-identification extends instrumental variable ideas to identify causal effects in DAGs when X cannot be manipulated, using graphical criteria and do-calculus.
R Examples 2 • 16:20
Apply z-identification in R to identify the causal effect of X on Y in a DAG with unobserved confounders, using the aux.effect algorithm and surrogate experiments on Z.
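
To make the backdoor adjustment covered in this section concrete, here is a minimal sketch of the adjustment formula P(y | do(x)) = sum over z of P(y | x, z) P(z) on simulated binary data. It is standard-library Python rather than the dagitty and ggdag code used in the course, and the graph (Z -> X, Z -> Y, X -> Y), the probabilities, and the function names are assumptions chosen for illustration, with Z taken to be a valid adjustment set.

```python
import random


def simulate(n, seed=0):
    """Binary data from an assumed DAG: Z -> X, Z -> Y, X -> Y."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        z = int(rng.random() < 0.5)
        x = int(rng.random() < (0.8 if z else 0.2))      # Z makes treatment more likely
        y = int(rng.random() < 0.1 + 0.3 * x + 0.4 * z)  # assumed effect of X on Y: 0.3
        data.append((z, x, y))
    return data


def mean_y(data, x, z=None):
    """Estimate P(Y=1 | X=x), or P(Y=1 | X=x, Z=z) when z is given."""
    ys = [yy for zz, xx, yy in data if xx == x and (z is None or zz == z)]
    return sum(ys) / len(ys)


def backdoor_effect(data):
    """Adjusted risk difference: sum_z P(z) * [P(y | x=1, z) - P(y | x=0, z)]."""
    n = len(data)
    effect = 0.0
    for z in (0, 1):
        p_z = sum(1 for zz, _, _ in data if zz == z) / n
        effect += p_z * (mean_y(data, 1, z) - mean_y(data, 0, z))
    return effect


data = simulate(50000)
naive = mean_y(data, 1) - mean_y(data, 0)  # ignores the confounder Z
adjusted = backdoor_effect(data)           # blocks the path X <- Z -> Y

print(f"naive risk difference:    {naive:.2f}")
print(f"adjusted risk difference: {adjusted:.2f}")
```

With these assumed probabilities the naive contrast is about 0.54 in the population, while the adjusted contrast recovers the assumed effect of 0.3, so the two printed estimates should differ by roughly that margin.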

Selection Bias • 5:47
Apply directed acyclic graphs to diagnose and recover causal effects under selection bias, using selection diagrams (G_s) and do-calculus, while addressing collider bias and non-parametric methods.
Recovering from Selection Bias • 10:58
Explore recovering conditional and interventional distributions from selection bias using selection diagrams, d-separation, and do-calculus, with a practical two-step strategy and examples.
R Examples • 12:20

The Transportability Task • 10:16
Examine how selection diagrams compare source and target domains when causal knowledge is transported across structurally different domains, and determine when experimental or observational findings can be transported.
S-Admissibility and Do-Calculus • 12:25
Learn how s-admissibility and do-calculus enable transportability of causal effects across populations using selection diagrams and reweighting. See practical examples from economics and education illustrating admissibility and do-calculus applications.
Mz-Transportability • 8:41
Explore transportability in causal data science, including z-transportability via surrogate experiments, and meta-transportability to combine heterogeneous source studies using do-calculus and selection diagrams.
R Examples • 26:35

Requirements

Basic knowledge of probability and statistics
Basic programming skills would be an advantage

Description

This course offers an introduction to causal data science with directed acyclic graphs (DAGs). DAGs combine mathematical graph theory with statistical probability concepts and provide a powerful approach to causal reasoning. Originally developed in computer science and artificial intelligence, they have recently gained traction in other scientific disciplines as well (such as machine learning, economics, finance, health sciences, and philosophy). DAGs make it possible to check the validity of causal statements using intuitive graphical criteria that do not require algebra. In addition, they open up the possibility of fully automating the causal inference task with the help of special identification algorithms. As an encompassing framework for causal thinking, DAGs are becoming an essential tool for everyone interested in data science and machine learning.

The course provides a good overview of the theoretical advances made in causal data science over the last thirty years. The focus lies on practical applications of the theory, and students will be equipped to apply causal data science methods in their own work. Hands-on examples using the statistical software R guide students through the material. There are no particular prerequisites, but a good working knowledge of basic statistics and some programming skills are a benefit.

Who this course is for:

Data scientists
Economists
Computer scientists
People interested in machine learning

Course content

Introduction • 1 lecture • 16min

Structural Causal Models, Interventions, and Graphs • 6 lectures • 1hr 1min

Causal Discovery • 5 lectures • 35min

Confounding Bias and Surrogate Experiments • 7 lectures • 1hr 34min

Recovering from Selection Bias • 3 lectures • 29min

Transportability of Causal Knowledge Across Domains • 4 lectures • 58min

Outro • 1 lecture • 5min
