
Microeconometrics


This course introduces students to basic microeconometric methods. The objective is to learn how to make and evaluate causal claims. By the end of the course, students should be able to apply each of the methods discussed and to critically evaluate research based on them. Throughout the course we will make heavy use of Python and its SciPy ecosystem as well as Jupyter notebooks.

Counterfactual approach to causal analysis

  • Potential outcome model

  • Directed graphs

Please use the table of contents to navigate the rest of the material.

  1. Lectures
  2. Problem sets
  3. Handouts
  4. Special focus
  5. Resources
  6. Iterations

We collect a list of additional, more general, reading recommendations here.

Lectures

We provide the lectures in the form of Jupyter notebooks.

We briefly introduce the course and discuss some basic ideas about counterfactuals and causal inference. We touch on the two pillars of the counterfactual approach to causal analysis: we first explore the basic ideas of the potential outcome model and then preview the use of causal graphs. In addition, we provide a basic tutorial for some core tools used in data science.

We discuss the core conceptual model of the course. We initially discuss the individual-level treatment effect but then quickly scale back our ambitions to learn about population-level parameters instead. Then we turn to the stable unit treatment value assumption and address the challenges to the naive estimation of average causal effects in observational studies. We conclude with some examples that illustrate the flexibility of the potential outcome model beyond a simple binary treatment.
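As a minimal sketch of the naive estimation problem (all parameter values below are illustrative, not taken from the course materials), consider a simulation in which individuals with a high untreated outcome select into treatment, so the naive comparison of group means overstates the average treatment effect:

```python
import numpy as np

rng = np.random.default_rng(42)
n = 100_000

# Potential outcomes with a true average treatment effect of 1.
y_0 = rng.normal(0, 1, n)
y_1 = y_0 + 1

# Individuals with a high untreated outcome select into treatment,
# so treatment status is not independent of the potential outcomes.
d = (y_0 + rng.normal(0, 1, n) > 0).astype(int)

# Observed outcome follows the observation rule y = d * y_1 + (1 - d) * y_0.
y = d * y_1 + (1 - d) * y_0

ate = np.mean(y_1 - y_0)
naive = y[d == 1].mean() - y[d == 0].mean()
print(f"true ATE: {ate:.2f}, naive estimate: {naive:.2f}")  # naive is biased upward
```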

We explore the usefulness of causal graphs for the visualization of complex causal systems and the clarification of alternative identification strategies for causal effects. After establishing their basic notation and some key concepts, we link them to structural equations and the potential outcome model.
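As a small illustration, a causal graph can also be encoded and queried programmatically. The sketch below uses the networkx library (the graph and variable names are our own, not from the course materials) to represent a simple confounding structure:

```python
import networkx as nx

# A simple causal system: X confounds the effect of D on Y.
graph = nx.DiGraph([("X", "D"), ("X", "Y"), ("D", "Y")])

# The parents of a node correspond to its direct causes.
print(sorted(graph.predecessors("Y")))  # ['D', 'X']

# Next to the causal path D -> Y, there is a back-door path D <- X -> Y.
print(nx.is_directed_acyclic_graph(graph))  # True
```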

We study the basic conditioning strategy for the estimation of causal effects. We first link the concept of conditioning to directed graphs and start discussing the concept of a back-door path. Then we illustrate in a simulated example how collider variables induce a conditional association between two independent variables. Finally, we discuss the back-door criterion and work through some examples.
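A minimal version of such a collider simulation might look as follows (the data-generating process is illustrative): two independent variables become correlated once we condition on their common effect.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# x and y are generated independently ...
x = rng.normal(size=n)
y = rng.normal(size=n)

# ... but both cause the collider c.
c = x + y + rng.normal(size=n)

# Unconditionally, x and y are (close to) uncorrelated.
print(f"corr(x, y) = {np.corrcoef(x, y)[0, 1]:.3f}")

# Conditioning on c (here: selecting a narrow slice of its distribution)
# induces a negative association between x and y.
mask = np.abs(c) < 0.1
print(f"corr(x, y | c ~ 0) = {np.corrcoef(x[mask], y[mask])[0, 1]:.3f}")
```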

We review the fundamental concepts of matching such as stratification of data, weighting to achieve balance, and propensity scores. We explore several alternative implementations as we consider matching as conditioning via stratification, matching as a weighting approach, and matching as a data analysis algorithm. Throughout, we rely heavily on simulated examples to explore practical issues such as sparsity of data.
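As one sketch of the weighting approach (the data-generating process and the use of scikit-learn are our own choices, not prescribed by the course), inverse-probability weighting with an estimated propensity score recovers the average treatment effect when the confounder is observed:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
n = 50_000

# A single observed confounder drives both treatment and outcome.
x = rng.normal(size=n)
d = (x + rng.normal(size=n) > 0).astype(int)
y = 1.0 * d + 2.0 * x + rng.normal(size=n)  # true effect of d is 1

# Step 1: estimate the propensity score P(D = 1 | X).
features = x.reshape(-1, 1)
pscore = LogisticRegression().fit(features, d).predict_proba(features)[:, 1]

# Step 2: inverse-probability weighting balances the confounder across groups.
ate = np.mean(d * y / pscore) - np.mean((1 - d) * y / (1 - pscore))
print(f"IPW estimate: {ate:.2f}")  # close to 1
```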

We study the most common form of data analysis by looking at simple regression estimators. We first study them as a basic descriptive tool that provides the best linear approximation to the conditional expectation function. Then we turn to the more demanding interpretation of regression as a tool for determining causal effects. We contrast the issues of omitted-variable bias and selection bias. Finally, we conclude with an illustration of Freedman's paradox to showcase some of the challenges in applied empirical work.
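A short simulation (parameter values are illustrative) makes the omitted-variable bias concrete: the "long" regression that includes the confounder recovers the causal coefficient, while the "short" regression does not.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100_000

# Data-generating process: y = 1.0 * d + 2.0 * x + noise, where x also drives d.
x = rng.normal(size=n)
d = 0.8 * x + rng.normal(size=n)
y = 1.0 * d + 2.0 * x + rng.normal(size=n)

def ols(design, outcome):
    """Return least-squares coefficients for a design matrix with intercept."""
    design = np.column_stack([np.ones(len(outcome)), *design])
    return np.linalg.lstsq(design, outcome, rcond=None)[0]

# The long regression recovers the causal coefficient on d ...
print(f"long:  {ols([d, x], y)[1]:.2f}")

# ... while omitting x shifts it by (coefficient of x on d) * (effect of x on y).
print(f"short: {ols([d], y)[1]:.2f}")
```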

We revisit the issues of treatment effect heterogeneity and individuals selecting their treatment status based on gains unobserved by the econometrician. We lay the groundwork to estimate causal effects using instrumental variables, front-door identification with causal mechanisms, and conditioning estimators using pretreatment variables. We work through an elaborate panel data demonstration that illustrates the shortcomings of conditioning estimators in the presence of self-selection.
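The following simulation (our own construction, not the course's panel data demonstration) shows why conditioning fails when individuals select on unobserved gains: even within a narrow cell of the pretreatment variable, treated and untreated individuals differ systematically in their gains.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 100_000

x = rng.normal(size=n)           # observed pretreatment variable
gain = 1.0 + rng.normal(size=n)  # individual-specific treatment effect

# Individuals select into treatment partly on their unobserved gain.
d = (0.5 * x + gain + rng.normal(size=n) > 1.0).astype(int)

y_0 = 2.0 * x + rng.normal(size=n)
y = y_0 + d * gain

# Even within a narrow cell of x, the treated have above-average gains,
# so the conditioning estimator does not recover the average treatment effect.
cell = np.abs(x) < 0.1
naive_in_cell = y[cell & (d == 1)].mean() - y[cell & (d == 0)].mean()
print(f"true ATE: {gain.mean():.2f}, within-cell contrast: {naive_in_cell:.2f}")
```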

We study the use of instrumental variable estimators.
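As a minimal sketch (the data-generating process is illustrative), the instrumental variable estimator rescales the reduced-form relationship by the first stage and removes the bias from an unobserved confounder:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 100_000

u = rng.normal(size=n)                # unobserved confounder
z = rng.normal(size=n)                # instrument: relevant and excluded
d = 0.5 * z + u + rng.normal(size=n)
y = 1.0 * d + u + rng.normal(size=n)  # true effect of d is 1

# OLS is biased because d is correlated with the unobservable u.
ols = np.cov(d, y)[0, 1] / np.var(d)

# The IV (Wald) estimator divides the reduced form by the first stage.
iv = np.cov(z, y)[0, 1] / np.cov(z, d)[0, 1]

print(f"OLS: {ols:.2f}, IV: {iv:.2f}")
```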

We study front-door identification, which (under certain conditions) allows us to provide a causal account of the effect of D on Y.
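A linear version of the front-door adjustment can be sketched as follows (parameter values are illustrative): the effect of D on Y is the product of the effect of D on the mechanism M and the effect of M on Y, where the latter is estimated while conditioning on D.

```python
import numpy as np

rng = np.random.default_rng(5)
n = 200_000

u = rng.normal(size=n)                # unobserved confounder of D and Y
d = u + rng.normal(size=n)
m = 0.7 * d + rng.normal(size=n)      # mechanism: D affects Y only through M
y = 0.9 * m + u + rng.normal(size=n)  # true effect of D on Y is 0.7 * 0.9

def coef(design, outcome):
    """Return least-squares coefficients for a design matrix with intercept."""
    design = np.column_stack([np.ones(n)] + design)
    return np.linalg.lstsq(design, outcome, rcond=None)[0]

# Step 1: effect of D on M (no back-door path from D to M in this graph).
d_on_m = coef([d], m)[1]

# Step 2: effect of M on Y, conditioning on D to block M <- D <- U -> Y.
m_on_y = coef([m, d], y)[1]

print(f"front-door estimate: {d_on_m * m_on_y:.2f}")  # close to 0.63
```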

We now explore models in which we observe the same units at multiple points in time. Because of its similar structure, we also look at the sharp and fuzzy regression discontinuity design.
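As a minimal two-period sketch of such designs (assuming parallel trends; all values are illustrative), a difference-in-differences contrast nets out both the group gap and the common time trend:

```python
import numpy as np

rng = np.random.default_rng(6)
n = 50_000

group = rng.integers(0, 2, n)  # 1 = eventually treated
effect = 1.0                   # true treatment effect

# Outcomes in both periods share a group gap and a common time trend.
y_pre = 2.0 * group + rng.normal(size=n)
y_post = 2.0 * group + 0.5 + effect * group + rng.normal(size=n)

# Difference-in-differences removes the group gap and the common trend.
did = (y_post[group == 1].mean() - y_pre[group == 1].mean()) - (
    y_post[group == 0].mean() - y_pre[group == 0].mean()
)
print(f"DiD estimate: {did:.2f}")  # close to 1
```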

We study the regression discontinuity design in more detail. We discuss identification, issues in interpretation, and challenges to application based on the seminal review by Lee & Lemieux (2010). We reproduce and check the robustness of some of the results in Lee (2008).
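A bare-bones sharp RDD estimate (on illustrative simulated data, not Lee's) fits local linear regressions on each side of the cutoff and takes the difference of the two intercepts:

```python
import numpy as np

rng = np.random.default_rng(7)
n = 100_000

running = rng.uniform(-1, 1, n)                   # running variable, cutoff at 0
d = (running >= 0).astype(int)                    # sharp design: treated at the cutoff
y = 2.0 * d + 1.5 * running + rng.normal(size=n)  # true jump at the cutoff is 2

def local_linear(side, bandwidth=0.2):
    """Fit a line to observations within the bandwidth on one side of the cutoff."""
    mask = (np.abs(running) < bandwidth) & (d == side)
    slope, intercept = np.polyfit(running[mask], y[mask], deg=1)
    return intercept  # predicted outcome at the cutoff (running = 0)

print(f"RDD estimate: {local_linear(1) - local_linear(0):.2f}")
```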

We review the basic ideas behind the generalized method of moments (GMM) and implement some numerical examples. After introducing its basic setup, we discuss the GMM criterion function and how alternative estimation strategies are cast as GMM estimation problems. We then turn to the issues of identification and the role of the weighting matrix. Throughout, we practice the basic derivations involved in the GMM approach using an instrumental variables setup.
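A small numerical example (our own, with illustrative values) casts an overidentified instrumental variables problem as GMM: we minimize the criterion g(β)' W g(β), using the identity matrix as the weighting matrix.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(8)
n = 50_000

u = rng.normal(size=n)
z = rng.normal(size=(n, 2))           # two instruments, one parameter
d = z @ np.array([0.5, 0.3]) + u + rng.normal(size=n)
y = 1.0 * d + u + rng.normal(size=n)  # true coefficient is 1

def criterion(beta, weighting=np.eye(2)):
    """GMM criterion g(beta)' W g(beta) for the moments E[z * (y - d * beta)] = 0."""
    g = z.T @ (y - d * beta[0]) / n
    return g @ weighting @ g

result = minimize(criterion, x0=[0.0], method="BFGS")
print(f"GMM estimate: {result.x[0]:.2f}")
```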

Problem sets

We will work on several problem sets throughout the course.

We explore the potential outcome model using observed and simulated data inspired by the National Health Interview Survey. The accompanying data sets are available here.

We compare the consistency of regression and matching estimators using the LaLonde (1986) framework and the Current Population Survey data. The accompanying data sets are available here.

We practice regression discontinuity design (RDD) in the Lee (2008) framework. In particular, we illustrate the discontinuity at the cutoff with a local averages graph, estimate the treatment effect by local linear regression, and choose an optimal bandwidth by a cross-validation procedure. The accompanying data sets are available here.
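One simplified way to implement the cross-validation step (a sketch loosely following the one-sided prediction idea discussed in Lee & Lemieux (2010); implementation details differ in practice) is to pick the bandwidth that best predicts each observation from the observations below it on the same side of the cutoff:

```python
import numpy as np

rng = np.random.default_rng(9)
n = 2_000

running = rng.uniform(-1, 1, n)
d = (running >= 0).astype(int)
y = 2.0 * d + 1.5 * running + rng.normal(size=n)

def cv_error(bandwidth):
    """Mean squared error of one-sided local linear predictions."""
    errors = []
    for i in range(n):
        # Predict observation i from points strictly below it, within the
        # bandwidth, and on the same side of the cutoff.
        mask = (running < running[i]) & (running > running[i] - bandwidth) & (d == d[i])
        if mask.sum() < 5:
            continue  # too few neighbors for a stable local fit
        slope, intercept = np.polyfit(running[mask], y[mask], deg=1)
        errors.append((y[i] - (slope * running[i] + intercept)) ** 2)
    return np.mean(errors)

bandwidths = [0.1, 0.2, 0.4, 0.8]
best = min(bandwidths, key=cv_error)
print(f"cross-validated bandwidth: {best}")
```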

We explore the Generalized Roy framework and practice the estimation of marginal treatment effects using the open-source software package grmpy. Moreover, we simulate our own data set to conduct a Monte Carlo analysis and compare the performance of different estimators in the presence of essential heterogeneity. The accompanying files are available here and the data here.

Handouts

We curate a list of handouts that summarize selected issues.

Special focus

We discuss selected topics in more detail based on student demand.

We review issues in the construction of standard errors such as the potential bias of robust standard error estimates, clustering, and serial correlation based on the material presented in Angrist & Pischke (2009). We use this opportunity to discuss the research reported in Krueger (1999).

Resources

We provide some additional resources that are useful for our course work in general.

Textbooks

Datasets

The textbooks above provide an impressive amount of data from research articles. We collect these data sets in a central place here.

Tools

We maintain a list of useful resources around the tooling used in the course here.

Iterations

  • Summer Quarter 2020, Graduate Program at the University of Bonn, please see here for details.

  • Summer Quarter 2019, Graduate Program at the University of Bonn, please see here for details.
