Econ 305: Economics, Causality, and Analytics

These slides make up much of the content of the course. If you’d like to print the slides, open a lecture link, add ‘?print-pdf’ to the end of the URL, and then use your browser’s Print function to print them or save them as a PDF.

Lecture 1: A World of Data

This lecture introduces the structure of the class and shows the link between our underlying model of the world and the data that the world produces for us.

Lecture 2: Understanding Data

This lecture introduces the concept of the “data-generating process” and how we can use data to get at underlying truths.

Lecture 3: Introduction to R and RStudio

Here we start working in R and get familiar with RStudio and some basic commands. These are the building blocks we’ll be continuing to use later!

Lecture 4: Objects and Functions in R

R works by creating and manipulating objects. We cover the kinds of objects and the functions we can run them through here.

Lecture 5: Working with Data Part 1

We start getting comfortable with data frames and tibbles, the main way of handling data in R. And we’ll need to be handling data!

Lecture 6: Working with Data Part 2

In this lecture we start getting comfortable with manipulating data using the dplyr commands select(), filter(), and mutate().
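As a rough sketch of what those three commands do (not taken from the slides), here they are chained together on a tiny made-up data set. The `workers` data frame and its column names are invented for illustration.

```r
library(dplyr)

# Hypothetical toy data, invented for this example
workers <- data.frame(
  id     = 1:4,
  name   = c("Ana", "Ben", "Caz", "Dov"),
  sector = c("tech", "retail", "tech", "retail"),
  wage   = c(40, 15, 35, 18),
  hours  = c(40, 30, 45, 20)
)

result <- workers %>%
  select(name, sector, wage, hours) %>%  # keep only these columns (drops id)
  filter(sector == "tech") %>%           # keep only the rows for tech workers
  mutate(weekly_pay = wage * hours)      # add a new column built from old ones

print(result)
```

Each step takes a data frame and hands back a data frame, which is why the commands chain together so easily with `%>%`.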

Lecture 7: Summarizing Data Part 1

We don’t want to just look at raw data; we want to summarize it so we can make sense of it! Here we cover ways of describing the distributions of single variables.

Lecture 8: Summarizing Data Part 2: Plots

You didn’t think you were going to get away without doing a little data visualization, did you?

Lecture 9: Relationships Between Variables Part 1

Most of what we do in econometrics is about not just one variable, but how multiple variables interact. Let’s do some of that.

Lecture 10: Relationships Between Variables Part 2: Explaining

What does it mean to “explain one variable with another” and how do we do it? There are many ways, of course, but conceptually they all boil down to variations on some simple concepts we’ll be going over here.

Lecture 11: Simulating Data

If we want to understand whether our methods are actually capable of uncovering the data-generating process, we need a situation where we know what the answer is so we can check it. How about we just make up our own?
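A minimal sketch of the idea, assuming a very simple made-up data-generating process: we pick the true effect of `x` on `y` ourselves, generate data from it, and check whether our estimate lands near the truth. The seed, sample size, and variable names are arbitrary choices for illustration, not from the slides.

```r
set.seed(305)      # arbitrary seed so the results are reproducible

n <- 1000
true_effect <- 2   # we know the truth because we chose it

# The made-up data-generating process: y depends on x plus random noise
x <- rnorm(n)
y <- true_effect * x + rnorm(n)

# Slope of the best-fit line of y on x, computed as cov(x, y) / var(x)
estimate <- cov(x, y) / var(x)
print(estimate)
```

If the method works, `estimate` should come out close to `true_effect`. If it doesn't, we have learned something about the method without needing any real data.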

Lecture 12: Programming Midterm Review

This is just a recap lecture for the programming content of the course before the midterm.

Lecture 13: Causality

This lecture introduces the fundamental problem of identifying causal effects from observational data.

Lecture 14: Causal Diagrams

We will be representing our underlying models using causal diagrams. This lecture goes over what those are and how they work.

Lecture 15: Drawing Causal Diagrams

And of course we need to be able to get the models in our own heads down on paper too, right? We need to know how to draw our own causal diagrams.

Lecture 16: Back Doors

What keeps us from just being able to look at correlations in the data and call them causal? Why, it’s those nasty back doors. What are they and how can we find and close them?

Lecture 17: Causal Diagram Practice

We need a little muscle memory in order to be able to put models down on paper. Let’s do the work.

Lecture 18: Closing Back Doors: Controlling

An important part of causal inference is being able to close back doors by controlling for variables. How does that work? When should we do it, and when shouldn’t we? What are “colliders”?

Lecture 19: Fixed Effects

Now we’re starting to get into the standard econometrician’s toolbox. How can you possibly measure everything you need to control for? You can’t! But sometimes you can control for things that you can’t measure. That’s where fixed effects come in.
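One way to see how this can work is a small simulation, sketched here under made-up assumptions: each person has an unmeasured trait (`person_effect`) that affects both `x` and `y`, and we observe each person several times. Subtracting each person's own averages removes anything that is constant within a person, measured or not. All names and numbers below are invented for illustration.

```r
set.seed(19)  # arbitrary seed

n_people  <- 200
n_periods <- 5
id <- rep(1:n_people, each = n_periods)

# An unmeasured trait, constant within each person
person_effect <- rep(rnorm(n_people), each = n_periods)

# The trait drives both x and y; the true effect of x on y is 1
x <- person_effect + rnorm(n_people * n_periods)
y <- 1 * x + 2 * person_effect + rnorm(n_people * n_periods)

# Naive slope: contaminated by the unmeasured trait
naive <- cov(x, y) / var(x)

# Fixed effects: subtract each person's own mean from x and y, then compare
x_within <- x - ave(x, id)
y_within <- y - ave(y, id)
fe <- cov(x_within, y_within) / var(x_within)

print(c(naive = naive, fixed_effects = fe))
```

In this setup the naive slope overstates the effect, while the within-person version should land near the true value of 1, even though `person_effect` is never used in the estimate.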

Lecture 20: Untreated Groups

A major concept in causal inference is the idea that we’re comparing a “treated” and an “untreated” group. What do we mean by that, and is it possible to create those groups artificially using matching?

Lecture 21: Difference in Differences

One of the most important tools for econometricians is difference-in-differences, where you compare a treated and untreated group across time to isolate a causal effect.
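The arithmetic behind the name can be sketched with four group averages. The numbers below are invented for illustration, and reading the result as a causal effect relies on assuming the two groups would have changed by the same amount without treatment.

```r
# Hypothetical average outcomes, invented for this example
means <- data.frame(
  group   = c("treated", "treated", "untreated", "untreated"),
  period  = c("before", "after", "before", "after"),
  outcome = c(10, 15, 8, 10)
)

# Helper to look up one group's average in one period
get_mean <- function(g, p) {
  means$outcome[means$group == g & means$period == p]
}

# First difference: how much did each group change over time?
treated_change   <- get_mean("treated", "after")   - get_mean("treated", "before")
untreated_change <- get_mean("untreated", "after") - get_mean("untreated", "before")

# Second difference: how much more did the treated group change?
did_estimate <- treated_change - untreated_change
print(did_estimate)
```

Here the treated group changes by 5 and the untreated group by 2, so the difference-in-differences estimate is 3.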

Lecture 22: Regression Discontinuity

It’s hard to find a treatment and control group that are really the same except that one got treatment. One way you can do it is by finding two groups standing just next to each other when the scimitar cleaves juuuust between them.

Lecture 23: Comparison Groups Practice

I’ve thrown a lot of tools at you in the past few weeks. Let’s take a chance to slow down and try to apply them.

Lecture 24: Instrumental Variables

One last tool for the toolbox: instrumental variables. It’s like something out in the world did the randomization in our experiment for us!

Lecture 25: Instrumental Variables in Action

When should we use IV? When should we trust it? When should we use something else from the toolbox?

Lecture 26: Causal Inference Midterm Review

Just a review period before the causal inference midterm. We’ll revisit and practice the methods and material we’ve gone over so far.

Lecture 27: Explaining Better: Regression

We’ve been explaining one variable with another all term. One common high-octane way of doing this is with regression. We’ll cover regression conceptually, as a preview of future classes, and see what it can do for us to aid our causal inference.

Lecture 28: Explaining Better: Regression Part 2

We apply regression to all of our causal inference methods here, and even get a little peek at some other ways of explaining with machine learning.

A few handy PDFs with information on how to use certain tools in the class.

Base R Cheat Sheet

(Not by me) A nice cheat sheet with an overview of some of the common R commands you’ll be using.

RStudio Cheat Sheet

(Not by me) An overview of the RStudio layout, as well as some very handy hotkeys to learn.

dplyr Cheat Sheet

(Not by me) We’ll be manipulating data in this class using dplyr, which has lots of useful commands in it. I’ve got this one taped up in my office.

Relationships Between Variables Cheat Sheet

Reminders of the kinds of commands we’ll be using to look at the relationship between two variables, or to explain one variable with another.

Simulation Cheat Sheet

How to simulate data-generating processes and test the results. Simulation is a highly useful way of understanding how methods work, and it’s a good idea to use a simulation the first time you’re trying out a new method.

Causal Diagrams Cheat Sheet

How to build and use causal diagrams. We will of course be using causal diagrams heavily throughout the causal inference part of the course, and it’s important to know the details!

Dagitty Cheat Sheet

How to use the causal diagram building and analyzing website Dagitty.net. This goes along well with the dagitty video on the Videos page.

Instructor Guide

A guide for instructors who are familiar with econometrics but may be unfamiliar with this class’s approach, R, RMarkdown, or causal diagrams.