Course Content

The course is eight 200-minute sessions: a five-week core course plus a three-session capstone. Each week opens with a 30–60 minute “Before you come to class” checklist and ends with a deliverable due Sunday 23:55.

Core course

Week 1: LLMs, Harnesses & Setup

What LLMs are and how to work with them; the model-vs-harness distinction; closed/frontier vs open-weights models; the two course harnesses (Claude Code in terminal and desktop, VS Code + GitHub Copilot). Set up a working environment.

Which AI? See my take on current models.

Content

Week 2: From Raw Data to Report

Download real survey data, build a reproducible pipeline (explore → composite variables → aggregate → join an external API), and contrast an undirected “vibe report” with a carefully directed report.

Case study: World Values Survey

Content

Week 3: Data Wrangling & Debugging

Reviewing AI’s work: project instruction files (CLAUDE.md / agents.md), reusable skills, the three kinds of tests, git discipline, and safe autonomous execution.

Case study: Austrian Hotels

Content

Week 4: Econometrics with AI

AI as a research companion for causal identification — designing control sets (helpful vs adversarial), surfacing instruments via prompt chaining, and difference-in-differences.

Content

Week 5: Text as Data

Turning short texts into numbers — the NLP pipeline, four ways to score sentiment, humans vs AI, and a text→data pipeline that scales via an LLM API.

Case study: Football Manager Interviews

Content

Capstone Project (3 sessions)

Three-session team project on manager changes in football. One big research question, messy real-world data, AI as your primary teammate. — Project description

Session 1: Data collection & description

Pick a league, collect and document 10+ seasons of match and manager-change data.

Content

Session 2: From text to expectations (APIs)

Scrape news sources and use an LLM API to score expectations around each manager change.

Content

Session 3: Difference-in-Differences analysis + final presentation

Run a DiD on your panel, explore heterogeneity (including by expectation score), and present to the class.

Content