Welcome to QTM 385 - Experimental Methods

Lecture 01: Introduction

Danilo Freire

Department of Quantitative Theory and Methods
Emory University

Welcome to QTM 385
Experimental Methods! 🎉

Lecture overview

Today’s agenda

  • Introduction
  • Motivation
  • Class logistics
  • Assignments and grading
  • Set up for next class

Course materials

Course repository: https://github.com/danilofreire/qtm385

Course website: https://danilofreire.github.io/qtm385

All course materials, including lectures, code, discussions, assignments, and project guidelines, are available on our GitHub repository

We will use Canvas for course administration, including submitting assignments, accessing grades, and receiving announcements

Please take some time to get to know both platforms, and reach out if you have any questions.

Note

Please remember to check the course repository regularly for updates and announcements!

Nice to meet you! 😊

Instructor

A bit about me

Visiting Assistant Professor in the QTM

MA from the Graduate Institute Geneva, PhD from King’s College London, Postdoc at Brown University, Senior Lecturer at the University of Lincoln, UK

Research interests: computational social science, experimental methods, policy evaluation, political violence, organised crime

What about you? (time permitting!)

Now it’s your turn! 😉

Please introduce yourself! 👋

Tell us your name, your major, one thing you really like, and something we don’t know about your city or country! 🌍

My teaching philosophy

  • I love teaching and aim to make learning fun
  • Classes where students participate are the best
  • Hands-on activities help you learn better
  • I am always available to help and answer questions. And I mean it
  • Your feedback helps me improve my teaching. Please let me know what is working and what is not

Teaching assistants

  • The teaching assistants for this course will be determined soon (hopefully!). They will be available to help you with assignments, quizzes, and any questions you may have

  • They will also hold office hours and review sessions to help you prepare for quizzes and the final project

  • We are all here to help you! So feel free to ask questions during class, office hours, or via email 😃

Office hours

What for and what not for

  • What office hours are meant for:
    • Applying tools in practice
    • Discussion of issues related to the assignments
    • Boosting your knowledge of data science
  • What these sessions are not meant for:
    • Solving the assignments for you
    • Taking care of developing your coding or stats skills

Class etiquette

  • Learning a new topic can be tough and push you out of your comfort zone. If the course pace is too fast, let us know. I expect your commitment, but I do not want anyone to fail
  • You are all keen on experimental methods, but your backgrounds vary. That is great! Some sessions might be more engaging than others. If you are bored, help others or explore new data science areas
  • Always be respectful to each other
  • Ask questions whenever you need to!

Questions?

What are randomised controlled trials? 🎲🧪

Randomised controlled trials (RCTs): diagram

Source: Liakos et al (2024)

What are randomised controlled trials (RCTs)?

  • Randomised controlled trials (RCTs) are a type of scientific experiment that aim to reduce bias when testing a new treatment or intervention
  • In its simplest form, a RCT consists of a control group and a treatment group, with the treatment group randomly receiving the intervention and the control group not receiving it
  • They are often used in medicine, psychology, and social sciences to evaluate the effectiveness of new treatments or interventions
  • It is easy to take them for granted, but these trials have transformed our understanding of cause and effect
  • They are the best tool we have to illuminate what is unknown or uncertain, and to discern whether something works and how well it works

From xkcd

History of experiments

Francis Pretrarch

  • Petrarch (1394): “I solemnly affirm and believe, if a hundred or a thousand of men of the same age, same temperament and habits, together with the same surroundings, were attacked at the same time by the same disease, that if the one half followed the prescriptions of doctors of the variety of those practicing at the present day, and that the other half took no medicine but relied on Nature’s instincts, I have no doubt as to which half would escape

History of experiments

James Lind

  • In 1747, Scottish naval surgeon James Lind sought to cure scurvy during a sea voyage
  • Lind implemented the first (non-randomised) experiment comparing two treatments across six pairs of sailors
  • One effective treatment involved administering two oranges and one lemon daily to sailors
  • To ensure consistency, Lind aimed for similarity among participants: “The cases were as similar as I could have them. They all in general had putrid gums, the spots and lassitude, with weakness of their knees. They lay together in one place, being a proper apartment of the sick in the fore-hold; and had one diet common to all.

History of experiments

  • The first random assignment of treatment and control happened in Nuremberg, 1835, to assess homeopathic medicine (yes, it’s been ongoing for a while!)
  • In 1884, Peirce and Jastrow conducted the first psychological experiment using randomisation to study sensory perception
  • This approach later became standard practice in experimental psychology and social sciences
  • In 1947, the Medical Research Council carried out the first RCT to evaluate streptomycin for tuberculosis
  • Results showed streptomycin was effective, and the RCT became the gold standard for evaluating medical treatments

Recent examples: Banerjee, Duflo, and Kremer

Recent examples: Card, Angrist, and Imbens

Types of Experiments

  • Lab experiments: Conducted in controlled environments
  • Field experiments: Conducted in real-world settings
  • Natural experiments: Use naturally occurring events as treatments
  • Quasi-experiments: Use non-randomised designs


  • Why are natural/quasi-experiments not as reliable as RCTs?

Questions?

Causality 🔍🔗

Correlation versus causation

Source Tyler Vigen

Correlation versus causation

Source Tyler Vigen

Correlation versus causation

Source Tyler Vigen

Correlation versus causation

From xkcd

David Hume and causality

  • David Hume (1739): “We may define a cause to be an object followed by another, and where all the objects similar to the first are followed by objects similar to the second. Or in other words where, if the first object had not been, the second never had existed.
  • Three concepts worth noting:
    • Temporal sequence: The cause must precede the effect
    • Covariation: The cause and effect must vary together
    • Non-spuriousness: The relationship must not be due to a third variable

Five principles of causality

  • A more modern take involves 5 principles:
    • Relationship: There must be a relationship between the cause and effect
    • Direction (temporality): The cause must precede the effect
    • Nonconfounding: The relationship must not be due to a third variable
    • Mechanism: There must be a plausible mechanism linking the cause and effect
    • Appropriate level of analysis: The relationship must happen (and be measured) at the appropriate level of analysis

Causality terminology

  • Unit: The entity to which the treatment is applied
  • Treatment: The intervention applied to the unit
  • Outcome: The variable of interest
  • Potential outcomes: The outcome that would be observed under different treatments
  • Causal effect: The difference between potential outcomes
  • Average treatment effect: The average causal effect across units
  • Random assignment: The process of randomly assigning treatments to units
  • Confounding: The presence of a third variable that affects the treatment and outcome
  • Compliance: The extent to which units follow the treatment assigned
  • Heterogeneity: The extent to which the treatment effect varies across units
  • Internal validity: The extent to which the results are due to the treatment
  • External validity: The extent to which the results can be applied to other settings
  • Statistical power: The probability of detecting an effect if it exists

Questions?

Course overview and logistics 📖 📚 💻

Course objectives


  • Design rigorous experiments with proper randomisation procedures and sample sizing calculations
  • Create pre-analysis plans (PAP) and apply appropriate statistical methods for experimental analysis
  • Produce reproducible reports using Quarto
  • Evaluate experimental designs, identifying potential limitations
  • Manage unexpected data challenges, such as attrition and non-compliance
  • Understand ethical considerations in experimental design
  • Develop analytical skills through practical problem sets and discussion sessions

Textbooks

Required: Alan Gerber and Donald Green - Field Experiments

Lots of additional readings included in the syllabus 😊

Key tools

R or Python for data analysis, GitHub for version control, and Quarto for reproducible reports

Guides: https://danilofreire.github.io/qtm385/guides.html

Logistics

Course information

  • Syllabus: Available on our course repository and website. The course is designed to be self-contained. The syllabus includes links to slides and code we will use in class, along with recommended readings, and problem sets. I will upload slides throughout the term as we progress.

  • Schedule: Lectures are on Mondays and Wednesdays from 2:30 to 3:45 pm in the Psychology Building, Room 250

  • Office Hours: I’m available to meet you at any time. Please reach out some time in advance and we can schedule a meeting

  • Materials:

Assignments

How you will be graded

  • Problem sets: Ten of them, due on Wednesdays at 11:59 pm (50%)
  • Pre-Analysis Plan: Due mid-term (20%)
  • Final project: Due on the last day of class (30%)
  • Late policy: 10% off per day late
  • Collaboration: You can discuss assignments with your classmates, but you must write your own code and submit your own work. AI is allowed.
  • Academic integrity: Please refer to the syllabus for the university’s policy on academic integrity

Next class

  • Research design process: Formulate good research questions, develop strong hypotheses, and design experiments
  • Theory and experiments: Link theory to experiments and how to test hypotheses using RCTs (and which ones to use)
  • EGAP research design form: Think systematically about your research design and how to implement it
  • Preparation: Read the syllabus, familiarise yourself with the course website and repository, and read the texts and the slides for our next class 😉

and that’s all for today! 🎉

Questions?

Thank you very much for your attention! 🙏🏻

Have a great day! 😊