2025-05-26
larsvilhuber.github.io/summer-school-qicss-2025/presentation/presentation.html (PDF)
Assistant professor at the University of Toronto and a Research Principal at Opportunity Insights, a lab based at Harvard University.
Executive Director of the Labor Dynamics Institute and Senior Research Associate in the Economics Department at Cornell University, and the American Economic Association’s Data Editor.
2389 Manuscripts and 4440 Reports, approx. 4400 authors reached.
We are dedicated to providing a welcoming and supportive environment for all people, regardless of background or identity. By participating in this team, participants accept to abide by LDI ReplicationLab’s Code of Conduct and accept the procedures by which any Code of Conduct incidents are resolved. Any form of behaviour to exclude, intimidate, or cause discomfort is a violation of the Code of Conduct. In order to foster a positive and professional learning environment we encourage the following kinds of behaviours in all platforms and events:
If you believe someone is violating the Code of Conduct, we ask that you report it to us, QICSS/CIQSS (sponsor), or the CEA (host).
Reproducibility means obtaining consistent computational results using the same input data, computational steps, methods, code, and conditions of analysis.2
You.
👶🏻 Now:
👵🏽 Soon
Computational empathy: think of the next person to run this - It could be you in 5 years!
Lars Vilhuber, Connolly, M., Koren, M., Llull, J., & Morrow, P. (2022). A template README for social science replication packages (v1.1). Social Science Data Editors. https://doi.org/10.5281/zenodo.7293838
You can download the Word, LaTeX, or Markdown version of the README with lots of examples.
Presentation on “Self Checking Reproducibility” and its associated website
Guidance when (some) data are confidential: https://labordynamicsinstitute.github.io/reproducibility-confidential/
Guidance for citations: https://social-science-data-editors.github.io/guidance/addtl-data-citation-guidance.html
It is the policy of the American Economic Association to publish papers only if the data used in the analysis are clearly and precisely documented and access to the data and code is clearly and precisely documented and is non-exclusive to the authors.
Authors … must provide, prior to acceptance, the data, programs, and other details of the computations sufficient to permit replication
AER 1911 thanks to Stefano Dellavigna
None
… there is a grey zone:
https://social-science-data-editors.github.io/
Roth, Jonathan. 2022. “Pretest with Caution: Event-Study Estimates after Testing for Parallel Trends.” American Economic Review: Insights 4 (3): 305–22. DOI: 10.1257/aeri.20210236
Notes: “I exclude 43 papers for which data to replicate the main event-study plot were unavailable.”
de Chaisemartin, Clément, and Xavier D’Haultfœuille. 2020. “Two-Way Fixed Effects Estimators with Heterogeneous Treatment Effects.” American Economic Review 110 (9): 2964–96. DOI: 10.1257/aer.20181169
The results from various other papers are recomputed to empirically demonstrate the relevance of the proposed methods.
Why should I believe the third party?
Data Citation Synthesis Group: Joint Declaration of Data Citation Principles. Martone M. (ed.) San Diego CA: FORCE11; 2014 https://www.force11.org/group/joint-declaration-data-citation-principles-final
National Academies of Sciences, Engineering, and Medicine. 2019. Reproducibility and Replicability in Science. Washington, DC: The National Academies Press. https://doi.org/10.17226/25303.
Weeden, K. A. (2023). Crisis? What Crisis? Sociology’s Slow Progress Toward Scientific Transparency . Harvard Data Science Review, 5(4). https://doi.org/10.1162/99608f92.151c41e3