Tutorial reproducibility

Author

Lars Vilhuber

Published

September 1, 2025

Please fill out this survey on background and skills, to provide us with information on who you are. It will help us improve the presentation, and make it more relevant for you.

https://cornell.yul1.qualtrics.com/jfe/form/SV_bBqbJ9cSSJdOBw2

One of the following (or a linear combination):

Results from Survey 1

Last updated: 17 October 2025, 15:50

Education

Degree Frequency Frequency (Complete) Percent
graduate student (Ph.D.) 26 26 20.8
graduate student (Masters) 9 8 6.4
faculty member (tenured) 14 12 9.6
faculty member (untenured) 9 8 6.4
Other 60 51 40.8
NA 7 0 0.0
Note:
Percent is calculated as the proportion of completed responses.

Operating System

OS Frequency Percent
Linux 14 13.33
Mac 35 33.33
Windows 76 72.38
Note:
Completed responses only. Multiple mentions possible.

Intermediate or Advanced Programming Language Usage

Programming Language Frequency Percent
C 18 17.14
Matlab 15 14.29
Python 33 31.43
R 53 50.48
SAS 9 8.57
Stata 63 60.00
Note:
Completed responses only. Multiple mentions possible.

Command Line Usage

Feature Frequency Percent
Command line used often 64 60.95
Command line used once 23 21.90
System with >6 CPUs: often 40 38.10
System with >6 CPUs: once 13 12.38
Note:
Completed responses only.

One of the following (or a linear combination):

Survey 2

To identify what we should be speaking about on Day 2, please fill out this other survey:

https://cornell.yul1.qualtrics.com/jfe/form/SV_cNkhKL69K2Ob7o2

Results from Survey 2

Last updated: 17 October 2025, 15:50

Preferred_topic Frequency Percent
Advanced self-checking of reproducibility 10 34.48
Preserving raw survey data 6 20.69
Reproducibility when some data are confidential 11 37.93
NA 2 6.90

The last day serves to review the various materials, handle any questions not addressed (in detail) on the previous days, and discuss experiences and difficulties applying these principles in your work.

Topics may include:

Guidance

Some additional guidance can be found on the website of the Social Science Data Editors (URLs subject to change):

Examples of replication packages

With confidential data

Using containers:

Extra info