Rows: 17 Columns: 4
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (3): Timestamp, Topic, What is the name of the instructor?
dbl (1): How many browser tabs do you have open?
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Workshop reproducibility - Berlin
Session 1
Session 2
Session 3
Session 4
One of the following:
JIT Preservation of research output
Short presentation on concerns about ethics in data dissemination
In-depth tutorial on the use of Docker for economics research
Results
Topic | Frequency | Percent |
---|---|---|
Ethics and Privacy in Data Dissemination (discussion) | 3 | 17.65 |
In-depth Tutorial on Use of Containers | 8 | 47.06 |
Just-in-time Preservation of Research Data (hands-on) | 6 | 35.29 |
ggplot(data, aes(y=Frequency, x=Topic)) +
geom_bar(position='dodge', stat='identity')
Guidance
Some additional guidance can be found on the website of the Social Science Data Editors (URLs subject to change):
Examples of replication packages
With confidential data
- https://doi.org/10.3886/E154241V2 not only code, but faces the problem that IRS data cannot have variables revealed. Their workaround is not the same one as in this tutorial.
- https://doi.org/10.3886/E162581V1
Using containers:
- Kline et al (2024) “A Discrimination Report Card: primary replication package, with container specification, image on Docker Hub, and preserved image on Zenodo.
- Herbert et al (2024) “Reproduce to validate”: primary replication package, container specification and preserved image on Borealis.ca.
Extra info
- This document’s source: https://github.com/larsvilhuber/workshop-reproducibility-berlin
- Licensed under