Archiving Reproducible Research with R and Dataverse

Reproducible research and data archiving are increasingly important issues in research involving statistical analyses of quantitative data. This article introduces the dvn package, which allows R users to publicly archive datasets, analysis files, codebooks, and associated metadata in Dataverse Network online repositories, an open-source data archiving project sponsored by Harvard University. In this article I review the importance of data archiving in the context of reproducible research, introduces the Dataverse Network, explain the implementation of the dvn package, and provide example code for archiving and releasing data using the package.

Thomas J. Leeper

CRAN packages used

dvn, knitr, rfigshare, RCurl, XML, rfigshare, rdryad, OAIHarvester

CRAN Task Views implied by cited packages

WebTechnologies, Phylogenetics, ReproducibleResearch


Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".


For attribution, please cite this work as

Leeper, "The R Journal: Archiving Reproducible Research with R and Dataverse", The R Journal, 2014

BibTeX citation

  author = {Leeper, Thomas J.},
  title = {The R Journal: Archiving Reproducible Research with R and Dataverse},
  journal = {The R Journal},
  year = {2014},
  note = {},
  doi = {10.32614/RJ-2014-015},
  volume = {6},
  issue = {1},
  issn = {2073-4859},
  pages = {151-158}