rentrez: An R package for the NCBI eUtils API

The USA National Center for Biotechnology Information (NCBI) is one of the world’s most important sources of biological information. NCBI databases like PubMed and GenBank contain mil lions of records describing bibliographic, genetic, genomic, and medical data. Here I present rentrez, a package which provides an R interface to 50 NCBI databases. The package is well-documented, contains an extensive suite of unit tests and has an active user base. The programmatic interface to the NCBI provided by rentrez allows researchers to query databases and download or import particular records into R sessions for subsequent analysis. The complete nature of the package, its extensive test-suite and the fact the package implements the NCBI’s usage policies all make rentrez a powerful aid to developers of new packages that perform more specific tasks.

David J. Winter
2017-11-16

Supplementary materials

Supplementary materials are available in addition to this article. It can be downloaded at RJ-2017-058.zip

CRAN packages used

ape, RISmed, pubmed.mineR, rentrez, reutils, rotl, fulltext, treemap

CRAN Task Views implied by cited packages

Phylogenetics, Environmetrics, Genetics, Graphics, OfficialStatistics, WebTechnologies

Bioconductor packages used

genomes, RMassBank, MeSHSim, genbankr

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

Winter, "rentrez: An R package for the NCBI eUtils API", The R Journal, 2017

BibTeX citation

@article{RJ-2017-058,
  author = {Winter, David J.},
  title = {rentrez: An R package for the NCBI eUtils API},
  journal = {The R Journal},
  year = {2017},
  note = {https://doi.org/10.32614/RJ-2017-058},
  doi = {10.32614/RJ-2017-058},
  volume = {9},
  issue = {2},
  issn = {2073-4859},
  pages = {520-526}
}