The R Journal: article published in 2016, volume 8:1

Using DECIPHER v2.0 to Analyze Big Biological Sequence Data in R PDF download
Erik S. Wright , The R Journal (2016) 8:1, pages 352-359.

Abstract In recent years, the cost of DNA sequencing has decreased at a rate that has outpaced improvements in memory capacity. It is now common to collect or have access to many gigabytes of biological sequences. This has created an urgent need for approaches that analyze sequences in subsets without requiring all of the sequences to be loaded into memory at one time. It has also opened opportunities to improve the organization and accessibility of information acquired in sequencing projects. The DECIPHER package offers solutions to these problems by assisting in the curation of large sets of biological sequences stored in compressed format inside a database. This approach has many practical advantages over standard bioinformatics workflows, and enables large analyses that would otherwise be prohibitively time consuming.

Received: 2016-01-29; online 2016-05-01
CRAN packages: RSQLite
Bioconductor packages: Biostrings, DECIPHER

CC BY 4.0
This article is licensed under a Creative Commons Attribution 3.0 Unported license .

  author = {Erik S. Wright},
  title = {{Using DECIPHER v2.0 to Analyze Big Biological Sequence Data
          in R}},
  year = {2016},
  journal = {{The R Journal}},
  doi = {10.32614/RJ-2016-025},
  url = {},
  pages = {352--359},
  volume = {8},
  number = {1}