Benchmarking R packages for Calculation of Persistent Homology

Several persistent homology software libraries have been implemented in R. Specifically, the Dionysus, GUDHI, and Ripser libraries have been wrapped by the TDA and TDAstats CRAN packages. These software represent powerful analysis tools that are computationally expensive and, to our knowledge, have not been formally benchmarked. Here, we analyze runtime and memory growth for the 2 R packages and the 3 underlying libraries. We find that datasets with less than 3 dimensions can be evaluated with persistent homology fastest by the GUDHI library in the TDA package. For higher-dimensional datasets, the Ripser library in the TDAstats package is the fastest. Ripser and TDAstats are also the most memory-efficient tools to calculate persistent homology.

Eashwar V. Somasundaram , Shael E. Brown , Adam Litzler , Jacob G. Scott , Raoul R. Wadhwa

CRAN packages used

TDA, TDAstats, readr, ggplot2, scatterplot3d, recexcavAAR, deldir, magick, bench, pryr

CRAN Task Views implied by cited packages

Graphics, Multivariate, Phylogenetics, Spatial, TeachingStatistics


Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".


For attribution, please cite this work as

Somasundaram, et al., "The R Journal: Benchmarking R packages for Calculation of Persistent Homology", {The R Journal}, 2020

BibTeX citation

  author = {Somasundaram, Eashwar V. and Brown, Shael E. and Litzler, Adam and Scott, Jacob G. and Wadhwa, Raoul R.},
  title = {The R Journal: Benchmarking R packages for Calculation of Persistent Homology},
  journal = {{The R Journal}},
  year = {2020},
  note = {},
  doi = {10.32614/RJ-2021-033},
  volume = {13},
  issue = {1},
  issn = {2073-4859},
  pages = {184-193}