Variable Clustering in High-Dimensional Linear Regression: The R Package clere

Dimension reduction is one of the biggest challenges in high-dimensional regression models. We recently introduced a new methodology based on variable clustering as a means to reduce dimen sionality. We present here the R package clere that implements some refinements of this methodology. An overview of the package functionalities as well as examples to run an analysis are described. Numerical experiments on real data were performed to illustrate the good predictive performance of our parsimonious method compared to standard dimension reduction approaches.

Loïc Yengo , Julien Jacques , Christophe Biernacki , Mickael Canouil
2016-04-03

CRAN packages used

glmnet, spikeslab, clere, Rcpp, RcppEigen, lasso2, flare

CRAN Task Views implied by cited packages

MachineLearning, NumericalMathematics, Bayesian, HighPerformanceComputing, Survival

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

Yengo, et al., "Variable Clustering in High-Dimensional Linear Regression: The R Package clere", The R Journal, 2016

BibTeX citation

@article{RJ-2016-006,
  author = {Yengo, Loïc and Jacques, Julien and Biernacki, Christophe and Canouil, Mickael},
  title = {Variable Clustering in High-Dimensional Linear Regression: The R Package clere},
  journal = {The R Journal},
  year = {2016},
  note = {https://doi.org/10.32614/RJ-2016-006},
  doi = {10.32614/RJ-2016-006},
  volume = {8},
  issue = {1},
  issn = {2073-4859},
  pages = {92-106}
}