Dimension reduction is one of the biggest challenges in high-dimensional regression models. We recently introduced a new methodology based on variable clustering as a means to reduce dimen sionality. We present here the R package clere that implements some refinements of this methodology. An overview of the package functionalities as well as examples to run an analysis are described. Numerical experiments on real data were performed to illustrate the good predictive performance of our parsimonious method compared to standard dimension reduction approaches.
glmnet, spikeslab, clere, Rcpp, RcppEigen, lasso2, flare
MachineLearning, NumericalMathematics, Bayesian, HighPerformanceComputing, Survival
Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".
For attribution, please cite this work as
Yengo, et al., "Variable Clustering in High-Dimensional Linear Regression: The R Package clere", The R Journal, 2016
BibTeX citation
@article{RJ-2016-006, author = {Yengo, Loïc and Jacques, Julien and Biernacki, Christophe and Canouil, Mickael}, title = {Variable Clustering in High-Dimensional Linear Regression: The R Package clere}, journal = {The R Journal}, year = {2016}, note = {https://doi.org/10.32614/RJ-2016-006}, doi = {10.32614/RJ-2016-006}, volume = {8}, issue = {1}, issn = {2073-4859}, pages = {92-106} }