scmamp: Statistical Comparison of Multiple Algorithms in Multiple Problems

Comparing the results obtained by two or more algorithms in a set of problems is a central task in areas such as machine learning or optimization. Drawing conclusions from these comparisons may require the use of statistical tools such as hypothesis testing. There are some interesting papers that cover this topic. In this manuscript we present scmamp, an R package aimed at being a tool that simplifies the whole process of analyzing the results obtained when comparing algorithms, from loading the data to the production of plots and tables. Comparing the performance of different algorithms is an essential step in many research and practical computational works. When new algorithms are proposed, they have to be compared with the state of the art. Similarly, when an algorithm is used for a particular problem, its performance with different sets of parameters has to be compared, in order to tune them for the best results. When the differences are very clear (e.g., when an algorithm is the best in all the problems used in the comparison), the direct comparison of the results may be enough. However, this is an unusual situation and, thus, in most situations a direct comparison may be misleading and not enough to draw sound conclusions; in those cases, the statistical assessment of the results is advisable. The statistical comparison of algorithms in the context of machine learning has been covered in several papers. In particular, the tools implemented in this package are those presented in Demšar (2006); García and Herrera (2008); García et al. (2010). Another good review that covers, among other aspects, the statistical assessment of the results in the context of supervised classification can be found in Santafé et al. (2015).

Borja Calvo , Guzmán Santafé
2015-11-26

CRAN packages used

scmamp

CRAN Task Views implied by cited packages

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

Calvo & Santafé, "scmamp: Statistical Comparison of Multiple Algorithms in Multiple Problems", The R Journal, 2015

BibTeX citation

@article{RJ-2016-017,
  author = {Calvo, Borja and Santafé, Guzmán},
  title = {scmamp: Statistical Comparison of Multiple Algorithms in Multiple Problems},
  journal = {The R Journal},
  year = {2015},
  note = {https://doi.org/10.32614/RJ-2016-017},
  doi = {10.32614/RJ-2016-017},
  volume = {8},
  issue = {1},
  issn = {2073-4859},
  pages = {248-256}
}