The R Journal: accepted article

This article will be copy edited and may be changed before publication.

Simulating Noisy, Nonparametric, and Multivariate Discrete Patterns PDF download
Ruby Sharma, Sajal Kumar, Hua Zhong and Mingzhou Song

Abstract Requiring no analytical forms, nonparametric discrete patterns are flexible in representing complex relationships among random variables. This makes them increasingly useful for data-driven applications. However, there appears to be no software tools for simulating nonparametric discrete patterns, which prevents objective evaluation of statistical methods that discover discrete relationships from data. We present a simulator to generate nonparametric discrete functions as contingency tables. User can request strictly many-to-one functional patterns. The simulator can also produce contingency tables representing dependent non-functional and independent relationships. An option is provided to apply random noise to contingency tables. We demonstrate the utility of the simulator by showing the advantage of the FunChisq test over Pearson’s chi-square test in detecting functional patterns. This simulator, implemented as function simulate_tables in the R package FunChisq (version 2.4.0 or greater), offers an important means to evaluate the performance of nonparametric statistical pattern discovery methods.

Received: 2017-05-09; online 2017-10-25
CRAN packages: stats, rTableICC, FunChisq


CC BY 4.0
This article is licensed under a Creative Commons Attribution 4.0 International license.

@article{RJ-2017-053,
  author = {Ruby Sharma and Sajal Kumar and Hua Zhong and Mingzhou Song},
  title = {{Simulating Noisy, Nonparametric, and Multivariate Discrete
          Patterns}},
  year = {2017},
  journal = {{The R Journal}},
  url = {https://journal.r-project.org/archive/2017/RJ-2017-053/index.html}
}