The R Journal: article published in 2014, volume 6:1

PivotalR: A Package for Machine Learning on Big Data
Hai Qian, The R Journal (2014) 6:1, pages 57-67.

Abstract: PivotalR is an R package that provides a front-end to PostgreSQL and PostgreSQL-like databases such as Pivotal Inc.’s Greenplum Database (GPDB) (Pivotal Inc., 2013a) and HAWQ (Pivotal Inc., 2013b). When running on Pivotal Inc.’s products, PivotalR uses their parallel computation and distributed storage, giving the ordinary R user access to big data. PivotalR also provides an R wrapper for MADlib, an open-source library for scalable in-database analytics that offers data-parallel implementations of mathematical, statistical, and machine-learning algorithms for structured and unstructured data. PivotalR thus also enables the user to apply machine learning algorithms to big data.

Received: 2013-09-21; online 2014-05-27
CRAN packages: PivotalR, RPostgreSQL, shiny
CRAN Task Views implied by cited CRAN packages: WebTechnologies

CC BY 4.0
This article is licensed under a Creative Commons Attribution 3.0 Unported license.

  author = {Hai Qian},
  title = {{PivotalR: A Package for Machine Learning on Big Data}},
  year = {2014},
  journal = {{The R Journal}},
  doi = {10.32614/RJ-2014-006},
  url = {},
  pages = {57--67},
  volume = {6},
  number = {1}