The R Journal: article published in 2014, volume 6:1

Taming PITCHf/x Data with XML2R and pitchRx PDF download
Carson Sievert , The R Journal (2014) 6:1, pages 5-19.

Abstract XML2R is a framework that reduces the effort required to transform XML content into tables in a way that preserves parent to child relationships. pitchRx applies XML2R’s grammar for XML manipulation to Major League Baseball Advanced Media (MLBAM)’s Gameday data. With pitchRx, one can easily obtain and store Gameday data in a remote database. The Gameday website hosts a wealth of XML data, but perhaps most interesting is PITCHf/x. Among other things, PITCHf/x data can be used to recreate a baseball’s flight path from a pitcher’s hand to home plate. With pitchRx, one can easily create animations and interactive 3D scatterplots of the baseball’s flight path. PITCHf/x data is also commonly used to generate a static plot of baseball locations at the moment they cross home plate. These plots, sometimes called strike-zone plots, can also refer to a plot of event probabilities over the same region. pitchRx provides an easy and robust way to generate strike-zone plots using the ggplot2 package.

Received: 2013-01-17; online 2014-03-03
CRAN packages: pitchRx, XML2R, ggplot2, rgl, dplyr, mgcv, knitr
CRAN Task Views implied by cited CRAN packages: Graphics, Bayesian, Econometrics, Environmetrics, Multivariate, Phylogenetics, ReproducibleResearch, SocialSciences, SpatioTemporal, WebTechnologies

CC BY 4.0
This article is licensed under a Creative Commons Attribution 3.0 Unported license .

  author = {Carson Sievert},
  title = {{Taming PITCHf/x Data with XML2R and pitchRx}},
  year = {2014},
  journal = {{The R Journal}},
  doi = {10.32614/RJ-2014-001},
  url = {} ,
  pages = {5--19},
  volume = {6},
  number = {1}