Department of Biostatistics Seminar/Workshop Series

Data Wrangling and Automating Summary Reports For Ultramarathon Finishing Times

Jeff Horner, BS

Computer Systems Analyst, Department of Biostatistics, Vanderbilt University School of Medicine

The marathon is a long-distance running event, officially 42.195 kilometers (about 26.2 miles) in length. Ultramarathons are longer, ranging from 50 kilometers (31.0686 miles) to 100 miles and beyond. This talk will tackle scraping ultramarathon data from the web with the R packages XML and rjson, transforming the data with the plyr package, and automating summary reports with knitr and brew. We will explore parsing HTML with the XPath query language, converting JSON objects to data frames, combining brew syntax with knitr syntax, and automating the creation of knitr reports.

Notes from the Talk

Topic revision: r2 - 12 Feb 2014, JeffreyHorner
 
