Department of Biostatistics Seminar/Workshop Series
R - When Fast is Slow and Slow is Fast
Cole Beck
Computer Systems Analyst, Department of Biostatistics
Vanderbilt University School of Medicine
This seminar will introduce a few methods for faster data set aggregation. An experienced R user will be familiar with the tapply and aggregate functions, as well as iterating over the output of split. These methods may be difficult to learn, and worse, suffer from slow performance. It may be worth investing the time into some R packages.This discussion will give an overview of data.table, dplyr, inline and Rcpp.