Applied Predictive Modeling in R

At useR! 2014 I'll be doing a morning tutorial (09:15 – 12:45) on at Monday, 30 June.

This course will provide an overview of using R for supervised learning. The session will step through the process of building, visualizing, testing and comparing models that are focused on prediction. The goal of the course is to provide a thorough workflow in R that can be used with many different modeling techniques. A case study is used to illustrate functionality. Basic familiarity with R is required.

Outline:

  • Introduction (philosophy, case study)
  • General strategies (data splitting, resampling, model tuning)
  • Data pre-processing (transformations, variable filtering)
  • Conventions in R (function interfaces, consistency)
  • Building and tuning models (performance metrics, trees, kernel methods)
  • Other Topics (as time allows) (comparing models via resampling, parallel processing)

I will be giving away about a dozen copies of the book (via names in a hat). Originally the book was to be included but since the tutorial is free, that isn't possible.

Materials

Installing Packages:

Note to Windows users: The last two classes that I have taught where Windows users installed packages via RStudio, all the dependencies have not been installed. I recommend installing packages using base R.

install.packages(c("caret", "pROC", "rpart", "partykit", 
                   "C50", "kernlab", "AppliedPredictiveModeling",
                   "earth", "mda", "nnet"),
                dependencies = c("Depends", "Imports", "Suggests"))

You can test the install by loading caret via

library(caret)