• Applied Predictive Modeling
  • Table of Contents
  • Data
  • Figures
  • Computing
  • Errata
  • Blog
  • About
  • Links
  • Training

Applied Predictive Modeling Applied Predictive Modeling

  • Applied Predictive Modeling
  • Table of Contents
  • Data
  • Figures
  • Computing
  • Errata
  • Blog
  • About
  • Links
  • Training

DataCamp Course

Zachary Deane-Mayer, who collaborates on caret, has put together a DataCamp course on Machine Learning in R.

Zach and DataCamp did a great job of developing a course that is just right for people who are relatively new to R.

The really cool thing about the course is that their system lets you execute the R code as the instructors walk you through it (on their system). If you are taking the class on your work machine and can’t easily get R, this takes all the burden of getting an install together. You can just focus on the code and the reasons why you might approach a problem in that way.

(Disclosure: I contributed some videos to the course but I don't make any money from it or DataCamp)

Tagged with R, Training.

September 26, 2016 by Max Kuhn.
  • September 26, 2016
  • Max Kuhn
  • R
  • Training
  • 1 Comment
1 Comment

Boston R User Group Talk [UPDATE]

I'll be giving a talk on Boston R user Group on Thursday March 10th at 6:00 PM. The talk will be on rule-based regression models.

The image above is the training/test set split for the data that I'll be using the illustrate the models.

Slides can be found here. Someone took video and I will link to that if it is posted soemwhere.

Tagged with R, R User Group, Rule-Based Models, presentations.

March 4, 2016 by Max Kuhn.
  • March 4, 2016
  • Max Kuhn
  • R
  • R User Group
  • Rule-Based Models
  • presentations
  • 2 Comments
2 Comments

Nonclinical Statistics Book

Springer has a new book (Amazon) edited by Lanju Zhang that captures the breadth of problems for statistics in the pharmaceutical industry including: compound optimization, genetic testing, high-throughput screening, safety testing, and manufacturing. From the first chapter:

'We define “Nonclinical Statistics” as statistics applied to areas other than clinical trials in pharmaceutical/biotechnology industries'

It's a big book (~700 pages) and has a lot of great content. I was a section editor for the drug discovery chapters:

  • Statistical Methods for Drug Discovery (Max Kuhn, Phillip Yates, and Craig Hyde)
  • High-Throughput Screening Data Analysis (Hanspeter Gubler)
  • Quantitative-Structure Activity Relationship Modeling and Cheminformatics (Max Kuhn)
  • GWAS for Drug Discovery (Yang Lu, Katherine Perez-Morera and Rita M. Cantor)
  • Statistical Applications in Design and Analysis of In Vitro Safety Screening Assays (Lei Shu, Gary Gintant and Lanju Zhang)

I particularly like the chapter by Bill Pikounis and Luc Bijnens ("How To Be a Good Nonclinical Statistician") which has a lot of excellent general advice.

The back cover blurb is:

'This book serves as a reference text for regulatory, industry and academic statisticians and also a handy manual for entry level Statisticians. Additionally it aims to stimulate academic interest in the field of Nonclinical Statistics and promote this as an important discipline in its own right. This text brings together for the first time in a single volume a comprehensive survey of methods important to the nonclinical science areas within the pharmaceutical and biotechnology industries. Specifically the Discovery and Translational sciences, the Safety/Toxiology sciences, and the Chemistry, Manufacturing and Controls sciences. Drug discovery and development is a long and costly process. Most decisions in the drug development process are made with incomplete information. The data is rife with uncertainties and hence risky by nature. This is therefore the purview of Statistics. As such, this book aims to introduce readers to important statistical thinking and its application in these nonclinical areas. The chapters provide as appropriate, a scientific background to the topic, relevant regulatory guidance, current statistical practice, and further research directions.'

The hardcopy format will be released on February 14, 2016. I couldn't say whether you should gift the important person in your life with nonclinical statistics...

Tagged with Books, Nonclinical Statistics, Drug Discovery.

February 8, 2016 by Max Kuhn.
  • February 8, 2016
  • Max Kuhn
  • Books
  • Nonclinical Statistics
  • Drug Discovery
  • 2 Comments
2 Comments
Newer
Older

Applied Predictive Modeling Applied Predictive Modeling

Applied Predictive Modeling is a book on the practice of modeling when accuracy is the primary goal.

  • Applied Predictive Modeling
  • Table of Contents
  • Data
  • Figures
  • Computing
  • Errata
  • Blog
  • About
  • Links
  • Training
Applied Predictive Modeling
$85.45
By Max Kuhn, Kjell Johnson
Buy on Amazon