Some recent features/changes:
- The license was changed to GPL >= 2 to accommodate new code from the
GApackage. - New feature selection functions
gafsandsafswere added, along with helper functions and objects, were added. The package HTML was updated to expand more about feature selection. I'll talk more about these functions in an upcoming blog post. - A reworked version of
nearZerVarbased on code from Michael Benesty was added the old version is now callednzvthat uses less memory and can be used in parallel. sbfControlnow has amultivariateoption where all the predictors are exposed to the scoring function at once.- Several regression simulation functions were added:
SLC14_1,SLC14_2,LPH07_1andLPH07_2 - For the input data
xtotrain, we now respect the class of the input value to accommodate other data types (such as sparse matrices). - A function
update.rfewas added.
Recently added models:
- From the
adabagpackage, two new models were added:AdaBagandAdaBoost.M1. - Weighted subspace random forests from the
wsrfpackage was added. - Additional bagged FDA and MARS models were added (model codes
bagFDAGCVandbagEarthGCV) were added that use the GCV statistic to prune the model. This leads to memory reductions during training. - Brenton Kenkel added ordered logistic or probit regression to
trainusingmethod = "polr"fromMASS - The adaptive mixture discriminant model from the
adaptDApackage - A robust mixture discriminant model from the
robustDApackage was added. - The multi-class discriminant model using binary predictors in the
bindapackage was added. - Ensembles of partial least squares models (via the
enplspackage) was added. plsRglmwas added.- From the
kernlabpackage, SVM models using string kernels were added:svmBoundrangeString,svmExpoString,svmSpectrumString - The model code for
adahad a bug fix applied and the code was adapted to use the "sub-model trick" so it should train faster.