for the Johns Hopkins Bloomberg School of Public Health
Data Science Specialization -- Developing Data Products
via Coursera.org
May 2015
This Shiny App uses the Auto dataset in the ISLR package. It sets up a Random Forest prediction model to determine a car's acceleration based on several attributes.
Feature Selection and Cross Validation Fine-Tuning
Random Forest has lower RMSE and explains more of the data variability.
Model | RMSE | $R^2$ | RMSE sd | $R^2$ sd |
---|---|---|---|---|
Random Forest | 1.4171 | 0.7508 | 0.2087 | 0.0730 |
Bayesian Generalized Linear Model | 1.5208 | 0.7210 | 0.2411 | 0.0970 |
Generalized Additive Model | 1.4838 | 0.7121 | 0.1437 | 0.0850 |