From hours to minutes: accelerating your tidymodels code
Details
Our January Meetup will be a remote gathering; the Zoom link will be posted the week before the event. The passcode is 502920.
This talk demonstrates a 145-fold speedup in training time for a tidymodels machine learning pipeline through four small changes: switching a grid search on a canonical model to a more performant modeling engine, hooking into a parallel computing framework, moving to an optimized search strategy, and carefully defining the grid to search over. Together, these changes let users drastically cut the time needed to develop machine learning models with tidymodels without sacrificing predictive performance. The talk is based on Efficient Machine Learning with R, an in-progress book available at emlwr.org.
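For a rough idea of what the four changes can look like in code, here is a minimal sketch. It is not taken from the talk or the book. The simulated data, the LightGBM engine (via the bonsai package), the future backend, ANOVA racing from the finetune package, and the grid ranges are all assumptions chosen for illustration, and the sketch assumes a recent version of tune that supports parallelism through future.

```r
library(tidymodels)
library(bonsai)    # assumption: supplies the "lightgbm" engine for boost_tree()
library(finetune)  # assumption: supplies tune_race_anova() as the search strategy
library(future)    # assumption: parallel backend used by recent versions of tune

set.seed(1)
dat   <- sim_classification(1000)  # simulated data; the outcome column is `class`
folds <- vfold_cv(dat, v = 5)

# Change 1: a more performant modeling engine (illustrative choice).
spec <- boost_tree(trees = tune(), learn_rate = tune()) |>
  set_engine("lightgbm") |>
  set_mode("classification")

# Change 2: hook into a parallel computing framework.
plan(multisession, workers = 2)

# Change 4: a small, deliberately bounded grid (ranges are illustrative only).
grid <- grid_regular(
  trees(range = c(50, 300)),
  learn_rate(range = c(-3, -1)),  # log10 scale
  levels = 3
)

# Change 3: racing drops clearly poor candidates after the first few resamples
# instead of evaluating every candidate on every resample.
res <- tune_race_anova(spec, class ~ ., resamples = folds, grid = grid)

plan(sequential)  # return to sequential processing

show_best(res, metric = "roc_auc")
```

The speedup any one of these changes gives depends on the data, the model, and the hardware, so the 145-fold figure above should not be expected from this toy example.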
Data Science
Data Visualization
R Project for Statistical Computing
Statistical Computing
Statistical Modeling





