A Decade of Using R in Production: A CRUG Brunch and Learn Brew and View
Details
R Finance regular, Szilard Pafka, moved from LA to Texas during the pandemic and the LA R User Group went with him. It's now called Real Data Science USA - R Meetup.
https://www.meetup.com/Real-Data-Science-USA-R-Meetup
Their first (online) meetup covers a topic that is near and dear to all of our hearts: R in production! Many of us got the, apparently, well-targeted notification.
The conversation went like this...
Justin Shea (CRUG Organizer):
> https://www.meetup.com/Real-Data-Science-USA-R-Meetup/events/281403393/
Fri 11:10am
CRUG member:
> So… who’s hosting a watch party for the Production R stream?
Fri 4:21pm
Justin Shea
> Idk…haymarket???
Fri 5:49pm
CRUG and Haymarket discussed and (after an embarrassing number of date and time corrections) determined that we could host our very first Brunch and Learn watch party for this event at 11am on 11/11.
Note: Please do not sign up for the Real Data Science USA event if you plan on attending our event. They have a limited number of online seats!
CRUG encourages people to get vaccinated. We are monitoring daily infection and fatality counts (currently falling locally and nationally) and will adjust the parameters of this event based on guidance from the Illinois and Chicago Departments of Public Health.
A Decade of Using R in Production
with Gergely Daroczi, Senior Director of Data Operations at System1
R is often not taken seriously by Real Programmers, and considered a scripting language for ad-hoc data analysis, which is a great interactive tool used by data scientists, but the models (obviously written in notebooks) will eventually need to be productionized using a Real Programming Language. I am also biased, and tend to be similarly opinionated about R (although in the opposite direction), but instead of arguing that R is now ready for production, this talk will just rather share some use-cases and best practices on how I managed to use R in production from small to large companies for ETL, reporting, modeling, live-scoring, stream processing and many more.
Bio: Gergely Daroczi is an enthusiast R user and package developer, Ph.D. in Sociology, former assistant professor, currently working in the industry with 15 years of experience in data science, engineering, cloud infrastructure, and data operations at SaaS, fintech, adtech, and healthtech companies with a strong interest in building scalable data platforms on the top of R and AWS. He maintains a dozen CRAN packages related to using R in production (automated reports, logging, database connections, API integrations), co-authored a number of journal articles in social and medical sciences, and wrote a book on "Mastering Data Analysis with R".
