
What we’re about
This meetup is run by practitioners with 20+ years of experience in the field and it is dedicated to real world data science. Real Data Science is about methods you can apply in your practice/business applications and not about some overhyped technology that is experimental at best and broken at worse (and in fact of very limited use to you). Many technologies labelled "big data", "AI", "self driving" etc. have been more of the latter. This meetup is operating from Texas, but was originally founded in 2014 in Los Angeles, California and was part of DataScience.LA (2014-2021). We moved to Texas and changed our name in July 2021.
X-post from: https://www.meetup.com/real-data-science-usa-r-meetup/events/301717728/?eventOrigin=group_featured_event
PLEASE RSPV USING THE LINK ABOVE (OTHER MEETUP GROUP)!
Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPT
Szilard Pafka, PhD
Chief Scientist, Epoch
Gradient Boosting Machines (GBMs) have been considered (for more than a decade) as the best machine learning algorithm (in terms of highest accuracy) for supervised learning/predictive analytics with structured/tabular data (widely encountered in business applications). Are they still relevant in the age of Large Language Models (LLMs) and ChatGPT? This talk will tackle this very question and will also present updates to the author's GBM-perf benchmark (available on GitHub) including the newest results of training XGBoost and LightGBM on monster CPU servers (192 cores on c7i.metal-48xl and c7a.metal-48xl) and powerful GPUs (A100, H100).
Bio:
Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to analyze the risk of financial portfolios. He worked in finance, then in 2006 he moved to become the Chief Scientist of a tech company in Santa Monica, California doing everything data (analysis, modeling, data visualization, machine learning, data infrastructure etc). He was the founder/organizer of several meetups in the Los Angeles area (R, data science etc) and the data science community website datascience.la for more than a decade until he relocated to Texas in 2021. He is the author of a well-known machine learning benchmark on github (1000+ stars), a frequent speaker at conferences (keynote/invited at KDD, R-finance, Crunch, eRum and contributed at useR!, PAW, EARL, H2O World, Data Science Pop-up, Dataworks Summit etc.), and he has developed and taught graduate data science and machine learning courses as a visiting professor at two universities (UCLA in California and CEU in Europe).
LinkedIn: https://www.linkedin.com/in/szilard
Twitter: https://twitter.com/SzilardPafka/
Github: https://github.com/szilard/
Upcoming events (1)
See all- X-post: Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPTLink visible for attendees
X-post from: https://www.meetup.com/real-data-science-usa-r-meetup/events/301717728/?eventOrigin=group_featured_event
PLEASE RSPV USING THE LINK ABOVE (OTHER MEETUP GROUP)!
Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPT
Szilard Pafka, PhD
Chief Scientist, EpochGradient Boosting Machines (GBMs) have been considered (for more than a decade) as the best machine learning algorithm (in terms of highest accuracy) for supervised learning/predictive analytics with structured/tabular data (widely encountered in business applications). Are they still relevant in the age of Large Language Models (LLMs) and ChatGPT? This talk will tackle this very question and will also present updates to the author's GBM-perf benchmark (available on GitHub) including the newest results of training XGBoost and LightGBM on monster CPU servers (192 cores on c7i.metal-48xl and c7a.metal-48xl) and powerful GPUs (A100, H100).
Bio:
Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to analyze the risk of financial portfolios. He worked in finance, then in 2006 he moved to become the Chief Scientist of a tech company in Santa Monica, California doing everything data (analysis, modeling, data visualization, machine learning, data infrastructure etc). He was the founder/organizer of several meetups in the Los Angeles area (R, data science etc) and the data science community website datascience.la for more than a decade until he relocated to Texas in 2021. He is the author of a well-known machine learning benchmark on github (1000+ stars), a frequent speaker at conferences (keynote/invited at KDD, R-finance, Crunch, eRum and contributed at useR!, PAW, EARL, H2O World, Data Science Pop-up, Dataworks Summit etc.), and he has developed and taught graduate data science and machine learning courses as a visiting professor at two universities (UCLA in California and CEU in Europe).LinkedIn: https://www.linkedin.com/in/szilard
Twitter: https://twitter.com/SzilardPafka/
Github: https://github.com/szilard/Not open