Skip to content

Real Data Science USA (formerly LA Data Science) cover photo

Real Data Science USA (formerly LA Data Science)

3,516 members · Public group

Organized by Szilard Pafka and 2 others

Share:

Join this group

Join this group

What we’re about

This meetup is run by practitioners with 20+ years of experience in the field and it is dedicated to real world data science. Real Data Science is about methods you can apply in your practice/business applications and not about some overhyped technology that is experimental at best and broken at worse (and in fact of very limited use to you). Many technologies labelled "big data", "AI", "self driving" etc. have been more of the latter. This meetup is operating from Texas, but was originally founded in 2014 in Los Angeles, California and was part of DataScience.LA (2014-2021). We moved to Texas and changed our name in July 2021.

Tue, Aug 19, 2025, 11:00 PM UTCX-post: Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPT

Link visible for attendees

X-post: Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPT cover photo

X-post from: https://www.meetup.com/real-data-science-usa-r-meetup/events/301717728/?eventOrigin=group_featured_event

PLEASE RSPV USING THE LINK ABOVE (OTHER MEETUP GROUP)!

Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPT

Szilard Pafka, PhD
Chief Scientist, Epoch

Gradient Boosting Machines (GBMs) have been considered (for more than a decade) as the best machine learning algorithm (in terms of highest accuracy) for supervised learning/predictive analytics with structured/tabular data (widely encountered in business applications). Are they still relevant in the age of Large Language Models (LLMs) and ChatGPT? This talk will tackle this very question and will also present updates to the author's GBM-perf benchmark (available on GitHub) including the newest results of training XGBoost and LightGBM on monster CPU servers (192 cores on c7i.metal-48xl and c7a.metal-48xl) and powerful GPUs (A100, H100).

Bio:
Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to analyze the risk of financial portfolios. He worked in finance, then in 2006 he moved to become the Chief Scientist of a tech company in Santa Monica, California doing everything data (analysis, modeling, data visualization, machine learning, data infrastructure etc). He was the founder/organizer of several meetups in the Los Angeles area (R, data science etc) and the data science community website datascience.la for more than a decade until he relocated to Texas in 2021. He is the author of a well-known machine learning benchmark on github (1000+ stars), a frequent speaker at conferences (keynote/invited at KDD, R-finance, Crunch, eRum and contributed at useR!, PAW, EARL, H2O World, Data Science Pop-up, Dataworks Summit etc.), and he has developed and taught graduate data science and machine learning courses as a visiting professor at two universities (UCLA in California and CEU in Europe).

LinkedIn: https://www.linkedin.com/in/szilard
Twitter: https://twitter.com/SzilardPafka/
Github: https://github.com/szilard/

1 attendee

Not open

Upcoming events (1)

Tue, Aug 19, 2025, 11:00 PM UTCX-post: Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPT
Link visible for attendees
X-post from: https://www.meetup.com/real-data-science-usa-r-meetup/events/301717728/?eventOrigin=group_featured_event

PLEASE RSPV USING THE LINK ABOVE (OTHER MEETUP GROUP)!

Gradient Boosting Machines (GBMs) in the Age of LLMs and ChatGPT

Szilard Pafka, PhD
Chief Scientist, Epoch

Gradient Boosting Machines (GBMs) have been considered (for more than a decade) as the best machine learning algorithm (in terms of highest accuracy) for supervised learning/predictive analytics with structured/tabular data (widely encountered in business applications). Are they still relevant in the age of Large Language Models (LLMs) and ChatGPT? This talk will tackle this very question and will also present updates to the author's GBM-perf benchmark (available on GitHub) including the newest results of training XGBoost and LightGBM on monster CPU servers (192 cores on c7i.metal-48xl and c7a.metal-48xl) and powerful GPUs (A100, H100).

Bio:
Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to analyze the risk of financial portfolios. He worked in finance, then in 2006 he moved to become the Chief Scientist of a tech company in Santa Monica, California doing everything data (analysis, modeling, data visualization, machine learning, data infrastructure etc). He was the founder/organizer of several meetups in the Los Angeles area (R, data science etc) and the data science community website datascience.la for more than a decade until he relocated to Texas in 2021. He is the author of a well-known machine learning benchmark on github (1000+ stars), a frequent speaker at conferences (keynote/invited at KDD, R-finance, Crunch, eRum and contributed at useR!, PAW, EARL, H2O World, Data Science Pop-up, Dataworks Summit etc.), and he has developed and taught graduate data science and machine learning courses as a visiting professor at two universities (UCLA in California and CEU in Europe).

LinkedIn: https://www.linkedin.com/in/szilard
Twitter: https://twitter.com/SzilardPafka/
Github: https://github.com/szilard/
1 attendee
Not open

Past events (56)

Thu, May 26, 2022, 4:00 PM UTCBest Algo for Tabular/Business Data? Sorry, It’s Not Deep Learning…
This event has passed
65 attendees+60

Related topics