
What we’re about
SF - Bay Area Data Science Initiative is a community enabling the current and next generation of Data Scientists, Entrepreneurs, Innovators, Mathematical and Logical Thinkers to focus on their data skills and personal development.
We provide a supportive and positive learning environment in which members are empowered to develop a network and data skills, resulting in greater technical skills.
Do you want to improve your technical interpersonal skills in order to become effective general communicators and gain confidence to become future technical leaders? Then join the SF - Bay Area Data Science Initiative and become a part of our community.
We organize 'learn-by-doing' workshops, informative meetups in which members perfect their data and technical skills in a non-pressure atmosphere, as well as an enlightening environment. We host industry professional speakers and become a hub where the knowledge is being transferred in a cozy environment.
Upcoming events
1
•OnlineOutclassing Frontier LLMs at Extracting Information
OnlinePlease register using the zoom link to get a reminder:
https://us02web.zoom.us/webinar/register/3317557643700/WN_t-UvP6PUQrugTmkVDzIcvA
Accurately extracting information from documents has been a decades-old dream. Important workflows — from automated back-office processing to enterprise RAG — depend on it. LLMs promise to fulfill this dream but currently fall short: they hallucinate information, struggle with long documents, and break down on complex layouts. The solution: LLMs specialized in information extraction. In this talk, I will present: - **NuExtract** — the first LLM specialized in extracting structured information (JSON output) - **NuMarkdown** — the first reasoning OCR LLM (RAG-ready Markdown output). **These low-hallucination [open-source] models outclass frontier LLMs like GPT-5 and Gemini 2.5 while being orders of magnitude smaller**, enabling private usage. I will demonstrate the abilities of these LLMs, show how to use them at scale, and discuss what’s coming next in information extraction.
Agenda:
(PST) 11:50 am - 11:55 am Arrival and socializing and Opening
(PST) 11:55 am - 1:00 pm "Outclassing Frontier LLMs at Extracting Information"
(PST) 1:00 pm - 1:10 pm Q&A
About Etienne Bernard
Co-founder & CEO - Company: NuMind - Etienne is an AI/ML expert, co-founder & CEO of [NuMind](https://www.numind.ai) — a startup developing LLMs specialized in information extraction. Etienne holds a physics PhD (ENS+MIT), led the ML group of Wolfram Research, and wrote [Introduction to Machine Learning](https://www.amazon.com/Introduction-Machine-Learning-Etienne-Bernard/dp/1579550487?ref=d6k_applink_bb_dls&dplnkId=d2b94865-0ad9-46fb-94ae-43d55b9c3f64&dplnkId=561af1be-731e-4c4a-ba3e-ec80d95ff29d). Additional key points: - Spoke at 100+ events such as - ML Prague (keynote): https://www.mlprague.com/prague2018/ - SXSW: https://schedule.sxsw.com/2016/events/event_PP54827 - Invited guest on France 24: https://www.youtube.com/watch?v=jnVFExf1nbk - Authored
Please register using the zoom link to get a reminder:
https://us02web.zoom.us/webinar/register/3317557643700/WN_t-UvP6PUQrugTmkVDzIcvA9 attendees
Past events
277

