Unstructured Data in LLMs


Details
Hello Streamers!
You are invited to attend a meetup on Tuesday, October 24th starting at 5:30pm PDT hosted by our friends at Bay Area Unstructured Data group and co-sponsored by Zilliz and Confluent.
Please make sure you're registered by clicking here. GitHub will email you a form before the event, which you will need to complete for your access pass.
***PLEASE NOTE, VERY IMPORTANT***: If you are interested in attending this meetup, please register here. Registration for this meetup will not be open on this page.
https://www.meetup.com/unstructured-data-bay-area/events/296232232/
Agenda:
5:30 - 6:30 - Welcome/Networking/Registration
6:30 - 6:50 - Jack Retterer, DevRel, [Unstructured.io](https://unstructured.io/)
6:50 - 7:10 - Filip Haltmayer, SWE, Zilliz
7:10 - 7:30 - Rob Crystal-Ornelas, PhD, Data Analyst III, and Dimitrios Philliou, Growth Product Manager, GitHub
7:30 - 8:00 - Networking
Who should attend:
Anyone interested in talking and learning about Unstructured Data and LLM Apps.
When:
October 24th, 2023
5:30PM
Where:
This is an in-person event! To secure your spot, kindly register using this link (registering here on Meetup won't guarantee your entry). Thank you!
Co-sponsored by Zilliz and Confluent.
Tech Talk 1: How to extract Tabular Data from PDFs for RAG Pipelines
Speaker: Jack Retterer, DevRel, Unstructured.io
Abstract: As data processing and extraction tools continue to advance, extracting tables from PDFs has become increasingly important. Jack will describe the solutions his team is building for table extraction for RAG pipelines. As the team trains and tests a new model, he will discuss the challenges they have faced, why such a solution is needed, and their current progress.
Tech Talk 2: How to stream data with Kafka into your RAG application
Speaker: Filip Haltmayer, SWE, Zilliz
Abstract: In this talk, Filip will delve into the integration of Kafka (Confluent Cloud) with Zilliz Cloud (Hosted Milvus), showcasing how this synergy enables real-time data ingestion, parsing, and processing. The primary objective is to enhance the accuracy and performance of your Retrieval Augmented Generation (RAG) applications. Additionally, Filip will provide a comprehensive demonstration, offering practical insights into the capabilities and benefits of this integration.
Tech Talk 3: AI-Powered Pair Programming with GitHub Copilot
Speakers: Rob Crystal-Ornelas and Dimitrios Philliou, GitHub
Abstract: Learn how GitHub’s AI-powered code assistant can help you save time and innovate at any career stage. We’ll demo GitHub Copilot and show the newly beta-released Copilot Chat. Also, we’ll share tips on how to prompt Copilot to make the most of this AI pair-programmer.
