

What we’re about
To see all meetups in this group: https://www.meetup.com/pro/ibm-community/
This is an IBM sponsored Meetup group geared towards developers, data scientists, data engineers, and ALL Big Data, Cloud and AI enthusiasts. Our meetups provide an opportunity to work hands on with the solutions and tools in our Big Data portfolio and to interact and share knowledge with experts at IBM and in our extended community.Our Meetups typically include a presentation from a technical expert that serves as an introduction and overview for a specific Big Data technology, as well as an opportunity to collaborate with fellow developers and apply your Big Data skills.
For Hands-on Meetups, depending upon the location, we provide a cloud environments that you can run through the browser of your laptop or a VM image at NO cost to you. Our Meetups are FREE and sponsored by IBM.
Meetup topics include:
- Hadoop-based analytics
- Open Source Hadoop technologies
- SQL on Hadoop,
- R on Hadoop
- Real Time Analytics & Stream Computing
- Text Analytics
- Visualization and Discovery tools for Big Data
- Big Data App Development
- Big Data & Cloud
- NoSQL
- Internet of Things (IoT)
- Deep dives into the technologies that makes big data processing possible
- Anything and everything about Big Data---is there a topic you'd like covered? Let us know!
For more information about these topics and this Meetup group visit: ibm.biz/hadoopdev or follow @Hadoop_Dev on Twitter.
Join us and meet fellow practitioners, grow your skills, and get a hands on software development experience!
Sponsors
See allUpcoming events (2)
See all- Network event182 attendees from 111 groups hosting[AI Alliance] GneissWeb: Preparing High Quality Data for LLMs at ScaleLink visible for attendees
Details
IBM recently released GneissWeb, a large dataset yielding around 10 trillion tokens that caters to the data quality and quantity requirements of training Large Language Models. In this talk i will do a deep dive on the philosophy behind this dataset, where it stands w.r.t the other datasets out there, how to recreate it based on the tools IBM has open sourced and some performance figures with it. This talk will be a followup of the talk given by Shahrokh Daijavad of IBM in the month of March.Prerequisites
This is a follow up to our March 6, 2025 session “Introducing GneissWeb - a state-of-the-art LLM pre-training dataset“:- Check the GitHub show notes
- Re-watch on YouTube
About the presenter
Bishwaranjan Bhattacharjee (LinkedIn), Senior Technical Staff Member and Master Inventor, IBM ResearchAbout the AI Alliance
The AI Alliance is an international community of researchers, developers and organizational leaders committed to support and enhance open innovation across the AI technology landscape to accelerate progress, improve safety, security and trust in AI, and maximize benefits to people and society everywhere. Members of the AI Alliance believe that open innovation is essential to develop and achieve safe and responsible AI that benefit society rather than benefit a select few big players. - Network event98 attendees from 111 groups hosting[AI Alliance] Knowledge Graphs for Enterprise AILink visible for attendees
Description
Proscenium is an emerging library of composable glue focused on enterprise AI applications. It prioritizes support for domains where the creation and use of structured data is critical. This talk will walk through the construction an application for the legal domain built with Proscenium that involves:- Document enrichment
- Entity resolution
- Knowledge Graph construction
- Query handling
- Chat integration
Finally, we'll cover the future roadmap and ways that you could contribute!
Speaker Bio
Adam Pingel (LinkedIn, GitHub) is IBM's Head of Open Tools and Applications for the AI Alliance. Adam has been fascinated by AI and chatbots since playing with Racter in the 80’s. But the “winters” were long and frequent. The stars aligned in 2015 when he became VPE at Ravel Law. Ravel was building AI-powered tools for the legal industry and was working with Harvard Law School on what is now known as the Caselaw Access Project. After an acquisition by LexisNexis in 2017, he moved his family to Raleigh (in 2019) to take the role of CTO of Global Platforms. In 2022 he joined IBM to work on domain-specific applications of generative AI. Adam holds an MS and BS in CS from UCLA and Stanford, respectively. When not at a keyboard, he enjoys spending time with his family.About the AI Alliance
The AI Alliance is an international community of researchers, developers and organizational leaders committed to support and enhance open innovation across the AI technology landscape to accelerate progress, improve safety, security and trust in AI, and maximize benefits to people and society everywhere. Members of the AI Alliance believe that open innovation is essential to develop and achieve safe and responsible AI that benefit society rather than benefit a select few big players.
Past events (203)
See all- Network event244 attendees from 109 groups hosting[AI Alliance] Chat with your website using an LLMThis event has passed