Data Warehouse
Meet other local people interested in Data Warehouse: share experiences, inspire and encourage each other! Join a Data Warehouse group.
1,616
members
2
groups
Largest Data Warehouse groups
Newest Data Warehouse groups
Frequently Asked Questions
Yes! Check out data warehouse events happening today here. These are in-person gatherings where you can meet fellow enthusiasts and participate in activities right now.
Discover all the data warehouse events taking place this week here. Plan ahead and join exciting meetups throughout the week.
Absolutely! Find data warehouse events near your location here. Connect with your local community and discover events within your area.
Data Warehouse Events This Week
Discover what is happening in the next few days
Language Exchange
\
ストーリーハウスの言語交換会を再開します。
日本語または英語の上達を目指して、ぜひチャレンジしてみてください!ポジティブな気持ちで参加して、お互いに励まし合いながら、集中して成長しましょう。言語交換会は、日本語と英語の半分ずつに分けて行います。他の言語を開催できる場合は、時間を追加します。
使いたい教材があればご持参ください。また、必要に応じてメモが取れるように、ノートも必ずご持参ください。気軽におしゃべりしたり、会話に参加したりしてください。
参加費は無料ですが、カフェでご注文をお願いいたします。
We are reintroducing our Storyhouse Language Exchange.
Push yourself to improve Japanese or English! Come with a positive attitude, and encourage each other to focus and grow. We will split the language exchange into half, between Japanese and English. If additional languages are available, we will add time.
Bring any study materials you want to use. And be sure to bring writing utensils so you can take notes as you need. Feel free to talk and participate in conversations.
There is no participation fee, but please make your order at the cafe.
弁天町で英会話! Learn English with a teacher in Bentencho!
Location: Takoyaki cafe Misora
〒552-0001 大阪府大阪市港区波除5丁目3−9 玉野 1階
ミータップを使わない地元の人達はよく参加します!
**GOOGLE MAPS LINK**
https://maps.app.goo.gl/QwkUTJdBvXkWBEXo8
レベルは何でできる楽しい英語のチャットです!
All levels English chat (materials provided) in Bentencho!
The participation fee is 700 yen. Please also buy something from the restaurant (drinks start at 300 yen).
参加費は700円です。店でも飲み物か食事を注文してください。飲み物は300円〜。
This will be fun and we will also learn a lot!
他のアプリを使っている方も参加しています!
Data Warehouse Events Near You
Connect with your local Data Warehouse community
Data Cleansing using Data Bricks
The May Ohio North Database Training user group meeting will be held on **May 5th, 2026 at 5:00PM**. This will be a **HYBRID** event and we will be joined in person by **Sam Nasr.**
You're welcome to come meet in-person at our meeting location, the offices of Improving at
**[6000 Freedom Square Dr,](https://www.google.com/maps/place/Improving/@41.4004167,-81.6614462,17z/data=!3m2!4b1!5s0x8830e5b8255c5919:0xd8297060eb68fe04!4m6!3m5!1s0x8830dc7a0fe35dc9:0xbfc4710ecadfc5c!8m2!3d41.4004127!4d-81.6588713!16s%2Fg%2F1hm3hkqp3?entry=ttu&g_ep=EgoyMDI1MDQzMC4xIKXMDSoASAFQAw%3D%3D)**
**[Unit 110,](https://www.google.com/maps/place/Improving/@41.4004167,-81.6614462,17z/data=!3m2!4b1!5s0x8830e5b8255c5919:0xd8297060eb68fe04!4m6!3m5!1s0x8830dc7a0fe35dc9:0xbfc4710ecadfc5c!8m2!3d41.4004127!4d-81.6588713!16s%2Fg%2F1hm3hkqp3?entry=ttu&g_ep=EgoyMDI1MDQzMC4xIKXMDSoASAFQAw%3D%3D)**
**[Independence, OH 44131](https://www.google.com/maps/place/Improving/@41.4004167,-81.6614462,17z/data=!3m2!4b1!5s0x8830e5b8255c5919:0xd8297060eb68fe04!4m6!3m5!1s0x8830dc7a0fe35dc9:0xbfc4710ecadfc5c!8m2!3d41.4004127!4d-81.6588713!16s%2Fg%2F1hm3hkqp3?entry=ttu&g_ep=EgoyMDI1MDQzMC4xIKXMDSoASAFQAw%3D%3D)**
[Teams Link ](https://teams.microsoft.com/meet/287759659366576?p=kCaammjECnUCvZzEJv)if anyone needs it after RSVP-ing for in person.
If you would like to subscribe to our email list outside of Meetup, we have changed platforms recently and you will need to register [here in Kit ](https://ohio-north-data-training.kit.com/b8f036f615)instead to receive emails.
Agenda:
**5:00 PM EST**: Online and in-person meeting begins with a social hour. This is an unstructured hour where you can join us to catch up and meet other group members before the session starts. There will be food brought in for in-person attendees.
**6:00 PM EST**: Elections, announcements, followed by our feature presentation. See below for presentation details.
**7:30 PM EST**: Optionally after the main presentations, the in-person crowd may go out for snacks and drinks at a local establishment.
We hope to see you there!
Session Abstract
### Data Cleansing using Data Bricks
Machine Learning is highly dependent on adequate data. Not only does quantity matter, but more importantly quality. In this session we’ll cover how to build a custom automated process using Data Bricks. This will provide methods for cleaning data in a data lake using functions in Azure.
\*Please note, that we will be using Microsoft Teams for the online portion of this meeting. You may want to join a few minutes early to ensure you do not have any issues. If you are attending in person, there are large TVs at the office, and you do not need to bring a laptop or use Teams.
CBusData - Practical AI for Power BI Developers
Practical AI for Power BI Developers
A year ago, “agentic AI” was mostly hype for Power BI teams. Today, it deserves your undivided attention. For Power BI pros, there is now a real opportunity to reduce repetitive development work, accelerate delivery, and help developers do more, but only when strong DataOps practices are in place to make AI workflows effective.
This session is a no-nonsense introduction to effective AI patterns for Power BI and Fabric development. Along the way, we will make sense of the growing pile of terminology, including skills, plugins, hooks, and MCP. You will see examples of how modern AI tooling can help with development tasks across Power BI and Fabric, along with the prerequisites, guardrails, and DataOps principles needed to use it responsibly.
Whether you're burned out on AI hype or already using Copilot CLI daily, this session will show you the foundations that are finally making AI-assisted development genuinely useful.
Building Scalable Customer Identity Resolution Pipelines on AWS Using AI
Customer identity resolution becomes increasingly complex as organizations scale across multiple systems, regions, and data formats. Traditional rule-based approaches often fail to keep up with data variability, require constant manual tuning, and struggle with real-time processing needs.
This session presents a practical approach to building a scalable identity resolution pipeline using AWS services and modern AI techniques. The architecture combines data ingestion through Amazon S3 and AWS Glue, transformation pipelines using Spark on EMR, and machine learning models deployed via SageMaker for entity matching and standardization. Graph-based relationship modeling is implemented using Amazon Neptune to improve resolution accuracy by incorporating household and shared attribute context.
We will walk through how machine learning models can be used for name and address normalization, how intelligent blocking strategies improve matching efficiency, and how feedback loops can be introduced to continuously improve accuracy. The session also highlights how serverless components such as AWS Lambda can be used for orchestration and real-time processing.
**SPEAKER BIO**
Mosaic Syed is a Senior Data Engineering and Cloud Solutions Architect with over 20 years of experience designing and delivering scalable, secure, and high-performance data solutions across global enterprise environments.
https://www.linkedin.com/in/mosaic-basha-syed-92300856
**CALL FOR SPEAKERS**
Learn more: [https://www.awscolumbus.com/get-involved/](https://www.awscolumbus.com/get-involved/)
**THANK YOU** *VEEAM* for hosting our meetup! To learn more about *Veeam*, please visit their website: [https://www.veeam.com/](https://www.veeam.com/)
**DIRECTIONS**
8800 Lyra Dr #450 · Columbus, OH
go to 4th floor.
**Want to sponsor the pizza and/or bar tab?**
Please contact me if you would like to sponsor this meetup's pizza and/or bar tab: angelo@mandato.com
The Mythical Data Warehouse: The World Is Hybrid
Dear Illinois Prairie PUG members,
Our next meetup will take place in a **new location**. Please read **the whole announcement** carefully to note all the changes.
Our new meeting place is **Chicago Innovations at 1 W. Monroe**.
*Talk Title*: pg_lake: Unifying transactional and analytical data with Postgres
*Speaker*: Elizabeth Christensen, Snowflake
*Talk Description*
The data world used to be defined by “transactional" (OLTP) and “analytical” (OLAP) workloads, but we’ve asked ourselves, “Why not both?” A new series of extensions called pg_lake has just been released to connect Postgres to object storage and open table formats - like csv, Parquet, and Iceberg. pg_lake bridges the two worlds of transactional and analytical data for a vendor neutral, open source, unified data stack.
This talk will explore the pg_lake extension, including how to build it, and demos of using it with modern data workloads in object storage like Amazon S3. We’ll create simple data pipelines with no ETL and high performance analytics.
pg_lake is more than just an extension. It is the foundation to a fully unified data path with 100% open source tools backed by PostgreSQL, DuckDB, Iceberg, and Polaris.
*Agenda*
5:20 - doors open
5:30 - pizza arrives
6:00 - 6:10 - Hettie D. Opening remarks
6:10- 6:20 - Hettie D. Data warehouse history overview
6:20 - 6:50 Elizabeth C. pg_lake: Unifying transactional and analytical data with Postgres
6:50 - 7:00 Q&A
7:00 - 7:50 - Open discussion and networking
7:50 - 8:00 - Cleanup time and closing
*Notes about our new venue*.
I am delighted to have an independent venue for the first time since I am hosting the meetups. I hope that this will be our permanent home. And as you all know, with more freedom comes more responsibilities.
You will notice the change in the RSVP form.
We are thankful to our hosts and promise to be responsible.
We comply with the [PostgreSQL Code of Conduct](https://www.postgresql.org/about/policies/coc/).
We do not want to waste food, so please indicate your dietary preferences.
We will not close the RSVP in the morning of the meetup, but please do your best to RSVP in advance so that we can order the appropriate amount of food and drinks.
Also, all meetups will be hybrid - please don't forget to indicate whether you are attending in-person or virtually.
Thank you, and I look forward to seeing you all at our new location!
Hettie Dombrovskaya
Illinois Prarier PUG Organizer
COhPy Monthly Meeting
**Improving Office in Franklinton**
Physical location:
Improving Office
330 Rush Alley Suite #150
Columbus, OH 43215
Schedule:
6:00 p.m.: Socialize, eat, and drink. Improving will be providing pizza and beverages.
6:30 to 8:00 pm. Main meeting and presentation(s).
Topic: This month John Lairson will share a notebook describing the Alpaca (Paper) Trading API and discuss different algorithms for evaluating stock trades.
We meet on the last Monday of each Month. Presentations are given by members and friends of this group. If you would like to do a presentation (small or large) on a python topic, please contact Central OH Python at centralohpython@gmail.com
Building Agents with Microsoft Agent Framework
We will show how to build custom agents with Microsoft Agent Framework. Attendees will learn how to build and custom host agents when Microsoft Foundry is not a viable option.










