
Data Pipelines

Meet other local people interested in Data Pipelines: share experiences, inspire and encourage each other! Join a Data Pipelines group.
2,249 members
2 groups

Largest Data Pipelines groups

Frequently Asked Questions

Yes! Check out data pipelines events happening today here. These are in-person gatherings where you can meet fellow enthusiasts and participate in activities right now.

Discover all the data pipelines events taking place this week here. Plan ahead and join exciting meetups throughout the week.

Absolutely! Find data pipelines events near your location here. Connect with your local community and discover events within your area.

Data Pipelines Events This Week

Discover what is happening in the next few days

English Conversation in Bentencho! Learn English with a teacher in Bentencho!
Location: Takoyaki cafe Misora, 〒552-0001 大阪府大阪市港区波除5丁目3−9 玉野 1階 **GOOGLE MAPS LINK** https://maps.app.goo.gl/QwkUTJdBvXkWBEXo8 A fun English chat for all levels (materials provided) in Bentencho! Locals who do not use Meetup often join, and so do people who use other apps. The participation fee is 700 yen. Please also order a drink or some food from the restaurant (drinks start at 300 yen). This will be fun, and we will also learn a lot!
Language Exchange
We are reintroducing our Storyhouse Language Exchange. Push yourself to improve your Japanese or English! Come with a positive attitude, and encourage each other to focus and grow. We will split the session evenly between Japanese and English; if other languages can be hosted, we will add time. Bring any study materials you want to use, and be sure to bring a notebook and something to write with so you can take notes as needed. Feel free to chat and join in conversations. There is no participation fee, but please place an order at the cafe.

Data Pipelines Events Near You

Connect with your local Data Pipelines community

Building Scalable Customer Identity Resolution Pipelines on AWS Using AI
Customer identity resolution becomes increasingly complex as organizations scale across multiple systems, regions, and data formats. Traditional rule-based approaches often fail to keep up with data variability, require constant manual tuning, and struggle with real-time processing needs.
This session presents a practical approach to building a scalable identity resolution pipeline using AWS services and modern AI techniques. The architecture combines data ingestion through Amazon S3 and AWS Glue, transformation pipelines using Spark on EMR, and machine learning models deployed via SageMaker for entity matching and standardization. Graph-based relationship modeling is implemented using Amazon Neptune to improve resolution accuracy by incorporating household and shared attribute context.
We will walk through how machine learning models can be used for name and address normalization, how intelligent blocking strategies improve matching efficiency, and how feedback loops can be introduced to continuously improve accuracy. The session also highlights how serverless components such as AWS Lambda can be used for orchestration and real-time processing.
**SPEAKER BIO** Mosaic Syed is a Senior Data Engineering and Cloud Solutions Architect with over 20 years of experience designing and delivering scalable, secure, and high-performance data solutions across global enterprise environments. https://www.linkedin.com/in/mosaic-basha-syed-92300856
**CALL FOR SPEAKERS** Learn more: [https://www.awscolumbus.com/get-involved/](https://www.awscolumbus.com/get-involved/)
**THANK YOU** *VEEAM* for hosting our meetup! To learn more about *Veeam*, please visit their website: [https://www.veeam.com/](https://www.veeam.com/)
**DIRECTIONS** 8800 Lyra Dr #450 · Columbus, OH. Go to the 4th floor.
**Want to sponsor the pizza and/or bar tab?** Please contact me if you would like to sponsor this meetup's pizza and/or bar tab: angelo@mandato.com
CBusData - Practical AI for Power BI Developers
A year ago, “agentic AI” was mostly hype for Power BI teams. Today, it deserves your undivided attention. For Power BI pros, there is now a real opportunity to reduce repetitive development work, accelerate delivery, and help developers do more, but only when strong DataOps practices are in place to make AI workflows effective. This session is a no-nonsense introduction to effective AI patterns for Power BI and Fabric development. Along the way, we will make sense of the growing pile of terminology, including skills, plugins, hooks, and MCP. You will see examples of how modern AI tooling can help with development tasks across Power BI and Fabric, along with the prerequisites, guardrails, and DataOps principles needed to use it responsibly. Whether you're burned out on AI hype or already using Copilot CLI daily, this session will show you the foundations that are finally making AI-assisted development genuinely useful.
Building Agents with Microsoft Agent Framework
We will show how to build custom agents with Microsoft Agent Framework. Attendees will learn how to build agents and host them in a custom environment when Microsoft Foundry is not a viable option.
Best Practices for Building a Reliable Lakehouse
**Abstract:** This is a practical playbook for building a production-grade data lakehouse. It walks through foundational principles (naming conventions, least-privilege access, automated CI/CD testing) before diving into medallion architecture. Metadata-driven design patterns then show how configuration tables and dynamic notebook orchestration eliminate hard-coded pipelines. The deck covers star schema modeling, guidance on choosing between Spark, Pandas, and SQL, and data quality enforcement using DQX with YAML data contracts. Finally, we dive into security best practices and performance optimizations.
**Hosts:** Justin Shea, Mehdi Jeddi, Erik Pak, and Sou-Cheng Choi
**Talk Format:** This is a hybrid event. To attend online, join us on Zoom here at 6pm: https://iit-edu.zoom.us/j/89379230295?pwd=NdETyE5sdYuSrvsrBZXSBFkUESBVkg.1 Meeting ID: 893 7923 0295 Passcode: 5t5WYn
**Sponsor:** Adyen, UIC College of Business, and PyData Chicago co-host this event. UIC will provide the meeting site. Adyen will sponsor pizza and soft drinks for the onsite participants.
**Address:** University of Illinois - Chicago, Douglass Hall, Room 220, 705 S Morgan St, Chicago, IL 60607
**Logistics:** “UIC Douglass Hall” is recognized on Google Maps, which can guide you through campus. Once you arrive, proceed to the second floor, room number 220.
COhPy Monthly Meeting
**Improving Office in Franklinton**
Physical location: Improving Office, 330 Rush Alley, Suite #150, Columbus, OH 43215
Schedule:
6:00 p.m.: Socialize, eat, and drink. Improving will be providing pizza and beverages.
6:30 to 8:00 p.m.: Main meeting and presentation(s).
Topic: This month John Lairson will share a notebook describing the Alpaca (Paper) Trading API and discuss different algorithms for evaluating stock trades.
We meet on the last Monday of each month. Presentations are given by members and friends of this group. If you would like to give a presentation (small or large) on a Python topic, please contact Central OH Python at centralohpython@gmail.com
Building Momentum: From Ambiguity to Execution
**Building a great product is one thing; building momentum behind it is another.**
Join **Senior Product Manager Adam Solaiman** and **User Experience Manager Tyson Smith** for a behind-the-scenes look at what it takes to turn complex ideas into scalable products inside large organizations. In this session, they’ll share how teams move from ambiguity to execution: navigating organizational complexity, aligning stakeholders, and continuously evolving products after launch.
You’ll walk away with insights on how to:
* Build and sustain momentum across teams
* Adapt to changing priorities without losing direction
* Scale products thoughtfully in complex environments
Whether you're driving a new initiative or growing an existing product, this conversation will give you practical strategies to keep things moving forward. Come connect, learn, and swap stories with fellow product professionals.
Food and drinks will be provided by Switchbox, our generous host. Free parking will be available at the front and back sides of the Switchbox Office.
LLM Showdown: ChatGPT vs Claude vs Gemini vs Local Models
Join us for a practical, beginner-friendly guide to choosing the right large language model. We’ll compare major models like ChatGPT, Claude, Gemini, and Llama, talk about when to use hosted APIs versus local models, and break down the tradeoffs around cost, speed, quality, privacy, context windows, coding ability, and reliability. You’ll leave with a clearer mental model for picking an LLM based on your actual use case instead of hype, benchmarks, or brand names. No deep AI background required.
LOGISTICS AND PARKING: The talk starts at 7:00 PM. The first half hour is reserved for everyone to get set up and mingle. Free pizza and drinks! The cheapest parking option is to find street parking, which will only cost you a few bucks. Otherwise, park in the nearby veteran's museum lot for $8. It's highly recommended you avoid the nearby $15 garage parking.