Data Science
Meet other local people interested in Data Science: share experiences, inspire and encourage each other! Join a Data Science group.
31,488
members
32
groups
Largest Data Science groups
Newest Data Science groups
Frequently Asked Questions
Yes! Check out data science events happening today here. These are in-person gatherings where you can meet fellow enthusiasts and participate in activities right now.
Discover all the data science events taking place this week here. Plan ahead and join exciting meetups throughout the week.
Absolutely! Find data science events near your location here. Connect with your local community and discover events within your area.
Data Science Events This Week
Discover what is happening in the next few days
Language Exchange
\
ストーリーハウスの言語交換会を再開します。
日本語または英語の上達を目指して、ぜひチャレンジしてみてください!ポジティブな気持ちで参加して、お互いに励まし合いながら、集中して成長しましょう。言語交換会は、日本語と英語の半分ずつに分けて行います。他の言語を開催できる場合は、時間を追加します。
使いたい教材があればご持参ください。また、必要に応じてメモが取れるように、ノートも必ずご持参ください。気軽におしゃべりしたり、会話に参加したりしてください。
参加費は無料ですが、カフェでご注文をお願いいたします。
We are reintroducing our Storyhouse Language Exchange.
Push yourself to improve Japanese or English! Come with a positive attitude, and encourage each other to focus and grow. We will split the language exchange into half, between Japanese and English. If additional languages are available, we will add time.
Bring any study materials you want to use. And be sure to bring writing utensils so you can take notes as you need. Feel free to talk and participate in conversations.
There is no participation fee, but please make your order at the cafe.
Ukulele Club
Ukulele Club is a fun group for all ages and skill levels! There is no teacher, but we all help teach each other. Think of songs you would like to learn, and we can learn them together.
If you don't have an ukulele, we have extra ukuleles
When: *almost* Every Saturday, 10am-11:30am
Admission: food/drink order
Data Science Events Near You
Connect with your local Data Science community
CBusData - Practical AI for Power BI Developers
Practical AI for Power BI Developers
A year ago, “agentic AI” was mostly hype for Power BI teams. Today, it deserves your undivided attention. For Power BI pros, there is now a real opportunity to reduce repetitive development work, accelerate delivery, and help developers do more, but only when strong DataOps practices are in place to make AI workflows effective.
This session is a no-nonsense introduction to effective AI patterns for Power BI and Fabric development. Along the way, we will make sense of the growing pile of terminology, including skills, plugins, hooks, and MCP. You will see examples of how modern AI tooling can help with development tasks across Power BI and Fabric, along with the prerequisites, guardrails, and DataOps principles needed to use it responsibly.
Whether you're burned out on AI hype or already using Copilot CLI daily, this session will show you the foundations that are finally making AI-assisted development genuinely useful.
Building Scalable Customer Identity Resolution Pipelines on AWS Using AI
Customer identity resolution becomes increasingly complex as organizations scale across multiple systems, regions, and data formats. Traditional rule-based approaches often fail to keep up with data variability, require constant manual tuning, and struggle with real-time processing needs.
This session presents a practical approach to building a scalable identity resolution pipeline using AWS services and modern AI techniques. The architecture combines data ingestion through Amazon S3 and AWS Glue, transformation pipelines using Spark on EMR, and machine learning models deployed via SageMaker for entity matching and standardization. Graph-based relationship modeling is implemented using Amazon Neptune to improve resolution accuracy by incorporating household and shared attribute context.
We will walk through how machine learning models can be used for name and address normalization, how intelligent blocking strategies improve matching efficiency, and how feedback loops can be introduced to continuously improve accuracy. The session also highlights how serverless components such as AWS Lambda can be used for orchestration and real-time processing.
**SPEAKER BIO**
Mosaic Syed is a Senior Data Engineering and Cloud Solutions Architect with over 20 years of experience designing and delivering scalable, secure, and high-performance data solutions across global enterprise environments.
https://www.linkedin.com/in/mosaic-basha-syed-92300856
**CALL FOR SPEAKERS**
Learn more: [https://www.awscolumbus.com/get-involved/](https://www.awscolumbus.com/get-involved/)
**THANK YOU** *VEEAM* for hosting our meetup! To learn more about *Veeam*, please visit their website: [https://www.veeam.com/](https://www.veeam.com/)
**DIRECTIONS**
8800 Lyra Dr #450 · Columbus, OH
go to 4th floor.
**Want to sponsor the pizza and/or bar tab?**
Please contact me if you would like to sponsor this meetup's pizza and/or bar tab: angelo@mandato.com
May Ann Arbor R Users' Group Meeting - AI in Positron
**We'll have two meetings covering AI - this second one is for Positron**
We will review which AI features are linked in for Positron, and how to use them.
**Source documents:** the repository is public, at: https://github.com/BarryDeCicco/AARUG_2026_04_09_AI_In_Positron
You can download documents, or clone/fork the repository.
**Location:** The meetup will be at SPARK's ([https://annarborusa.org/](https://annarborusa.org/)) Ann Arbor site: [SPARK HQ (Ann Arbor)](https://urldefense.com/v3/__https://www.google.com/maps/place/Ann*Arbor*SPARK*Headquarters/@42.2792515,-83.7447708,17z/data=!3m1!4b1!4m5!3m4!1s0x883cae3effe193cb:0x5296a53db2a282bf!8m2!3d42.2792515!4d-83.7447708?hl=en-US__;Kysr!!HXCxUKc!3EEdBNXRJKknP6LCGkZetSuEsdtChFojQnOVitFUC5C0fyilqXEbiMstT9ajBR3Cw-55qoFkrElCjQvjMdTxUw$). There is on street parking, and parking at local structures.
**This will be a hybrid meeting - the Zoom session starts at 6:30 PM.**
**Time:** The doors will be open at 6:00, with pizza and beverages provided. We will have a meet-and-greet-and-pizza session, and then at 6:30 we'll have a presentation.
**Zoom information:**
https://us06web.zoom.us/j/6658850479?pwd=Snd1UmZTT3pjZktENlczUXh4SERwUT09&omn=89012971377
Passcode: 940392
**If you can't get in, please call me at: 734 223-3307**
**The github repository is at: https://github.com/BarryDeCicco/AARUG_2026_04_09_AI_In_Positron**
Quarterly Community Gathering
Join the Columbus AI community for our quarterly gathering — a casual, community-focused evening where everyone has a chance to share, learn, and connect. These open mic–style events give anyone in the community up to **5 minutes** to present a project, share a tool, pose a question, or offer a perspective on the evolving AI space.
No slides required — just a welcoming space to exchange ideas and keep the local AI conversation moving.
If you’d like to take the stage, message \*\*Chris (the organizer)\*\*with a **title and short description** of what you’d like to share.
Whether you’re deep in the field or just getting curious, come connect with others building and exploring AI in Columbus.
Sponsored by [Transform Labs](https://www.linkedin.com/company/transformlabs/)
Sign up also accessible via [Transform Labs Luma](https://luma.com/transformlabshq)
COhPy Monthly Meeting
**Improving Office in Franklinton**
Physical location:
Improving Office
330 Rush Alley Suite #150
Columbus, OH 43215
Schedule:
6:00 p.m.: Socialize, eat, and drink. Improving will be providing pizza and beverages.
6:30 to 8:00 pm. Main meeting and presentation(s).
Topic: This month John Lairson will share a notebook describing the Alpaca (Paper) Trading API and discuss different algorithms for evaluating stock trades.
We meet on the last Monday of each Month. Presentations are given by members and friends of this group. If you would like to do a presentation (small or large) on a python topic, please contact Central OH Python at centralohpython@gmail.com
Best Practices for Building a Reliable Lakehouse
**Abstract:** This is a practical playbook for building a production-grade data lakehouse. It walks through foundational principles — naming conventions, least-privilege access, automated CI/CD testing — before diving into medallion architecture. Furthermore, metadata-driven design patterns show how configuration tables and dynamic notebook orchestration eliminates hard-coded pipelines. The deck covers star schema modeling, guidance on choosing between Spark, Pandas, and SQL, and data quality enforcement using DQX with YAML data contracts. Finally, we dive into security best practices and performance optimizations.
**Host:** Justin Shea, Mehdi Jeddi, Erik Pak, and Sou-Cheng Choi
**Talk Format:** This is a hybrid event. To attend online, join us on Zoom here at 6pm:
https://iit-edu.zoom.us/j/89379230295?pwd=NdETyE5sdYuSrvsrBZXSBFkUESBVkg.1
Meeting ID: 893 7923 0295
Passcode: 5t5WYn
**Sponsor:** Adyen, UIC College of Business, and PyData Chicago co-host this event. UIC will provide the meeting site. Adyen will sponsor pizza and soft drinks for the onsite participants.
**Address:** University of Illinois - Chicago, Douglass Hall, Room 220, 705 S Morgan St, Chicago, IL 60607
**Logistics:** “UIC Douglass Hall” is recognized on Google Maps, which can guide you through campus. Once you arrive, proceed to the second floor, room number 220
TBD
**Important time note:** Please plan on arriving between 5:30 and 6:00 as the elevators lock after 6 and you'll need to message us and we'll need to come get you.
The building address is 4450 Bridge Park
The entrance is 6620 Mooney St, Suite 400
You will need to scan your ID at the door to get a visitor badge.
**Abstract**
TBD
**YouTube Link**
TBD


















