Big Data
Meet other local people interested in Big Data: share experiences, inspire and encourage each other! Join a Big Data group.
5,389
members
6
groups
Largest Big Data groups
Newest Big Data groups
Frequently Asked Questions
Yes! Check out big data events happening today here. These are in-person gatherings where you can meet fellow enthusiasts and participate in activities right now.
Discover all the big data events taking place this week here. Plan ahead and join exciting meetups throughout the week.
Absolutely! Find big data events near your location here. Connect with your local community and discover events within your area.
Big Data Events Today
Join in-person Big Data events happening right now
Designing for the Age of AI: When Should the System Decide?
Are you a creative professional wondering how your skills translate into the world of tech? ✨
The demand for designers who understand AI is growing fast — and your creative background is already a huge advantage. What you need is a structured way to apply it.
**Register for our Event Here:** [https://3vgdb.share.hsforms.com/2QKa2PiXaTYOqv3aKMLQ6Ig](https://3vgdb.share.hsforms.com/2QKa2PiXaTYOqv3aKMLQ6Ig)
Join us at WBS CODING SCHOOL for a hands-on rapid prototyping workshop where you'll design a real booking experience that balances human control with AI assistance.
**What you'll learn:**
* Human + AI Design Frameworks: Explore when automation helps users — and when it doesn't.
* Rapid Prototyping Skills: Learn to validate ideas quickly using simple, low-fidelity methods.
* UX in the Age of AI: Understand how the role of UX design is evolving right now.
* Your Creative Edge: Discover exactly where your creative skills fit into modern product design.
This session is perfect for creative professionals and re-skillers looking to break into UX/UI design. Food and drinks will be provided. 🤝
**What to bring:** Pen & paper, or a laptop if you'd like to use FigJam.
**Event Details:**
* Date: Wednesday, April 22
* Time: 18:00 – 20:00
* Location: Cuvrystraße 1, 10997 Berlin
\-\-\-
**About WBS CODING SCHOOL**
Empowering ambitious minds to conquer the tech world. Since 2019, we’ve been breaking barriers to make tech careers accessible to everyone. From AI, Data, UX/UI to AI Software Development, our expert-led courses teach skills that matter. But we’re more than just a school – we’re a thriving community where passion meets opportunity. Ready to launch your future? Join us and build the career you deserve.
Practical observations on performance, legitimacy, and leadership
**ESMT Insight Hour with Paul M. Achleitner**
**Schedule:**
5:30 p.m. - Registration opens
6:00 p.m. - Welcome from host, ESMT Berlin
6:05 p.m. - Talk of Paul M. Achleitner
6:30 p.m. - Discussion and Q&A
7:00 p.m. - Networking, drinks and nibbles will be provided
8:30 p.m. - Close
Experience used to accumulate slowly. Today, decisions travel instantly, legitimacy can evaporate overnight, and leaders operate under permanent public scrutiny.
So how do you lead and stay legitimate in a world that no longer feels stable?
In this ESMT Insight Hour, **[Paul M. Achleitner](https://www.linkedin.com/in/paul-achleitner/)** joins us for a moderated conversation on leadership, corporate legitimacy, and performance in times of geopolitical disruption, digital transparency, and shifting societal expectations.
Drawing on more than four decades across Goldman Sachs, Allianz, Deutsche Bank, and multiple supervisory boards, and reflecting the themes of his recent book, *Accelerate Your Experience: Principles for Success in a Fluid World*, Achleitner proposes three interconnected dimensions of modern management:
* **Legitimacy** – why companies must earn their place in society beyond financial performance
* **Performance** – how sustained value creation requires focus, discipline, and strategic clarity
* **Leadership** – what integrity, intellectual humility, and judgment mean in an era of permanent visibility
Rather than offering prescriptions, he distills experience into principles on stakeholder dynamics, public attention, geopolitical disruption, AI, crisis management, and the realities of “muddling through.”
The conversation will explore:
* Why legitimacy has become the decisive constraint for corporations
* How leaders balance long-term value creation with short-term scrutiny
* What performance really means when underperformance becomes normalized
* The evolving role of boards in turbulent environments
* Whether experience can, in fact, be “accelerated”
Join us for an hour of reflection on leadership under pressure and on what it takes to remain effective, credible, and resilient.
Moderated by **[Jörg Rocholl](https://esmt.berlin/person/jorg-rocholl)**[,](https://esmt.berlin/person/jorg-rocholl) President, ESMT Berlin.
**About the speaker**
**[Paul M. Achleitner](https://www.linkedin.com/in/paul-achleitner/)**[ ](https://www.linkedin.com/in/paul-achleitner/)is an investor, advisor, and corporate director with more than four decades of experience in global finance and European corporate governance. He began his career at Bain & Company before joining Goldman Sachs in 1988, where he worked in New York, London, and Frankfurt and became a partner in 1994. In 2000, he joined Allianz SE as CFO, serving on the board of management until 2012.
From 2012 to 2022, he served as chair of the supervisory board of Deutsche Bank. He has been a member of the supervisory board of Bayer AG since 2002 and has served on supervisory boards across financial services, pharmaceuticals, and industry. He continues to advise leading academic, policy, and business institutions in Europe and the US.
He holds a doctorate from the University of St. Gallen and has longstanding ties to Harvard Business School and WHU – Otto Beisheim School of Management, where he is professor emeritus.
**About the moderator**
**[Jörg Rocholl](https://esmt.berlin/person/jorg-rocholl)** is president of ESMT Berlin and Deutsche Bank professor in sustainable finance. He is chair of the advisory board of the German Federal Ministry of Finance and chair of the steering committee of the Global Network for Advanced Management (GNAM). Furthermore, he is a member of the supervisory board at RWE AG, a member of the board of the Schmalenbach Society, a member of acatech (German Academy of Science and Engineering), a research fellow at the Centre for Economic Policy Research (CEPR) and a research member of the European Corporate Governance Institute (ECGI).
Designing Responsible AI: Trust, Autonomy & Hidden Risk
*Please note, this event is free but please sign up here so we can inform the venue of your attendance: [https://uxdx.com/community/community-berlin-2026-04-22/](https://uxdx.com/community/community-berlin-2026-04-22/)*
**What's On**
**📋 6:00pm:** Registration and Networking
**🎤 6:30 pm:** Keynote: *"More Than a Matter of Principle: How design gets us beyond theoretical AI Ethics"* with **[Noah Fraenkel](https://www.linkedin.com/in/noah-fraenkel-4b17ba262/)**, GovTech Consultant at Possible
Drawing on his work at the intersection of government, technology, and public trust, Noah will challenge how we think about AI's role in systems that affect real people's lives.
**💬 7:00 pm:** Panel Discussion: The keynote ignites the conversation, the panel takes it forward.
Moderated by **[Wiebke Steffen](https://www.linkedin.com/in/wiebke-steffen/)**, Senior UX Researcher at GetYourGuide, our panelists bring perspectives from across the product spectrum:
**Noah Fraenkel** \- GovTech Consultant @ Possible
**[Ziyong Lin](https://www.linkedin.com/in/ziyong-lin-55714b25/)** \- Lead UX Researcher @ GetYourGuide
**[Jake Mongaya](https://www.linkedin.com/in/jakemongaya/)** \- Engineering Manager @ SumUp
**[Maxim Romanovsky](https://www.linkedin.com/in/maxim-romanovsky/)** \- VP \- Head of AI & Product Engineering @ Deutsche Bank
Expect honest takes, productive tension, and the kind of dialogue that actually moves the needle.
**🤝7:45 pm:** Networking
\* \* \* \* \* \* \* \* \* \*
**Everyone is racing to build with AI. But who is building it responsibly?**
There's a gold rush happening in tech right now and almost everyone is caught up in the excitement. AI features are being shipped at a rapid speed, products are being "enhanced" with machine intelligence overnight, and the pressure to move fast has never been higher.
But in the rush to innovate, some critical questions are getting buried: *Who's accountable when AI gets it wrong? How much should we hand over to automation and what do we silently lose when we do? And what are the risks we're not even seeing yet?*
These aren't hypothetical concerns. They're live, urgent, and sitting inside the products we're building right now.
**It's time to have the conversation that some teams are avoiding.**
UXDX Berlin is back! Join us at the GetYourGuide office in Prenzlauer Berg, where we will tackle the theme that every designer, researcher, and engineer needs to reckon with:
**"Designing Responsible AI: Trust, Autonomy & Hidden Risk"**
**Why This Event Matters**
The tools are powerful. The timelines are short. The consequences for users, for society, for trust in technology are long. If you're building products with AI, this isn't an optional conversation. It's the one you need to be in the room for.
**Come ready to think. Come ready to question. Come ready to build better.**
Thank you [GetYourGuide](https://www.getyourguide.com/) for supporting the event!
Bachata Dance Class Berlin - Beginners-+ Tanzkurs (Mittwoch)
Bachata (the dance of our generation) is a social couple dance like Salsa - no choreography. The community in Berlin is active daily - you will meet a lot of open-minded people and make new friends quickly!
Every Wednesday we have Bachata dance classes in Berlin - we are usually a group of around 20 people (95% beginners):
🕖 19:00 - class: Technique for Beginners+ (0-1+ years experience)
• No partner needed (we switch).
• You can start from zero in any of these classes!
📍 Adress: Klosterstraße 44 ([Google Maps Link](https://maps.app.goo.gl/5VGEqaVSy9xBCoHv9)) - near S+U Alexanderplatz
• Entry: Entrance A, code: 5928# - 5th floor (500), then black door.
• Price: 12€ per class.
✅ Dance teacher contact: [WhatsApp-group](https://chat.whatsapp.com/IWKPsid0cBDAUHCWKZNL2B) / [Instagram](https://instagram.com/mahir.bachata.berlin)
🇩🇪: Bachata (der Tanz unserer Generation) ist ein sozialer Paartanz wie Salsa - keine Choreographie. Die Bachata-Szene in Berlin ist groß und täglich aktiv - du wirst schnell viele offene Leute kennenlernen und neue Freundschaften schließen!
Jeden Mittwoch findet dieser Bachata Tanzkurs statt - wir sind meistens eine Gruppe von 20 Leuten (95% Anfänger):
🕖 19 Uhr - Kurs: Technik für Anfänger+ (0-1+ Jahre Erfahrung)
• Kein Partner nötig (wir wechseln).
• Du kannst in diesen Kursen jederzeit von null starten!
📍 Adresse: Klosterstraße 44 ([Google Maps Link](https://maps.app.goo.gl/5VGEqaVSy9xBCoHv9)) - Nähe S+U Alexanderplatz
• Zugang: Eingang A, Code: 5928# - drinnen Knopf drücken links an der Tür. 5. Etage (500).
• Preis: 12€ je Kurs.
✅ Tanzlehrer Kontakt: [WhatsApp-Gruppe](https://chat.whatsapp.com/IWKPsid0cBDAUHCWKZNL2B) / [Instagram](https://instagram.com/mahir.bachata.berlin)
Schreib uns gerne eine Nachricht, wenn du offene Fragen hast! Wir freuen uns auf dich ❤
Berlin Cybersecurity Social #25
This session is part of the Berlin Cybersecurity Social community’s monthly meetup, where security professionals, leaders, and enthusiasts come together to share knowledge and connect. So are you a cybersecurity professional looking to connect with like-minded professionals, share experiences, and make friends? Look no further!
**Transforming Classical Encryption to Post-Quantum Encryption for Financial Services**
Quantum computing is set to break today’s encryption, creating urgent risks for financial institutions. This session explores how organizations can transition from classical cryptography to post-quantum security using hybrid approaches, improved cryptographic visibility, and emerging compliance frameworks.
Revan cover practical steps to become quantum-ready without disrupting existing systems, with a focus on balancing security, compliance, and operational continuity.
**What you’ll learn:**
• Why the quantum threat is already relevant today, not just in the future
• How classical encryption (RSA, ECC) will be impacted
• Why hybrid post-quantum cryptography is the safest transition path
• The role of cryptographic visibility (CBOM) in managing risk
• How regulations like DORA and NIS2 are accelerating adoption
• How early adoption can strengthen security and create competitive advantage
**About the Speaker: Revan Ande** is a security researcher and founder of RivicQ, a startup focused on advancing next-generation cryptographic solutions. With a strong interest in post-quantum security and its impact on financial systems, he works on bridging the gap between classical encryption methods and quantum-resilient technologies.
He actively shares insights on emerging security challenges and innovations, with a focus on making post-quantum concepts practical and actionable for modern organizations.
**About Berlin Cybersecurity Social:** This meetup is open to cybersecurity professionals of all levels, from beginners to experts. Whether you're a seasoned pro or just starting your journey in the field, this event is the perfect opportunity to connect with others who share your passion for cybersecurity.
Maintainable Frontends for Symfony & Why every CMS needs MCP
Hello fellow Symfonians,
we are thrilled to announce the next user group meeting of the year, featuring two insightful talks that you **won't want to miss**!
This time we are hosted by c-base (Rungestrasse 20
10179 Berlin).
Come by and enjoy an evening of learning, networking, and socializing with fellow Symfony and PHP developers.
**Agenda:**
18:30: Doors open
19:00: Welcome and Introduction
19:20: **Talk 1: "Maintainable Frontends for Symfony" by Daniela Berger**
19:50: Break & Snack
20:00: **Talk 2: "Why every CMS needs MCP" by Roland Golla**
20:40: Socializing
**Talk Details:**
**Talk 1: "Maintainable Frontends for Symfony" by Daniela Berger**
In many web projects we find more backend devs than frontend devs, sometimes significantly so.
This often leads to devs with little frontend experience being asked to do frontend development anyway, and they often end up shaping their frontends in ways that will make their lives unnecessarily hard.
One common result are frontends that are functionally unmaintainable because each change - whether it is a new feature or a bugfix - requires implementing an exception to existing code, thus making the code increasingly and unnecessarily complex.
This is especially upsetting because with a bit of experience it is possible to implement light-weight frontend components with equally elegant CSS and JavaScript that are easy to extend and to adapt to new requirements without digging oneself into a deeper hole with each PR.
And since Symfony is making frontend development steadily more accessible to backend devs with Symfony US / Stimulus, this is a good time to take a closer look at \*how\* to structure these frontends.
This talk aims to a) introduce core frontend concepts that backend devs might not be aware of, b) show best frontend practices both on the code level and on the architecture level, c) show a few common mistakes that can be found in inexpertly implemented frontends, and c) introduce tools that will make consistent frontend development easier.
**Talk 2: "Why every CMS needs MCP" by Roland Golla**
Your content team already works in ChatGPT, Claude, and Gemini. Yet the content still lands in the CMS backend manually, clumsily, slowly. MCP changes that. One sentence in chat becomes a published article. No copy pasting, no formatting, no clicking.
And the content ranks. On Google, on ChatGPT, on Perplexity, on whatever comes next. Good content written by AI directly into the CMS goes live faster and gets found.
I show you the MCP plugin for Sulu CMS: open source, built on Symfony, ready to deploy. But this is not about implementation. It is about the three questions every content team must ask: Why does every website need MCP? What does good AI content look like? And why is conversational content management better than anything you click together in a backend?
MCP makes content creation as easy as chat. If you can write, you can publish. No CMS training, no workflow, no waiting.
Live on stage: a complete workflow from idea in chat to published article in Sulu CMS. Everything open source on GitHub.
Don't miss these insightful talks, engaging discussions, and networking opportunities. We can't wait to see you at the **April Symfony User Group**!"
If you have any questions or accessibility requirements, please reach out to us. Also, if your company wants to be the next host for the User Group, just let us know!
SHUFFLE DANCE CLASS
## DNA. presents ''SHUFFLE FUSION''
**A smooth Dance Style connected to Electronic Music**
This workshop guides You through the numerous facets of shuffle movements and shows You how to connect them into choreographies. Lead by our two amazing community members & Your new Shuffle Teachers Marcel & Dani Ospina - talented shuffle teachers that started shuffling at a very young age and leads this class with his experience connected to a lifetime of passion.
Wednesdays (weekly, regular)- Marcel & [Dani](https://www.instagram.com/daniospina20?igsh=bnBrM3BxY2xuNnll)
##### *Try something new and experience an energetic dance style full of passion and great rhythms.*
See Us Soon.
Love,
Marcel, Dani & DNA.
_________________________________
For more insights, follow us on IG: [@nightart.club](https://www.instagram.com/nightart.club)
\*Members of Dair Night Art e.V. and Subscribers of DNA. Art GbR.
Big Data Events This Week
Discover what is happening in the next few days
Building in Data: From AI Agents to Career Shifts | Data Engineering Meetup
Dear data-loving community, we’re excited to invite you to our next Meetup! This time in collaboration with [Spiced Academy](https://www.spiced-academy.com/en), who will be hosting us at their space.
Join us on April 23 in Berlin and bring all your questions! :)
**Tom Kaltofen: *"Building Deterministic Context Layers for AI Agents"***
[Tom Kaltofen](https://www.linkedin.com/in/tomkaltofen/) is an Engineer at [DHL Data & AI](https://www.linkedin.com/company/dhl-data-ai/) and a Creator at [mloda.ai.](http://mloda.ai.)
About his keynote:
"Data access and reuse are still unsolved, and AI agents are making it worse. This talk goes deeper into that problem: AI agents depend on reliable context (data, features, intermediate state) to make correct decisions. In practice, this context is tied to specific pipelines or infrastructure, leading to brittle systems when moving from prototype to production.
I'll show how a plugin-based approach lets teams build deterministic context layers: separating what you compute from how you compute it, so the same feature definitions work on a laptop and in production.
The talk includes a live demo where an AI agent discovers and queries data features programmatically. "
**Behnaz Derakhshani: *"What If I Started Today? Rethinking Career Switching in the AI Era"***
[Behnaz Derakhshani](https://www.linkedin.com/in/behnaz-derakhshani-63342775/) works as a Data Engineer at [Diconium](https://diconium.com). She shares her personal career shift from finance to data engineering, including the unfiltered challenges and lessons along the way.
About her keynote:
"Eight years ago, there was no AI to debug my logic, just documentation and Stack Overflow. Now as a Data Engineer, I’m breaking down the lessons learned from my finance to tech transition and why AI makes this the most exciting (and accessible) time to pivot."
✧ ✧ ✧
**What to expect:**
* Two expert talks and Q&A
* A welcoming atmosphere with networking opportunities
* Some snacks & drinks to fuel your thoughts :)
✧ ✧ ✧
**Timetable:**
* 18:30 - Event admission
* 18:50 - Welcome & Introduction
* 19:00 - Tom Kaltofen: *"Building Deterministic Context Layers for AI Agents"*
* 19:30 - 5 minutes break
* 19:35 - Behnaz Derakhshani: *"What If I Started Today? Rethinking Career Switching in the AI Era"*
* 20:05 - Snacks, Drinks & Networking
* 21:30 - End
✧ ✧ ✧
More on the **-> [applydata data engineering meetup page](https://applydata.io/data-engineering-meetup/)**.
**Our goal is to form a local data-loving community, so join us and let's talk data together!**
✧ ✧ ✧
*At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find [more information on data protection here](https://applydata.io/events/information-on-data-protection/).*
Spring Data Allergy
Spring is here. And so are the sniffles. Not from pollen, but from data pipelines, AI hype, and the occasional rogue JSON.
Join us for a spring edition of Data Berlin: an evening of talks on data and AI, good people, and plenty of networking. As always, free entry, drinks, and a few surprises.
**We collect the [RSVP on Luma](https://luma.com/pkad09m9).**
**Agenda**
18:30 – Doors open & networking
19:00 – Welcome remarks
19:10 – Talk 1 (TBD)
19:35 – Talk 2 (TBD)
20:00 – Break
20:10 – Talk 3 (TBD)
20:35 – Closing remarks & networking
**About our host:**
[SumUp](https://www.sumup.com/?utm_source=luma) is a leading global financial technology company with the vision to create a world where everyone can build a thriving business. SumUp supports over 4 million merchants in 36 markets across Europe, the U.S., Latin America and Australia, with tools and services merchants need to start, run, and grow their business, tailor-made for small, micro, and nano segments.
Committed to leveraging its success to make the world a better place, SumUp has pledged to donate 1% of future net revenues to environmental causes.
**Want updates or more info?**
Subscribe to our **newsletter**: [databerlin.substack.com](https://databerlin.substack.com).
Follow us on **[LinkedIn](https://www.linkedin.com/company/data-berlin?utm_source=luma).**
Looking for a job? [databerlin.net/jobs](https://databerlin.net/jobs).
Join our **[Slack](https://join.slack.com/t/data-brln/shared_invite/zt-2ued0xvdu-aihzi2cKEwD_6_KDRd_1ag?utm_source=luma)**[ community](https://join.slack.com/t/data-brln/shared_invite/zt-2ued0xvdu-aihzi2cKEwD_6_KDRd_1ag?utm_source=luma).
Data Science Retreat Demo Day #45
**Hey Berlin Data Folks!**
We’re looking forward to our first meetup of the spring on **23rd April**. Our Batch 45 participants have been working hard on their final projects, and they’re ready to share what they’ve built.
It’s a casual evening to see some practical AI applications, meet others in the local data community, and chat about new ideas in the field. We’ll also have some pizza and drinks to keep the conversation going. 🍕🍻
**Free to Attend!**
**Agenda:**
**17:30** \- Drinks and Networking
**18:00** \- Welcome & Introduction
Followed by Project Presentations
**Project Ideas:**
**1\. AI/ML – Office Posture Classification**
***Project by Mariami Marsagishvili***
An intelligent desktop application using a computer vision pipeline (TensorFlow/MoveNet) to monitor 17 body keypoints in real time. It calculates joint angles to detect slouching and provides automated stretch recommendations.
**2\. RestockVi — Smartphone\-Based Inventory Intelligence**
***Project by Vikhyati Singh***
A novel retail solution utilizing "Rectangular Lattice Gap Detection" (RLGD). This identifies out-of-stock items via smartphone scan without requiring pre-trained product datasets, making high-end inventory AI accessible to independent stores.
**3\. Fraud Eye — Intelligent Verification for Insurance**
***Project by Juliya Sebastian***
An advanced verification layer that detects AI-generated or manipulated images in insurance claims. It analyzes pixel-level consistency and physical plausibility (lighting/reflections) to combat sophisticated digital fraud.
**4\. MigraineChat — Voice\-First LLM Health Logging**
***Project by Isabella Boux and Maxim Smirnov***
A voice-to-data system that uses an LLM-powered extraction pipeline to transform unstructured speech into structured longitudinal health records, enabling predictive modeling for personal triggers.
**19:30** \- Open for networking
**20:30** \- Wrap up
We have limited seat so please RSVP soon. See you all at the event.
CorrelAid Berlin – Monthly Stammtisch ☕📊
Hi everyone,
Our next CorrelAid Berlin in-person meetup is coming up – this time with a special theme: **Data Journalism** 📰📊
🗓️ When: Thursday, April 23, 2026
🕠 Time: 18:30 \~ 20:00
📍 Where: Café Milagro, Bergmannkiez, Kreuzberg
We’ll have a casual chat about how data is used in journalism, look at inspiring examples, and share ideas on how data skills can help tell better stories.
Whether you’re already involved in projects or just curious about CorrelAid Berlin and data journalism, you’re very welcome to join – grab a drink, meet others, and exchange ideas.
Looking forward to seeing you there!
CorrelAid Berlin team 💙
April 24 - Berlin AI, ML and Computer Vision Meetup
Join our in-person meetup on April 24th to hear talks from experts on cutting-edge topics across AI, ML, and computer vision.
**[Register to reserve your seat.](https://voxel51.com/events/berlin-ai-ml-and-computer-vision-meetup-april-24-2026)** Space is limited!
**Date, Time and Location**
Apr 24, 2026
5:30 PM - 8:30 PM
[MotionLab](https://motionlab.berlin/)
Bouchéstraße 12/Halle 20
12435 Berlin
**Kaputt: A Large-Scale Dataset for Visual Defect Detection**
We present a novel large-scale dataset for defect detection in a logistics setting. Recent work on industrial anomaly detection has primarily focused on manufacturing scenarios with highly controlled poses and a limited number of object categories. Existing benchmarks like MVTec-AD (Bergmann et al., 2021) and VisA (Zou et al., 2022) have reached saturation, with state-of-the-art methods achieving up to 99.9% AUROC scores. In contrast to manufacturing, anomaly detection in retail logistics faces new challenges, particularly in the diversity and variability of object pose and appearance. Leading anomaly detection methods fall short when applied to this new setting.
To bridge this gap, we introduce a new benchmark that overcomes the current limitations of existing datasets. With over 230,000 images (and more than 29,000 defective instances), it is 40 times larger than MVTec and contains more than 48,000 distinct objects. To validate the difficulty of the problem, we conduct an extensive evaluation of multiple state-of-the-art anomaly detection methods, demonstrating that they do not surpass 56.96% AUROC on our dataset. Further qualitative analysis confirms that existing methods struggle to leverage normal samples under heavy pose and appearance variation. With our large-scale dataset, we set a new benchmark and encourage future research towards solving this challenging problem in retail logistics anomaly detection. The dataset is available for download under [https://www.kaputt-dataset.com](https://www.kaputt-dataset.com).
*About the Speaker*
[Sebastian Höfer](https://www.linkedin.com/in/sebastian-h%C3%B6fer-891178121/) is an Applied Science Manager at Amazon Fulfillment Technologies & Robotics, leading machine learning and computer vision research for large-scale robotics and warehouse automation. He received his PhD from the Robotics & Biology Lab at TU Berlin, focusing on Sim2Real transfer and robotic perception. His recent work, “Kaputt: A Large-Scale Dataset for Visual Defect Detection” (ICCV 2025) [37], established a major benchmark for industrial anomaly detection, reflecting his expertise at the intersection of academic research and real-world deployment.
**Data Foundations for Vision-Language-Action Models**
Model architectures get the papers, but data decides whether robots actually work. This talk introduces VLAs from a data-centric perspective: what makes robot datasets fundamentally different from image classification or video understanding, how the field is organizing its data (Open X-Embodiment, LeRobot, RLDS), and what evaluation benchmarks actually measure. We'll examine the unique challenges such as temporal structure, proprioceptive signals, and heterogeneity in embodiment, and discuss why addressing them matters more than the next architectural innovation.
*About the Speaker*
[Harpreet Sahota](https://www.linkedin.com/in/harpreetsahota204/) is a hacker-in-residence and machine learning engineer with a passion for deep learning and generative AI. He’s got a deep interest in VLMs, Visual Agents, Document AI, and Physical AI.
**Most AI Agents Are Broken. Let’s Fix That**
AI agents are having a moment, but most of them are little more than fragile prototypes that break under pressure. Together, we’ll explore why so many agentic systems fail in practice, and how to fix that with real engineering principles. In this talk, you’ll learn how to build agents that are modular, observable, and ready for production. If you’re tired of shiny agent demos that don't deliver, this talk is your blueprint for building agents that actually work.
*About the Speaker*
[Bilge Yücel](https://www.linkedin.com/in/bilge-yucel/) is a Senior Developer Relations Engineer at deepset, helping developers build agentic AI apps with Haystack. Passionate about AI, she makes complex concepts approachable through hands-on tutorials, both online and at real-life events.
**Operationalizing Computer Vision for Overhead Lines: Beyond the Demo**
At first glance, visual inspection of high-voltage power lines seems straightforward: collect imagery, run one or two AI models, and report the findings. In practice, moving beyond a proof of concept reveals a range of issues that can make or break a campaign. Common concerns include data quality and coverage, scarcity of the most relevant cases and abundance everywhere else, variations in pylon geometry and asset types across regions, calibration and GIS alignment challenges, and a long tail of edge cases that emerge in real-world operations.
This talk introduces Siemens Energy’s end-to-end overhead line inspection solution and shares key learnings from inspecting more than 10,000 km of power lines for real customers across several continents. We will show how raw 2D/3D data is transformed into structured information, delivering insights into asset inventory as well as defects, and supporting maintenance and planning decisions for critical infrastructure. The focus is on the combination of algorithmic building blocks and scalable processing, designed for robustness and consistency at scale, where even low error rates can become operationally significant.
*About the Speaker*
[Stefan Wakolbinger](https://www.linkedin.com/in/stefan-wakolbinger-aa0ba874/) is the Development Team Lead for AI & Analytics at SIEAERO, Siemens Energy's digital powerline inspection service. He leads the development of cutting-edge AI and analytics solutions that transform aerial powerline inspection through multi-sensor technology. His team creates digital twins of powerline infrastructure, automates fault detection, and monitors vegetation management—making powerline inspection safer, more precise, and more efficient. Stefan has been driving innovation in this role since September 2022.
**Search your video library like a database**
Drop in YouTube URLs or upload files and query content four ways: exact keyword matching, semantic search across transcripts, visual scene search via SigLIP2, and LLM-generated answers that synthesise across segments.
[Paras Mehta](https://www.linkedin.com/in/pmehtaeu/) is a Berlin-based AI engineer and CTO/co-founder of Sylby, a language learning app he built from scratch, reaching 10,000 users and raising €350K. Previously: data scientist at Motionlogic, senior software engineer at Volkswagen, a PhD from Freie Universität Berlin, and a visiting stint at Cambridge. He now works as an AI engineer at HPI's AI Service Centre.
Alerting Best Practices | Customer Story | Platform Engineering
**🏆 Win a free ticket to [DASH26](https://dash.datadoghq.com/)!** We’re hosting an on-site raffle where the grand prize is a ticket to Datadog’s annual conference in New York City.
\-\-\-\-
All talks will be **presented in English**, to ensure that as many people as possible can participate and engage at this event.
**If you want to attend, please RSVP to secure your spot - this will make organizing easier. Thank you so much ♥️**
**Location:** [The-B Berlin, Revaler Str. 32, 10245 Berlin](https://www.theb-berlin.com/)
\-\-\-\-\-\-
**🏠 18:00 - Arrival: Networking, Drinks & Snacks (30 min.)**
Grab yourself snacks & drinks and say hello to everybody else!
**📅 18:30 - Introduction & What's new at Datadog? (15 min.)**
**🎙️**Speaker: Marcel Drechsler, User Group Leader & Product Owner Internal Developer Platform @ **[andsafe](https://andsafe.de)**
Introduction into the evening and highlights of Datadog's recent new features and products.
**📅 18:45 - Powering Platform Engineering through Datadog (30 min.)**
**🎙️**Speaker: Marcel Drechsler, User Group Leader & Product Owner Internal Developer Platform @ **[andsafe](https://andsafe.de)**
In the rapidly changing landscape of Platform Engineering, Datadog has evolved from a monitoring tool into a comprehensive foundation for Internal Developer Platforms (IDPs). This session explores the journey of scaling observability and security into a unified platform strategy that reduces developer friction. We will dive into how Datadog’s expanding ecosystem provides the essential building blocks for modern self-service infrastructure. Attendees will learn how to leverage these integrated features to build a more resilient and transparent developer experience. Discover how to transform your Datadog instance into a strategic asset for your platform’s success.
**📅 19:15 - Logs as a First-Class Citizen - How Lightspeed Commerce evolved logs to unlock the full power of Datadog**
**(30 min.)**
**🎙️**Speaker: Rein Martha, Staff Software Engineer, Lightspeed Commerce
When Lightspeed started with Datadog, we didn't begin with traces or metrics — we began with evolving our logs. Raw, unstructured, and full of noise. The first step was making them worth keeping: trimming duplicates, removing what no one ever read, and transforming what remained into structured, queryable signals.
That foundation changed everything. Once logs became first-class — with clean attributes, consistent structure, and a clear purpose for every line — the rest of the observability stack followed naturally. Monitors built on log queries. Dashboards that actually meant something on incidents. Metrics generated directly from log attributes, giving us long-term retention without the cost of keeping everything raw.
**📅 19:45 - Best Practices for Alerting with Datadog (30 min.)**
**🎙️**Speaker: Santiago Gomez Saez, Datadog ambassador & Principal Cloud Architect @ **[dxone](https://www.dx.one.gmbh/)**
Operational excellence is the main objective of SRE teams. Focusing on alerting, this talk shares common pitfalls and best practices on how and when to alert when incidents occur. In addition, we show how to self-heal in some cases requiring no manual intervention.
**🥗 20:15 - Drinks, Food & Networking**
Enjoy refreshments while networking with community peers!
**👋 21:00 - Goodbye, see you next time!**
Women in Tech Night: Women Defining the Future of Corporate Bank Technology
***This is an inclusive event – all genders are welcome!***
**Women in Technology at Deutsche Bank** are delighted to invite you to a special evening bringing together leaders and experts from across the technology community.
The program will offer insights into recent technology achievements in Corporate Banking Technology and celebrate the women behind them. The evening will also feature a fireside chat with senior technology leaders from Deutsche Bank discussing leadership, innovation, and the evolving role of technology in the financial industry.
**🗓 Agenda**
* **18:30** – Doors open
* **18:55** – Welcome & introductions
* **19:00 – 19:30** – *The Digital Uplate from Corporate Banking Technology* by Padmavathi Ravi (Deutsche Bank)
* **19:30 – 19:45** – *Wero Payments for the Future of Commerce in Europe* by Hama Kasiri (Deutsche Bank)
* **19:45 – 20:15** – Break
* **20:15 – 21:00** – Fireside chat **Mary Hynes-Martin** (Head of Strategy for Corporate Bank Technology at Deutsche Bank), **Juliet Parab** (CIO Security Services at Deutsche Bank)
* **20:15 – 22:00** – Networking
* **22:00** – Doors close
Big Data Events Near You
Connect with your local Big Data community
Data Cleansing using Data Bricks
The May Ohio North Database Training user group meeting will be held on **May 5th, 2026 at 5:00PM**. This will be a **HYBRID** event and we will be joined in person by **Sam Nasr.**
You're welcome to come meet in-person at our meeting location, the offices of Improving at
**[6000 Freedom Square Dr,](https://www.google.com/maps/place/Improving/@41.4004167,-81.6614462,17z/data=!3m2!4b1!5s0x8830e5b8255c5919:0xd8297060eb68fe04!4m6!3m5!1s0x8830dc7a0fe35dc9:0xbfc4710ecadfc5c!8m2!3d41.4004127!4d-81.6588713!16s%2Fg%2F1hm3hkqp3?entry=ttu&g_ep=EgoyMDI1MDQzMC4xIKXMDSoASAFQAw%3D%3D)**
**[Unit 110,](https://www.google.com/maps/place/Improving/@41.4004167,-81.6614462,17z/data=!3m2!4b1!5s0x8830e5b8255c5919:0xd8297060eb68fe04!4m6!3m5!1s0x8830dc7a0fe35dc9:0xbfc4710ecadfc5c!8m2!3d41.4004127!4d-81.6588713!16s%2Fg%2F1hm3hkqp3?entry=ttu&g_ep=EgoyMDI1MDQzMC4xIKXMDSoASAFQAw%3D%3D)**
**[Independence, OH 44131](https://www.google.com/maps/place/Improving/@41.4004167,-81.6614462,17z/data=!3m2!4b1!5s0x8830e5b8255c5919:0xd8297060eb68fe04!4m6!3m5!1s0x8830dc7a0fe35dc9:0xbfc4710ecadfc5c!8m2!3d41.4004127!4d-81.6588713!16s%2Fg%2F1hm3hkqp3?entry=ttu&g_ep=EgoyMDI1MDQzMC4xIKXMDSoASAFQAw%3D%3D)**
[Teams Link ](https://teams.microsoft.com/meet/287759659366576?p=kCaammjECnUCvZzEJv)if anyone needs it after RSVP-ing for in person.
If you would like to subscribe to our email list outside of Meetup, we have changed platforms recently and you will need to register [here in Kit ](https://ohio-north-data-training.kit.com/b8f036f615)instead to receive emails.
Agenda:
**5:00 PM EST**: Online and in-person meeting begins with a social hour. This is an unstructured hour where you can join us to catch up and meet other group members before the session starts. There will be food brought in for in-person attendees.
**6:00 PM EST**: Elections, announcements, followed by our feature presentation. See below for presentation details.
**7:30 PM EST**: Optionally after the main presentations, the in-person crowd may go out for snacks and drinks at a local establishment.
We hope to see you there!
Session Abstract
### Data Cleansing using Data Bricks
Machine Learning is highly dependent on adequate data. Not only does quantity matter, but more importantly quality. In this session we’ll cover how to build a custom automated process using Data Bricks. This will provide methods for cleaning data in a data lake using functions in Azure.
\*Please note, that we will be using Microsoft Teams for the online portion of this meeting. You may want to join a few minutes early to ensure you do not have any issues. If you are attending in person, there are large TVs at the office, and you do not need to bring a laptop or use Teams.
COhPy Monthly Meeting
**Improving Office in Franklinton**
Physical location:
Improving Office
330 Rush Alley Suite #150
Columbus, OH 43215
Schedule:
6:00 p.m.: Socialize, eat, and drink. Improving will be providing pizza and beverages.
6:30 to 8:00 pm. Main meeting and presentation(s).
Topic: This month Chris Pazsint will be talking about Agentic Coding. How does one use CLI Based Agents, and Agentic IDEs such as Cursor, Kiro, Antigravity? How to include agentic coding plugins for IDEs you already love such as Visual Studio Code.
We meet on the last Monday of each Month. Presentations are given by members and friends of this group. If you would like to do a presentation (small or large) on a python topic, please contact Central OH Python at centralohpython@gmail.com
What If Your AI Could Be a Team? - Chad Green
**Important time note:** Please plan on arriving between 5:30 and 6:00 as the elevators lock after 6 and you'll need to message us and we'll need to come get you.
The building address is 4450 Bridge Park
The entrance is 6620 Mooney St, Suite 400
You will need to scan your ID at the door to get a visitor badge.
**Abstract**
GitHub Copilot is powerful, but what if you could scale from a solo AI assistant to an entire team of specialized agents working in parallel? This session introduces Squad: an open-source framework for multi-agent orchestration that lets you define teams of AI agents with specific roles, responsibilities, and expertise.
We'll progress from Copilot basics to the Copilot CLI, explore how Agents add autonomy, and see how Instructions and Skills let you customize agent behavior. Then, the climax: a live demo where a Squad team of 3 agents (Lead, Developer, Tester) stands up and builds a working application in real-time, showcasing true multi-agent collaboration.
Whether you're new to AI or exploring how to scale your use of Copilot, this session will show you what's possible when agents work as a team.
**YouTube Link**
TBD
Indianapolis Modern Dating for Career Professionals
**🫶 Virtual Speed Dating – Indianapolis Singles, Curated by Personality**
Online speed dating for Indianapolis locals — hosted live on Zoom, personality matched. We match you with compatible Indianapolis locals using a quick personality quiz. You'll chat one-on-one on Zoom in short timed rounds while a host guides the session.
**Register under your age group:**
- ⚡ **Ages 18-32** → [REGISTER HERE](https://tempodating.com/product?productId=430.0&productType=onlineSpeedDating&city=Indianapolis&groupurlname=dynamic-local-singles-speed-dating-meetup&ar=18-32&face_v=3.0)
- **Ages 30-46** → [REGISTER HERE](https://tempodating.com/product?productId=430.0&productType=onlineSpeedDating&city=Indianapolis&groupurlname=dynamic-local-singles-speed-dating-meetup&ar=30-46&face_v=3.0)
**⚠️ RSVP alone won't secure your spot.** You need to register through your age group link below and complete the personality quiz. Places are limited.
---
👥 **Best for:**
- Singles who prefer a hosted, structured experience
- Indianapolis locals after personality-matched dates
⭐ *"Ideal for introverts. Felt at ease the entire time."* – Indianapolis attendee
**At a glance**
- **Format:** Live on Zoom – guided rounds from your home
- 💡 **Location:** Your space – couch, desk, wherever suits you
🔄 **What to expect**
1. **Register** – Select your age group above and sign up.
2. **Take the personality quiz** – We use it to pair you with compatible Indianapolis singles.
3. **Log in** – Connect to the Zoom session from home. The host runs the show.
💡 **Tip:** Good lighting and a clean background make a big difference.
**Frequently Asked Questions**
**Do I need anything besides Zoom?**
Just Zoom, a webcam, and Wi-Fi. That's it.
**What's the personality quiz for?**
**Is my info private?**
---
✨ Meet Indianapolis singles from home. Sign up and we'll take care of the rest. 💖 ✨
Quick version: this one gets straight to the point.
Everything important, without the extra padding.
Columbus HUG April
Want to be a speaker? submit your talk to our Call for Presenters!!!
https://sessionize.com/cbus-hug-2026/
DoJo (Informal Python Meeting)
**Latest Dojo Location!**
**Knotty Pine Brewing**
1765 W 3rd Ave,
Columbus, OH 43212
We're going to try a new dojo location for a few weeks and see how it works
Dojos are informal Python group study sessions where everyone interested in Python gathers to learn about Python, help others with Python, or just hang out. Everyone is welcome from Python beginners to experts. Bringing a laptop is encouraged (we'll have extension cords and power strips). If there's something you want to learn leave a comment on this invite so we can plan ahead.
We're looking for speakers for our Monthly Meetups! Fill out the form if you are interested in presenting to the Python Community.
https://forms.gle/ehSfUAC2WgR34Crq9
Agile Coaching Circle -- IN-PERSON
Join other experienced and aspiring agile coaches and professionals to:
* develop and practice your coaching skills in a peer-to-peer environment
* share current successes and challenges in your work environment and get support from each other
* learn from each other, build better relationships and experiment with new ideas
***NOTE:*** Pre-registration is required for this event. **Please arrive 10 minutes early** to check in at the security desk.

























