Skip to content

About us

This is a group for anybody interested in Big Data: what it means, what tools are there and how to use them, how others are analyzing the data and what is the value they get out of it. Meetups will be both socializing and learning ones. If you are interested about the topic join us.

Sponsors

eSolutions

eSolutions

Unlock the power of your data with customized digital solutions.

Upcoming events

1

See all
  • Reliable Data Streaming in Hybrid Environments

    Reliable Data Streaming in Hybrid Environments

    ING Hubs Romania, Calea Victoriei 174-176, Bucharest, RO

    Join us on Tuesday, June 9, for a new edition of Bucharest Big Data meetup, on real-time data streaming in hybrid environments - from on-prem pipelines to cloud systems. First, Catalin Duta, DevOps Engineer, and Anamaria Lauric, Chapter Lead Engineer at ING Hubs Romania, will cover enabling real-time data flows from on-premise to cloud, showing how modern platforms deliver seamless, low-latency access across systems to support analytics and operations.
    Next, Roberto Comsa, Data Engineer at eSolutions, will cover why data validation matters in CDC streaming pipelines, walking through a production journey from CDC sources via NiFi, Kafka, Spark, and Delta Lake on MinIO, sharing practical lessons on catching drift early and building trust without heavy rules.

    First talk: Enabling the Journey From On-Premise to Cloud With Real-Time Data
    Many organizations operate across both on-premise and cloud environments, making timely and reliable data access essential to support modern business needs. As data landscapes evolve, the ability to share and consume information seamlessly across platforms becomes a key foundation for responsiveness, scalability, and innovation.
    In this talk, we explore how a modern data platform enables real-time and near-real-time data flows across on-premise and cloud environments. Streaming capabilities help ensure that data remains continuously available and aligned across systems, supporting operational processes, analytics, and digital use cases without delay.
    By enabling consistent and real time data access across environments, organizations can improve decision making, support new business opportunities, and take full advantage of cloud native capabilities, while operating within a flexible and future-ready data ecosystem.

    About the speakers
    Catalin Duta, DevOps Engineer at ING Hubs Romania
    Catalin has over 6 years of experience at ING, working within the Analytics domain. Throughout his journey, he has contributed to multiple initiatives that support scalable and reliable data platforms, adapting easily to new challenges. Curious and eager to learn, Catalin brings a collaborative mindset and a continuous-improvement attitude to every project he is involved in. Outside of work, he enjoys playing padel, exploring specialty coffee with friends, and experimenting with smart home automations.

    Anamaria Lauric, Chapter Lead Engineer at ING Hubs Romania
    Anamaria has been part of the organization for 7 years and has been a chapter lead for nearly 5 years. Over this period, she has worked on near real-time data ingestion, supporting teams in designing and implementing data flows that enable access to near real-time data. Her work has focused on helping teams move from traditional approaches to modern, streaming-based architectures, allowing data to be consumed faster and more effectively across platforms. Outside of work, she enjoys skiing and spending time with her little one, which is her favorite way to recharge.

    Second talk: Why Data Validation Matters in CDC Streaming Pipelines
    Building real-time data products on top of CDC streams is powerful, but it also introduces subtle failure modes that can quietly affect data quality and trust. In his session, Roberto will walk us through a practical journey from ingestion to consumption in a modern on-prem data platform, covering how events flow from CDC sources through NiFi, Kafka, Spark, and Delta Lake on MinIO.
    The talk focuses on how production streaming systems can drift in subtle ways, and how, even without heavy business-rule validation at source, lightweight checks remain essential for building trust in data and exposing issues early.
    In the end, he will also share practical lessons learned, highlight the mindset shift required to balance reliability with efficiency, and reinforce the importance of stronger end-to-end ownership of the data pipelines teams build and operate.

    About the speaker
    Roberto Comsa, Data Engineer at eSolutions
    Roberto is a Data Engineer currently building and operating ELT pipelines on top of open-source technologies in on-prem environments. His background combines full-stack software engineering with distributed data systems, giving him a practical perspective on both product delivery and platform reliability. Roberto is particularly interested in streaming architecture and making data platforms observable, resilient, and cost-aware. Outside of work, he enjoys calisthenics and competitive sailing.

    Agenda
    18:30 - 18:50 - Welcome & Networking
    18:50 - 19:30 - Catalin Duta & Anamaria Lauric: Enabling the Journey From On-Premises to Cloud With Real-Time Data
    19:30 - 20:10 - Roberto Comsa: Why Data Validation Matters in CDC Streaming Pipelines
    20:10 - 21:00 - Networking

    The event is hosted by ING Hubs Romania. Meet us all on Tuesday, June 9, at their office (174-176 Calea Victoriei).

    This is an in-person event, with presentations conducted in English. Please RSVP to secure your spot.

    See you there!

    • Photo of the user
    • Photo of the user
    • Photo of the user
    23 attendees

Group links

Organizers

Members

1,951
See all

Find us also at