Skip to content

About us

Apache Pinot is a realtime distributed OLAP datastore, which is used to deliver scalable real time analytics with low latency. It can ingest data from batch data sources (S3, HDFS, Azure Data Lake, Google Cloud Storage) as well as streaming sources (such as Kafka). Pinot is used extensively at LinkedIn and Uber to power many analytical applications such as Who Viewed My Profile, Ad Analytics, Talent Analytics, Uber Eats and many more serving 100k+ queries per second while ingesting 1Million+ events per second.

Pinot committers are active on slack. Click here to join our slack channel.
This meetup is for developers and users of Apache Pinot to share information on
• How to use Pinot 
• Internals of Pinot 
• Products built on top of Pinot

More info on Pinot
Apache Pinot Website

Apache Pinot Docs 
Blog posts

> • https://engineering.linkedin.com/blog/2019/03/pinot-joins-apache-incubator
> • https://engineering.linkedin.com/blog/2019/06/star-tree-index--powering-fast-aggregations-on-pinot
>
> • https://engineering.linkedin.com/blog/2019/auto-tuning-pinot
>
> • Pinot at Uber

Upcoming events

1

See all
  • Network event
    Webinar: Full-Text Search on Apache Iceberg w/ Pinot and Lucene

    Webinar: Full-Text Search on Apache Iceberg w/ Pinot and Lucene

    ·
    Online
    Online
    4 attendees from 10 groups

    To attend, register here.

    While Data Lakehouses like Apache Iceberg provide massive, cost-effective scalability, they are fundamentally designed as scan-heavy engines.

    They lack the sub-second, "needle-in-a-haystack" search capabilities provided by inverted indices found in traditional search engines.

    This session explores how Apache Pinot fills this gap by integrating Apache Lucene segments directly into its distributed serving layer while maintaining the source of truth in Iceberg's Parquet format.

    We will conduct a technical deep-dive into:

    • Segment-to-Parquet Virtualization: Pinot’s segment abstraction onto remote Iceberg/Parquet files without data duplication or heavy re-ingestion.
    • Hybrid Index Pinning: The mechanics of pinning Lucene Inverted and Text Indexes to local NVMe storage on Pinot servers while leaving the raw data blobs on S3.
    • Lucene I/O Orchestration: How the Pinot optimizes query plans to minimize S3 "Time to First Byte" by leveraging metadata-heavy index structures.
    • Photo of the user
    1 attendee from this group

Group links

Organizers

StarTree is a Super Organizer

Members

2,045
See all
Photo of the user Kishore Gopalakrishna
Photo of the user Foo Lim
Photo of the user Hiren
Photo of the user Vardan Aroustamian
Photo of the user Gerald Wluka
Photo of the user K e l v i n
Photo of the user Randy Breunling
Photo of the user Sriram Baskaran
Photo of the user AKulkarni
Photo of the user Victor Chugunov
Photo of the user Alex Kin
Photo of the user Mark Chekhanovskiy

Find us also at