SFPUG Sept: Why you need HyperLogLog

Topic: HyperLogLog and big data analytics for Postgres

Host and pizza sponsor: Neustar

Timon Karnezos will give a talk about the 'hll' extension to Postgres. He'll cover the origin and history of sketching algorithms, a few modern sketching data structures, along with common use cases and implementation guidelines. Finally, he'll cover the guts of the extension and how Neustar is using HyperLogLog sketches to support reporting workloads over hundreds of TBs of unaggregated data.

Timon says "As background, I'll give an overview of what sketching algorithms are, with examples (like Bloom Filters, Count Min Sketch, HyperLogLog, and K-Min Values), use cases, and high-level implementation ideas. Then we'll talk about the PG HLL extension and how we use it at Neustar.

Live video stream will be here.  (assuming it works)

Join or login to comment.

  • Timon K.

    Hi all, I've posted the slides from my HLL talk on our blog: http://research.neustar.biz/2014/09/23/hll-talk-at-sfpug/

    Thank you all for attending!

    2 · September 23

  • David A.

    I wish I could attend, but I will be out of town. We use Vertica's HyperLogLog and consider it revolutionary. Vertica allows you to store intermediate results in a "synopsis," a varbinary(49154) of magic hashes. These synopses can then be queried at other levels of grouping or filtering. This revolutionizes how one can support unique metrics, especially in a self-service BI environment.

    September 18

People in this
Meetup are also in:

Sometimes the best Meetup Group is the one you start

Get started Learn more
Rafaël

We just grab a coffee and speak French. Some people have been coming every week for months... it creates a kind of warmth to the group.

Rafaël, started French Conversation Group

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy