Clojure and Large Data: Working the Middle Ground Between RAM and a Cluster

Hosted By
Dorab P.

Details

We all want fast answers, even as our data sets grow in size. Simple techniques stop working when our data no longer fits in the memory of a single process. In this talk I’d like to share a few techniques I’ve learned that stretch what you can do on a single machine. I’ll cover taking random samples from large data sets with stream sampling, and identifying (probable) duplicates when the set is too large to fit in memory.

Los Angeles Clojure Users Group
MatchCraft LLC
2701 Ocean Park Blvd., Ste 220 · Santa Monica, CA