addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Cassandra and Hadoop at Knewton

Giannis Neokleous( from Knewton( will be presenting on their integration of Cassandra and Hadoop:

"Cassandra is really fast in terms of writing but how do you get stuff out as fast? In this talk we're going to talk about how you can make Hadoop and Cassandra play nice together. The ColumnFamilyInputFormat and OutputFormat are great but they can sometimes put a lot of load onto Cassandra which in a production cluster with a lot of traffic it's easy for it become prohibitively expensive. We're going to introduce a new set of InputFormatReaders that can read tables from a cluster and also briefly talk about how could implement OutputFormats and partition techniques for writing the tables without the need of having a live cluster."

Join or login to comment.

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy