Data at a SAAS company: Clojure, Cascalog, Hadoops, Clustering & Datas


Details
Data Masters and Big Data Folk!
This meetup is proudly sponsored by Zendesk, next Wednesday November 5th, 2014 (6.30pm) at Zendesk Offices, 488 Bourke St, Melbourne 3000
Topic of the Night – ‘Data at a SAAS company: Clojure, Cascalog, Hadoops, Clustering & Datas' by Zendesk (Chris Hausler, Anh, Thien Dinh & Jeff Theobald) and PWC data scientist , Matt Kupperholz.
Sponsor of the night is Zendesk, we're looking very much forward to
hearing Zendesk talk on all things Big Data with a side of pizza & beer.
Agenda and Details:
6:00pm
Doors open, beer/drinks, food
6:30pm: 5 min
Intro - Jason Smale
6:35pm: 15 min + 5 min QA
Data Systems for a SAAS company, Hadoop and Clojure at work by Chris Hausler.
At Zendesk, we’ve used the power of Clojure to build a batch analytics system on top of Hadoop that helps us to gain insight into our data stores.
The presentation will provide an introduction to building such a system with
Cascalog on top of Clojure for data processing and Midje for test automation.
(Emphasis on workflow, testing and how cascalog makes this easy).
6:55pm: 15 min + 5 min QA
Investigating customer clustering and segmentation @ Zendesk by Anh Thien Dinh
A dive into customer clustering and segmentation with Python along with an overview of the tools Zendesk are using for adhoc data queries (gorilla, ipython, python->clojure).
7:15pm: 20 min Break
Networking
7:35pm: 15 min + 5 min QA
CroSoLoMo -> M@ (Matt) Kupperholz
M@ is a partner at PwC and data scientist with over 20 years experience who will be talking about how the megatrends of Crowd, Social, Location and Mobility are expanding the value data scientists can bring to a wide range of business challenges.
7:55pm: 15 min + 5 min QA
The data pilgrimage from sharded MYSQL databases to HDFS by Jeff Theobald
Before we can analyse our data at Zendesk, we need to first have the data accessible to the Hadoop ecosystem. Getting the data reliably and regularly is an interesting story.
8:30pm: End
Looks like we have a great session for this meetup, so we look forward to seeing you again on Wednesday 5th November, 2014 for another special edition.
Please invite a friend or two (or three) to join the meetup group, RSVP this meetup, participate, network, eat some pizza with a side of beer and most importantly wear your big data hat (regardless if your hat reads BDNewbie or BDGuru).
RSVP away and see you soon at this meetup!
Cheers,
Fernando

Data at a SAAS company: Clojure, Cascalog, Hadoops, Clustering & Datas