Past Meetup

Data at a SAAS company: Clojure, Cascalog, Hadoops, Clustering & Datas

This Meetup is past

97 people went

Location image of event venue


Data Masters and Big Data Folk!

This meetup is proudly sponsored by Zendesk, next Wednesday November 5th, 2014 (6.30pm) at Zendesk Offices, 488 Bourke St, Melbourne 3000

Topic of the Night – ‘Data at a SAAS company: Clojure, Cascalog, Hadoops, Clustering & Datas' by Zendesk (Chris Hausler, Anh, Thien Dinh & Jeff Theobald) and PWC data scientist , Matt Kupperholz.

Sponsor of the night is Zendesk, we're looking very much forward to
hearing Zendesk talk on all things Big Data with a side of pizza & beer.

Agenda and Details:


Doors open, beer/drinks, food

6:30pm: 5 min

Intro - Jason Smale

6:35pm: 15 min + 5 min QA

Data Systems for a SAAS company, Hadoop and Clojure at work by Chris Hausler.

At Zendesk, we’ve used the power of Clojure to build a batch analytics system on top of Hadoop that helps us to gain insight into our data stores.

The presentation will provide an introduction to building such a system with
Cascalog on top of Clojure for data processing and Midje for test automation.
(Emphasis on workflow, testing and how cascalog makes this easy).

6:55pm: 15 min + 5 min QA

Investigating customer clustering and segmentation @ Zendesk by Anh Thien Dinh

A dive into customer clustering and segmentation with Python along with an overview of the tools Zendesk are using for adhoc data queries (gorilla, ipython, python->clojure).

7:15pm: 20 min Break


7:35pm: 15 min + 5 min QA

CroSoLoMo -> M@ (Matt) Kupperholz

M@ is a partner at PwC and data scientist with over 20 years experience who will be talking about how the megatrends of Crowd, Social, Location and Mobility are expanding the value data scientists can bring to a wide range of business challenges.

7:55pm: 15 min + 5 min QA

The data pilgrimage from sharded MYSQL databases to HDFS by Jeff Theobald

Before we can analyse our data at Zendesk, we need to first have the data accessible to the Hadoop ecosystem. Getting the data reliably and regularly is an interesting story.

8:30pm: End

Looks like we have a great session for this meetup, so we look forward to seeing you again on Wednesday 5th November, 2014 for another special edition.

Please invite a friend or two (or three) to join the meetup group, RSVP this meetup, participate, network, eat some pizza with a side of beer and most importantly wear your big data hat (regardless if your hat reads BDNewbie or BDGuru).

RSVP away and see you soon at this meetup!