addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Data Mining with Scala at Identified

Scala is great for big data cleaning, gleaning, and modeling. We'll see how Identified uses it to get insights into people ranking and search.


Jan Prach is a Scala developer at Identified using Machine Learning and Data Mining to answer key question about people that make Identified stand out.  He will overview the overall data flow and focus on several points of interest along the way, and how Scala makes it all happen.


Data Science is perceived to be complicated, but with the help of Scala and Scalding most patterns can be simplified to a great bit. In this talk, we will show how many common patterns in data science can be redesigned in many cases to almost a single line!

Vitaly Gordon is a senior data scientist on the LinkedIn Product Data Science team where he develops data products that most of you use every day. Prior to LinkedIn, Vitaly founded the data science team at LivePerson and worked in the elite 8200 unit (the Israeli equivalent of the NSA), leading a team of researchers in developing algorithms to fight terrorism. His contributions have been recognized through a number of awards including the “Life Source” award, an award given each year deemed most high-impact in saving lives. Vitaly holds a B.Sc in Computer Science and an MBA from the Israeli Institute of Technology.

Chris Severs works in the Search Science applied research group at eBay. Chris fell in love with Scala at first sight and has been one of the main drivers of Scala adoption at eBay. He has contributed to the Scalding and Scoobi open source projects and authored an addition to Scalding to provide support for Apache Avro. Prior to joining eBay he was a postdoctoral researcher at The Mathematical Sciences Research Institute in Berkeley and then at Reykjavík University in Iceland.

Join or login to comment.

  • Evan D.

    Hi all,

    We've finished editing the videos, from these two talks, and you can check them out at these links:


    Vitaly & Chris:

    Cheers everyone!

    August 9, 2013

  • Vitaly G.

    Hey All,
    Sorry it took a while, but here is the code for the talk.


    August 4, 2013

    • Joaquin D.

      Any slides?

      August 5, 2013

    • Vitaly G.

      There were no slides, just code. There will be a video at some point.

      August 5, 2013

  • Dragisa K.

    Good meetup. +1 for the code examples.

    July 24, 2013

  • Kenner S.

    Parking impossible. ;-(

    July 24, 2013

  • A former member
    A former member

    I won't be able to attend. Enjoy!

    July 24, 2013

  • Chester C.

    I am sick and can't come, so my spot will be given to one of the person on waiting list. is there streaming ?

    July 24, 2013

  • Erin M.

    will give up my seat as I should be able to make Scala Days :-)

    July 23, 2013

  • Christopher C.

    Cannot come. At least it will make two people happy in the waiting list happy. See you next time.

    July 23, 2013

  • Daniel C.

    How long will this meetup go for?

    July 22, 2013

    • A former member
      A former member

      Each talk is 1 hour, so ~6:30-9:30 with social/Q&A.

      July 23, 2013

  • Ramkumar R.

    Will you provide us any material or slide deck at the end of the presentation? Please clarify.

    1 · July 22, 2013

  • Abhishek K.

    Sorry cant make it. Will miss the event.

    July 22, 2013

  • A former member
    A former member

    Folks, please review your RSVPs carefully and adjust properly given the waiting list size. The auditorium is hard-capped at 100 by *walls* so we have to go by the waiting list FIFO, crashing it will not work out. Vitaly & Chris's talk will be repeated at Scala Symposium 8/2, where a few seats are still available (next on SF Scala).

    July 22, 2013

  • Ryan D.

    Due to an unforseen medican situation, I won't be able to make this ... I'm seriously bummed.

    July 21, 2013

  • A former member
    A former member

    Out of town. Next time

    July 15, 2013

  • A former member
    A former member

    Vitaly and Chris will give a version of their talk again at the Silicon Valley Scala Symposium (SVSS) --, and we'll have more data mining talks there. Check it out here at SF Scala if you can't make the Identified meetup. (SVSS is filling up fast too and we may cap it earlier.)

    July 11, 2013

  • A former member
    A former member

    Folks -- 100 is a hard capacity limit, so we'll ask the Yes guys to manage their RSVP responsibly, updating it if their plans change, and the waiting list folks will be propelled inside the tent automatically in a FIFO manner whenever someone inside cancels.

    3 · June 27, 2013

  • Tony H.

    Interesting topic!

    June 25, 2013

Our Sponsors

  • Scale By the Bay

    CFP now open for our yearly conference -- submit by May 31!

  • Twitter

    Awesome venue, food and drinks for our meetups!

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy