Data Mining with Scala at Identified

Scala is great for big data cleaning, gleaning, and modeling. We'll see how Identified uses it to get insights into people ranking and search.

 

Jan Prach is a Scala developer at Identified using Machine Learning and Data Mining to answer key question about people that make Identified stand out.  He will overview the overall data flow and focus on several points of interest along the way, and how Scala makes it all happen.

 

Data Science is perceived to be complicated, but with the help of Scala and Scalding most patterns can be simplified to a great bit. In this talk, we will show how many common patterns in data science can be redesigned in many cases to almost a single line!


Vitaly Gordon is a senior data scientist on the LinkedIn Product Data Science team where he develops data products that most of you use every day. Prior to LinkedIn, Vitaly founded the data science team at LivePerson and worked in the elite 8200 unit (the Israeli equivalent of the NSA), leading a team of researchers in developing algorithms to fight terrorism. His contributions have been recognized through a number of awards including the “Life Source” award, an award given each year deemed most high-impact in saving lives. Vitaly holds a B.Sc in Computer Science and an MBA from the Israeli Institute of Technology.

Chris Severs works in the Search Science applied research group at eBay. Chris fell in love with Scala at first sight and has been one of the main drivers of Scala adoption at eBay. He has contributed to the Scalding and Scoobi open source projects and authored an addition to Scalding to provide support for Apache Avro. Prior to joining eBay he was a postdoctoral researcher at The Mathematical Sciences Research Institute in Berkeley and then at Reykjavík University in Iceland.

Join or login to comment.

  • Aish F.

    Couldn't make it, but sounds really interesting. Don't suppose anyway has video and/or slides?

    July 25, 2013

  • Thomas C.

    Any word on if the video that was made there will be made available?

    August 2, 2013

  • Evan D.

    Hi all,

    We've finished editing the videos, from these two talks, and you can check them out at these links:

    Jan: http://mrkn.co/nqkoi

    Vitaly & Chris: http://mrkn.co/rgny6

    Cheers everyone!

    August 9, 2013

  • Vitaly G.

    Hey All,
    Sorry it took a while, but here is the code for the talk.
    https://github.com/vitalyg/SfScalaTalk

    Vitaly

    August 4, 2013

    • Joaquin D.

      Any slides?

      August 5, 2013

    • Vitaly G.

      There were no slides, just code. There will be a video at some point.

      August 5, 2013

  • Dragisa K.

    Good meetup. +1 for the code examples.

    July 24, 2013

  • Kenner S.

    Parking impossible. ;-(

    July 24, 2013

  • Marcio D.

    I won't be able to attend. Enjoy!

    July 24, 2013

  • Chester C.

    I am sick and can't come, so my spot will be given to one of the person on waiting list. is there streaming ?

    July 24, 2013

  • Erin

    will give up my seat as I should be able to make Scala Days :-)

    July 23, 2013

  • Christopher C.

    Cannot come. At least it will make two people happy in the waiting list happy. See you next time.

    July 23, 2013

  • Daniel C.

    How long will this meetup go for?

    July 22, 2013

    • Alexy K.

      Each talk is 1 hour, so ~6:30-9:30 with social/Q&A.

      July 23, 2013

  • Ramkumar R.

    Will you provide us any material or slide deck at the end of the presentation? Please clarify.

    1 · July 22, 2013

  • Abhishek K.

    Sorry cant make it. Will miss the event.

    July 22, 2013

  • Alexy K.

    Folks, please review your RSVPs carefully and adjust properly given the waiting list size. The auditorium is hard-capped at 100 by *walls* so we have to go by the waiting list FIFO, crashing it will not work out. Vitaly & Chris's talk will be repeated at Scala Symposium 8/2, where a few seats are still available (next on SF Scala).

    July 22, 2013

  • Ryan D.

    Due to an unforseen medican situation, I won't be able to make this ... I'm seriously bummed.

    July 21, 2013

  • Stoney V.

    Out of town. Next time

    July 15, 2013

  • Alexy K.

    Vitaly and Chris will give a version of their talk again at the Silicon Valley Scala Symposium (SVSS) -- funconf.org, and we'll have more data mining talks there. Check it out here at SF Scala if you can't make the Identified meetup. (SVSS is filling up fast too and we may cap it earlier.)

    July 11, 2013

  • Alexy K.

    Folks -- 100 is a hard capacity limit, so we'll ask the Yes guys to manage their RSVP responsibly, updating it if their plans change, and the waiting list folks will be propelled inside the tent automatically in a FIFO manner whenever someone inside cancels.

    3 · June 27, 2013

  • Roy R.

    Really looking forward to this! (We use Scala for our operational telemetry system)

    1 · June 25, 2013

  • Tony H.

    Interesting topic!

    June 25, 2013

Our Sponsors

People in this
Meetup are also in:

Sometimes the best Meetup Group is the one you start

Get started Learn more
Katie

I'm surprised by the level of growth I've seen since becoming an organizer, it's given me more confidence in my abilities.

Katie, started NYC ICO

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy