Data Science Track Kick-Off Event (2 Talks: Szilard Pafka and Eduardo Arino)

We are starting a Data Science Track that will encompass both Machine Learning as used in business applications and the entire process of data mining (business understanding, data collection, exploratory data analysis, data transformations, feature engineering, modeling, model validation, deployment, communication of results).

A note from Szilard Pafka: Rather than starting a separate Data Science Meetup as initially intended (I also run the LA R and DataVis LA meetups) I joined as a co-organizer and I will be mainly responsible for this track.

This track is rooted in the Panel on Data Science events (2 sessions) at the LA R meetup ( http://www.meetup.com/LA-RUG/events/101484102/ ) and in several application oriented machine learning talks in the past at this (LA Machine Learning) meetup. In the former, we have discussed methods, tools and workflows for data analysis/modeling, skills and organizational issues for successful data science projects, and we catered to both data science/machine learning professionals and business executives interested in extracting value from data.

We'd like to expand on this and bring together machine learning professionals, data scientists, business analysts, data engineers, software developers, data hackers along with startup co-founders, tech, business and analytics executives and anyone interested in extracting knowledge and business value from data. There are several ways to achieve this, for example:

1. Companies doing advanced analytics (their data scientists) can present their craft (case studies). A motivation for companies to do so is to make themselves and their analytic sophistication known in order to attract new talent (recruit) or get feedback on their processes.

2. Data scientists/machine learning practitioners can present (talk) or debate (panel) best practices for extracting knowledge and business value from data (methods, algorithms, software tools, pitfalls, challenges, required skills, organization structures etc).

We are looking for speakers for such future events. If you can present "Data Science @XYZ Co." or you would like to give a talk about your experience, methods, tools or achievements in doing data science, please contact Szilard.

------------------------------------------------------------------

Kick-Off event for the Data Science Track:

For this first such meetup, we'll have two 30-minute talks:

1. Szilard Pafka: 10 Pitfalls in Data Science

In this talk I will discuss 10 common pitfalls in doing data analysis, predictive modeling and developing analytical systems in a business environment. Some of these issues are analytical, some are technical while some are business/organizational in nature, so the talk will cover a variety of topics (at various levels from higher-level to more technical) and it should be relevant to a wide range of people interested in data science (data scientists, tech professionals, business executives).

Bio: Szilard Pafka is the Chief Data Scientist at a credit card processor in Santa Monica and a leader of the LA data community. He combines a PhD and more than 15 years of practical experience in performing data analysis and developing analytical systems focused on achieving business goals. He is the founder and organizer of the LA R and DataVis LA meetups and became recently a co-organizer of the Machine Learning meetup. More detailed bio here: http://www.linkedin.com/in/szilard


2. Eduardo Arino de la Rubia: Bootstrapping a Data Science Practice at your Company and in your Career

Being a data scientist can require a PhD, an obsession with matrix notation, and a love of stochasticity. I have none of those. Fortunately, data science can also involve a love of tinkering, data munging, and providing glue between the components brilliant people have built for us. As long as you're hungry to learn and humble enough to keep asking questions, data science is within your grasp. I'll share my experiences that led to becoming my company's de-facto data scientist, how I transitioned to this new kind of hybrid role, and showcase the types of “low-hanging fruit” I was able to address - all while looking great doing it!

Bio: Eduardo Arino de la Rubia is a husband, father, and genuinely fortunate fellow. He started programming when he was 4 years old on a Sinclair ZX Spectrum, and has spent the last 31 years questioning that decision. An exposure to Genetic Algorithms and evolutionary computing taught him that sometimes an indirect approach has real benefits, and the first time he was exposed to a skiplist he realized that messy stochastic approaches often times outperform the best intentions. He has a BS in CS and recently completed General Assembly's Data Science Program.

Timeline:

- 6:15pm food/drinks and networking

- 7:00pm talks starts promptly

Please arrive by 6:55pm the latest.

Please RSVP as places are limited.

Venue: Cross Campus ( http://www.crosscamp.us/ ) will kindly host this meetup. There is no parking provided. Q ( http://www.qconnects.com/ ) will kindly sponsor/provide the food and drinks.

Join or login to comment.

  • Matti S

    Everyone here is invited to come to SCALE 12x, So Cal Linux Expo, Feb 21, 22,[masked] at the LAX Hilton. We've setup a 50% off code for everyone in the group: LAML ( it's a great deal, and the full pass includes most of the Friday events. )
    https://www.socallinuxexpo.org/scale12x

    2 · February 4, 2014

    • Matti S

      The class is using Redhat as a base, so you can pickup Centos and use that for the class. I would recommend making a VM on a fast system, or if an older system in stall it straight. http://www.centos.org...­

      February 19, 2014

    • Jeff W.

      Thanks Matti. I might check it out. There is a nice buffett of stuff to learn. You did a great job this year buddy.

      February 19, 2014

  • Yael K. M.

    Is there a recording? Are Eduardo's slides posted?

    3 · February 19, 2014

  • Andrew M.

    I really enjoyed both presentations and look forward to the next Data Science meetup session.

    February 19, 2014

  • - Szilard Pafka -

    Several people asked me about data leakage, this is the article you wanna read:
    http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.365.7769&rep=rep1&type=pdf

    2 · February 19, 2014

  • Christopher S.

    I can't believe I missed this. Sounds like it was great. I will definitely make the next one.

    February 19, 2014

  • Kien T.

    Two talks were great. I really hope there are more places at DataVis LA next Tuesday so that I can join. Thank you very much, organizers!

    February 19, 2014

    • Jeff W.

      As usual Szilard, smart, succint, valuable info. thanks for sharing. Couldn't make it to the meetup, wish there were video.

      February 19, 2014

  • A former member
    A former member

    Very motivational talks. GGplot is GG for all other plotting :)

    February 18, 2014

  • Adra G.

    Very informative: if anything, I wish each talk could have been longer! Cross Campus was a great space, and the food and beers were tasty and very much appreciated!

    1 · February 18, 2014

  • Daniel G.

    Data Science Track is a real winner

    2 · February 18, 2014

  • Tseng

    10 pitfalls of parking in Santa Monica...

    2 · February 18, 2014

  • Carl M.

    Informative and opens up the possibility for a new job aventure.

    February 18, 2014

  • Jeff W.

    Change of plans, I can't make it. Look forward to the slides and the off chance that somebody might video tape it.

    February 15, 2014

  • Scott P.

    I can no longer make this, but I'll be to the next one for sure.

    February 13, 2014

  • Paul S.

    Darn, will be at corp HQ during this. Next time!

    February 11, 2014

  • Luca F.

    Hi all!

    I'm running the newly created Data Science Meetup in Santa Barbara. For our kick-off meeting we'll have the pleasure to have Russell Jurney, author of Agile Data Science, come talk about rapidly prototyping analytics applications using the Hadoop stack.

    http://www.meetup.com/Santa-Barbara-Data-Science/events/163748742/

    The talk is happening on Monday, February 17, 2014 at 6.30pm, the day before this event, so if you feel like a pleasant drive along the coast you'd get to start your week with two great events in the Southern California data science scene!

    February 5, 2014

Our Sponsors

People in this
Meetup are also in:

Create a Meetup Group and meet new people

Get started Learn more
Henry

I decided to start Reno Motorcycle Riders Group because I wanted to be part of a group of people who enjoyed my passion... I was excited and nervous. Our group has grown by leaps and bounds. I never thought it would be this big.

Henry, started Reno Motorcycle Riders

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy