addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Big Data Gods Message Board › Next Week: Paco Nathan's Hand-on Introduction to Data Science

Next Week: Paco Nathan's Hand-on Introduction to Data Science

Lynn B.
San Francisco, CA
Post #: 2
Fellow Data Geeks,

Paco Nathan will be teaching his Hands-on Intro to Data Science in Santa Clara Next Tuesday the 16th.
Details and RSVP at:


This class has sold out in every city that it has been offered, and because of Paco's travel schedule, this may be the only time that the class will be available in the Bay Area this year.

Course Description

Big Data, Data Science, Cloud Computing... Lots of exciting stuff, lots of media buzz, lots of confusing descriptions. For a programmer armed with a laptop and some knowledge of Bash, Python, Java – what is a good way to begin working with these new tools for handling large-scale unstructured data?
In addition to examining “How” things work, we will take a detailed look at “Why” did MapReduce emerge this way – what factors lead to the popular frameworks and what typical issues confront large-scale deployments – so that each student is prepared to make ongoing assessments and learnings as the field continues to grow and evolve.

* data science history, with video clips from primary sources
* survey of Big Data frameworks (gentle intro to using CAP theorem to categorize)
* intro RStudio, simple data visualization in R
* Hadoop streaming in Python
* Cascading intro
* (for those advanced) explore a little Cascalog or Scalding).

Speaker Bio:
Paco Nathan @pacoid is currently the Director of Data Science at Concurrent in SF, and a committer on the Cascading open source project. For over ten years, A 25 year veteran of the tech industry, for the last ten years Paco has led Data teams. Paco has a background in math/stats and distributed computing, and expertise in Hadoop, R, AWS, predictive analytics, machine learning, and NLP. Paco is author of the upcoming O'Reilly book: Enterprise Data Workflows with Cascading.Paco's Wikipedia Page Paco on Twitter, Linkedin, Slideshare, Github

Details and RSVP at:


-Lynn Bender
Powered by mvnForum

Our Sponsors

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy