addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Science!


5:30-6:30pm Pizza and Networking
6:30-8:00pm Talk and Q&A
8:00-8:30pm Wind down 

As the need for data science as a key differentiator grows in all industries, from large corporations to startups, the need to get to results quickly is enabled by sharing ideas and methods in the community. The data science team at Pivotal leverages and contributes to this community of publicly available and open source technologies as part of their practice. We will share the resources we use by highlighting specific toolkits for building models (e.g. MADlib, R) and visualization (e.g. Gephi and Circos) along with their benefits and limitations by sharing examples from Pivotal's data science engagements. At the end of this session we hope to have answered the questions: Where can I get started with Data Science? Which toolkit is most appropriate for building a model with my dataset? How can I visualize my results to have the greatest impact?

Bio: Sarah Aerni is a member of the Pivotal Data Science team with a focus on healthcare and life science. She has a background in the field of Bioinformatics, developing tools to help biomedical researchers understand their data. She holds a B.S. In Biology with a specialization in Bioinformatics and minor in French Literature from UCSD, and an M.S. and Ph.D in Biomedical Informatics from Stanford University. During her time as a researcher she focused on the interface between machine learning and biology, building computational models enabling research for a broad range of fields in biomedicine. She also co-founded a start-up providing informatics services to researchers and small companies. At Pivotal she works with customers in life science and healthcare building models to derive insight and business value from their data.

Join or login to comment.

  • Tamao N.

    Here are Sarah's slides from the talk. Video coming soon!

    February 14, 2014

  • Tamao N.

    coming soon! :)

    January 30, 2014

  • Sarah

    Have used these packages and more! In fact, scipy is a requirement for many other packages that we use.

    Maybe we can connect at another time if you are unable to make the MeetUp. We should have some follow-on content we can share.

    January 21, 2014

  • Katherine

    Hi Sarah,

    Curious about your thoughts on the developing python tools for science:

    ...not trying to sell Python, just useful to find a person using other languages / frameworks to give context, as I'm trying to organize a student workshop on computing tools for scientists in April.

    Not sure I can attend on 1/28, but will try.


    January 10, 2014

Our Sponsors

  • eBay

    eBay sponsors venues, and food.

  • Pivotal

    Pivotal has sponsored our venues, food, beverages, snacks, and video.

  • Twitter

    Twitter sponsors food and conducts hands-on sessions.

  • Uber

    Uber has provided venues, snacks and video

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy