addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupsimageimagesinstagramlinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1outlookpersonJoin Group on CardStartprice-ribbonImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruseryahoo

Data Science at X

For the next meetup, I'd like to have a few people presenting what data science looks like at their company/lab. The format could be to have short (5-min) presentations about:

- what data is being collected and how the data pipeline looks like

- what questions are being asked about the data

- how insights found in the data are reported.

followed by a panel on the topic. I'd like the focus to be on *tools* utilized.

We don't  have a venue  (ideas welcome!!) nor a set time yet (the 14/4 is just a proposal,  any of the following days in the neighborhood of mid-April would work as well).

To alleviate the angst of waiting for mid-April, you can check out the fine data-related events put together by the R Users Group. The next one, happening on Thursday April 03 at 6:00pm, will be about "Productivity Tips and Tricks in R."

Join or login to comment.

  • John M.


    May 27, 2014

  • Nam L.

    Here's the link to download my PPT deck "Analytics at"

    May 26, 2014

  • John M.

    You can download my talk on Compound Library Design from

    April 22, 2014

  • krmartin

    Thanks to all presenters and to Luca for organizing.

    April 17, 2014

  • John M.

    Good talks (I will endeavor to shorten my lightening talks in the future :*)

    April 16, 2014

  • Ben B.

    Thanks Luca and all for organizing and presenting! Here's my talk from tonight:

    data science for the ocean health index

    1. ocean health index
    2. data flow
    3. data wrangling
    4. distributed development

    PS Here's the raw R presentation in markdown using RStudio

    1 · April 15, 2014

  • Luca F.

    Ok, so, the 15th it is. I also moved the time to 6:30 to accommodate the needs of people coming from out of town. Right now we have:

    - Nam Le - FindTheBest (hint: it's on the next line)
    - Ben Best - (NCEAS, Duke)
    - Frank Davenport (Climate Hazards Group UCSB)
    - Jon Cotter - Sonos
    - John MacCuish - Mesa Analytics
    - Luca Foschini - The Activity Exchange

    Let me know if I missed anybody. I hope It'll be a lively discussion!

    I encourage to make every deliverable (presentations, toy examples, analysis) shared and reproducible, thanks!


    April 11, 2014

  • Ben B.

    Yes, 15th works for me and happy to contribute however needed.

    April 11, 2014

  • Nam L.

    15th works for me as well. I can contribute for food cost.

    April 10, 2014

  • Frank

    15th works for me. I can send contribute for food as well.

    April 10, 2014

  • John M.

    I can still chip in :*) 15th is ok by me... I got my taxes in :*)

    April 10, 2014

  • John M.

    Any luck on a venue yet?

    April 7, 2014

    • Luca F.

      Thanks guys! So we might have found a venue (thanks Garrett for the tip!)

      The upstairs party room at the downtown Rusty's on Carrillo doesn't cost anything for a 3h period, but requires a total food purchase for at least $50 (regardless of the number of people joining).
      That sounds like a good deal to me, since I would have provided some pizza anyway.

      Unfortunately, the room is booked for the 14th, but free on the 15th from 6:30pm to 9:30pm. Would that be okay if we move the date to Tuesday April 15th?

      April 10, 2014

    • Garrett G.

      The 15th works for me. I can't remember what the walls in that room look like, but they might not be suitable to project on. We should check to see if we need to bring a screen.

      April 10, 2014

  • Luca F.


    No luck about the venue yet. (Please, send some love!)

    Also, it feels like many are on the fence of presenting but have gotten cold feet: It's okay if your data pipeline is not quite there.
    We're all figuring it out as we go.
    It's also not necessary to expose the *nature* of the data you're handling if that's confidential: knowing a few descriptive features (size, refresh rate, format) is enough to steer the discussion about tools and possible architectural choices.

    So fare I have penciled in: - Nam Le - FindTheBest
    - Jon Cotter - Sonos
    - John MacCuish - Mesa Analytics
    - Luca Foschini - The Activity Exchange

    Just a few more people and it should be a lively panel!


    March 27, 2014

  • John M.

    Happy to show the pipeline, the tools, highlight some code (tr, sed, grep -v :*) kidding), and the science.
    Any luck on the venue?

    March 25, 2014

  • Luca F.

    Hi John,

    It would be great to hear more about it. How complex data such as the one representing drug compound is gathered, indexed, and stored is definitely a fascinating topic on which I have little clue about!
    As usual, worked examples are gold: a few lines of code are usually the best explanation one can give.

    1 · March 17, 2014

  • John M.

    Anyone want to know about the data science of diversity-based, and target-based drug compound library design at our company?

    3 · March 17, 2014

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy