addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1linklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

R in Big-Data Environments

  • Feb 16, 2012 · 6:30 PM

Our next meeting will be held at Ebay on Thursday February 16. The featured speaker will be Nachum Shacham, Data Scientist at Ebay.

Once again, thank you O'Reilly for sponsoring ouy meeting!

R in Big-Data Environments

Modern massively parallel processing (MPP) platforms like Hadoop and RDBMS, hold Petabytes of structured and semi-structured records that are readily available for analysis. Running R on such platforms enables the analysis of much larger dataset than is possible through the traditional single-server, RAM-based operation.   We'll describe our experience in using R to process big data on Teradata Enterprise Data Warehouse and on a large Hadoop cluster. We'll review the challenges of running R in conjunction with these  platforms and describe methods for accomplishing this task. The teradataR package that will be described, enables a PC-based R to use the warehouse resource to run statistical functions, e.g., regression, correlation, and parametetric and nonparametric tests on the warehouse-resident tables, while communicating only the much reduced datasets containing the final results. We'll also review our experience in using R to process unstructured log data as well as tabular data on a 1000-node Hadoop cluster. Finally, we'll review a case study, implemented in R, of comparative cost-performance analysis Teradata warehouse and Hadoop.


Join or login to comment.

  • Matt D.

    Great, it would be very nice to have a copy of the presentation for future reference about Teradata.

    March 15, 2012

  • Joseph R.

    Congratulations to Danielle Zhu our BARUG raffle winner. Danielle won a pass to next week's Strata Conference, and thanks again to O'Reilly for sponsoring both the January and February meetings.

    February 21, 2012

  • A former member
    A former member

    Possibly more suited to a Teradata user conference.

    February 17, 2012

  • Robert B.

    Started out good, but then got bogged down in what was for me useless financial analysis of cost comparisons. Would like to have heard more about technical aproaches of R and Hadoop.

    February 17, 2012

  • Gary M.

    Fantastic presentation & nice facility - two more reasons to love EBay.

    February 17, 2012

  • Vlad S.

    Really liked the presentation!
    But:
    1. Was held like 5-10 mins at the lobby because by badge was missing.
    2. Pizza was finished already when I finally got in :).

    February 17, 2012

  • Chris S.

    Speaker and presentation were excellent!

    February 17, 2012

  • Sanjay G.

    Excellent talk on the infrastructure for R - comparing Hadoop and Teradata. Thank you.

    February 17, 2012

  • Stephen

    Last-minute carpool from Palo Alto/Mtn View (my RSVP just opened up)?

    February 16, 2012

  • Justin K.

    If anyone would like to carpool down the peninsula, I'm driving my car from san bruno at 5:30pm and can pick someone at e.g. the san bruno bart station. Let me know.

    February 16, 2012

  • Blake I.

    Thanks Joseph. I am definitely also interested in a webinar or slides in that case.

    February 15, 2012

  • Joseph R.

    Sorry no, not for this event. If you are not listed as going, it is unlikely that there will be room for you. We have already factored in a certain percentage of no shows, and we do not want to be in the position of exceeding room capacity and having to turn people away.

    February 15, 2012

  • Blake I.

    Is it possible to attend even if we are on the waiting list? There may be no-shows that open spots. Thanks.

    February 15, 2012

  • Chris S.

    Hi Mikhail,
    You should go to the main entrance by the flag pole (Community building). The room will be in that building but you likely need to check in first.

    February 15, 2012

  • Mikhail K.

    As far as I know, eBay campus on Hamilton avenue is rather big.
    Can you please give more specific location?

    February 15, 2012

  • A former member
    A former member

    I would like a webcast as well. I won't be able to leave work early enough to make it in time.

    February 15, 2012

  • Gautam S.

    Is there a possibility of a Webcast/recording? The waiting list is too long.

    February 15, 2012

  • A former member
    A former member

    Is there a list for car pooling from Berkeley/Oakland/San Francisco ??

    February 15, 2012

Our Sponsors

  • RStudio

    Financial support for meetings

  • O'Reilly

    Our March 29th BARUG meeting onsite at Strata Hadoop World

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy