Skip to content

Productionizing data science at scale (Strata x Hadoop SG Community Event)

Photo of Kai Xin Thia
Hosted By
Kai Xin T. and 10 others
Productionizing data science at scale (Strata x Hadoop SG Community Event)

Details

http://photos1.meetupstatic.com/photos/event/7/1/9/0/600_444089072.jpeg

Coming December, we are proud to close the year with our biggest meetup yet - at Strata x Hadoop SG! Free event, open to all (no need for strata x hadoop tickets). For folks who wish to attend the full Strata x Hadoop conference, feel free to use our community's discount code UGDSSG for a 20% discount on the tickets (https://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/register). You can also join our facebook group (https://www.facebook.com/groups/dataScienceSG/) for data related geek discussions.

Agenda

• 5.30pm-545pm: Networking session.

• 545pm-6.00pm: Introduction to Druid by Fangjin Yang (Imply (http://imply.io/)).

• 6.00pm-6.15pm: OA Labs Deep Platform Capabilities by Scott Edington (OA Labs (http://oa-labs.com/)).

• 6.15pm-6.30pm: Lightning talk by Arshak Navruzyan (Argyle Data (https://www.argyledata.com/), Startup.ml (http://startup.ml/)).

• 6.30pm-6.45pm: Deep learning applications by Adam Gibson. (Skymind.io (http://www.skymind.io/), author of O'Reilly's Deep Learning book (http://shop.oreilly.com/product/0636920035343.do))

• 6.45pm - 8pm: Panel discussion - Productionizing data science at scale with Albert Bifet (Institut Mines-Télécom (http://www.mines-telecom.fr/)), Shirshanka Das (Linkedin (http://linkedin.com)), Wes McKinney (Cloudera (http://www.cloudera.com)) and Jennifer Marsman (Microsoft (http://www.Microsoft.com)).

• 8pm - 830pm: Networking session

Profile of speakers:

Panelist

• Albert Bifet (http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/speaker/203040) is a big data scientist with 10+ years of international experience in research and in leading new open source software projects for business analytics, data mining, and machine learning (Institut Mines-Télécom (http://www.mines-telecom.fr/), Huawei, Yahoo, University of Waikato, UPC). He obtained a Ph.D. from UPC-BarcelonaTech. Albert has worked in Hong Kong, New Zealand, and Europe. At Yahoo Labs, he co-founded Apache SAMOA (Scalable Advanced Massive Online Analysis) in 2013. Apache SAMOAis a distributed streaming machine learning (ML) framework that contains a programing abstraction for distributed streaming ML algorithms. At the WEKA Machine Learning group, he has co-led MOA (Massive Online Analysis) since 2008.

• Shirshanka Das (http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/speaker/113861) is the architect for LinkedIn’s Data Analytics Infrastructure team. He was among the original authors of a variety of open and closed source projects built at LinkedIn, including Databus, Espresso, and Apache Helix. His current focus at LinkedIn includes all things Hadoop, high-performance distributed OLAP engines, large-scale data ingestion, transformation and movement, and data lineage and discovery.

• Wes McKinney (http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/speaker/193580) is a software engineer at Cloudera and lead developer of Ibis. He is the creator of Python’s pandas library and is the author of Python for Data Analysis. Previously, Wes was the founder and CEO of DataPad.

• Jennifer Marsman (http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/speaker/211768) is a principal developer evangelist in Microsoft’s Developer and Platform Evangelism group, where she educates developers on Microsoft’s new technologies. In this role, Jennifer is a frequent speaker at software development conferences across the United States. In 2009, Jennifer was chosen as “Techie whose innovation will have the biggest impact” by X-OLOGY for her work with GiveCamps, a weekend-long event where developers code for charity. Prior to becoming a developer evangelist, Jennifer was a software developer in Microsoft’s Natural Interactive Services division. In this role, she earned two patents for her work in search and data mining algorithms.

Tech Startup Lightning Talks:

Fangjin Yang (http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/speaker/153565) is a co-author of the open source Druid project and a co-founder of Imply, a data analytics startup based in San Francisco. Previously, he held senior engineering positions at Metamarkets and Cisco Systems. Fangjin holds a BASc in Electrical Engineering and a MASc in Computer Engineering from the University of Waterloo, Canada.

Arshak Navruzyan (http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/speaker/117200) has served in product management and engineering leadership roles at Argyle Data, Alpine Data Labs, Endeca, and Oracle. He is a contributor to the Apache Accumulo project and the organizer of the San Francisco Machine Learning Meetup group. Arshak’s objective is to make machine learning accessible to any organization or individual that wants to transform the world through data. With this aim, he cofounded Startup.ML.

Adam Gibson (https://www.linkedin.com/profile/view?id=AAkAAAlybqcBzFg_BmMEpGhwLGTlfqiT5Px0NMI) is founder of Skymind.io, who provides deep learning as a service. Adam and his team are the proud creators of the world’s first open-source, distributed, commercial-grade deep-learning framework: Deeplearning4j.org (http://deeplearning4j.org/). They also built ND4J (http://nd4j.org/), a scientific computing library for the JVM. Adam is also the co-author of "Deep Learning - A Practitioner's Approach (http://shop.oreilly.com/product/0636920035343.do)" to be released on O'Reilly Media.

Dr. Scott Edington (http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/speaker/227845)provides executive leadership and oversight for OA's Consulting practice and OA Labs. Edington's career spans over two decades of creating next generation technology capabilities in the Payments, Defense, and Intelligence sectors. As the Global Head of Visa Labs & Innovation Enablement, Edington held global responsibility for Research & Development and directed Visa's Open Innovation initiatives frequently partnering with academia, research organizations, technology incubators, and the start-up community.

Photo of DataScience SG group
DataScience SG
See more events
Suntec Singapore Convention & Exhibition Centre Room 321 - 322 (Level 3)
1 Raffles Boulevard, Suntec City, Singapore 03959, Level 3 · Singapore