Skip to content

Hadoop Summit Night

Photo of Uli Bethke
Hosted By
Uli B.
Hadoop Summit Night

Details

HUG Ireland is pleased to announce our Hadoop Summit Night event on 12 April at Mercantile Hotel, 28 Dame Street. We have some high calibre speakers from the US and France. This is your chance to meet some of the architects and evangelists of some of the Apache tools you are working with. The full agenda is a follows:

AGENDA

6:30 - 6:45 - Socialise & Network over beer(s)

6:45 - 10:30 - Talks with break (talks will be 30 minutes)

TALKS

Talk #1: Angling For Insights at Big Fish Games, David Darden (https://www.linkedin.com/in/daviddarden?authType=NAME_SEARCH&authToken=1pIY&locale=en_US&srchid=2114422741459245272924&srchindex=1&srchtotal=56&trk=vsrp_people_res_name&trkInfo=VSRPsearchId%3A2114422741459245272924%2CVSRPtargetId%3A17435917%2CVSRPcmpt%3Aprimary%2CVSRPnm%3Atrue%2CauthType%3ANAME_SEARCH), Director of BI Engineering at Big Fish Games (http://www.bigfishgames.com/?channel=sem&identifier=google_enw_s&v1=82174609519&v2=big%20fish%20games&v3=_e&v4=339843079&v5=21848664319&v6=_g&v7=&v8=_c&gclid=Cj0KEQjwz-i3BRDtn53Z5Z7t4PUBEiQA23q2AO_8ywGyrB9aZzB4ZPy4pT_OS9EAJSAYsg-7o88saXIaAvgl8P8HAQ)

Talk #2: Dataiku Data Science Studio for faster and smarter car parking, Vincent de Stoecklin (https://www.linkedin.com/in/vincentdestoecklin?authType=NAME_SEARCH&authToken=n5As&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A75956606%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245357984%2Ctas%3AVincent%20de%20Stoecklin) at Dataiku (http://www.dataiku.com/)

Talk #3: Apache Arrow: A New Era of Columnar In-Memory Analytics, Tomer Shiran (https://www.linkedin.com/in/tshiran?authType=NAME_SEARCH&authToken=6w1y&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A4261256%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245454399%2Ctas%3ATomer%20Shiran), Co-Founder and CEO at Dremio (http://www.dremio.com/)

Talk #4: Apache Flink 1.0: Real-World Use Cases, Slim Baltagi (https://www.linkedin.com/in/slimbaltagi?authType=NAME_SEARCH&authToken=9FTi&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A21979395%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245532636%2Ctas%3ASlim%20Baltagi), Director Big Data Engineering at Capital One (https://www.capitalone.com/)

Talk #5: Drilling into Data with Apache Drill, Tug Grall (https://www.linkedin.com/in/tugdualgrall), Technical Evangelist at MapR (https://www.mapr.com/)

ABSTRACTS

Talk #1: Angling For Insights at Big Fish Games. David Darden (https://www.linkedin.com/in/daviddarden?authType=NAME_SEARCH&authToken=1pIY&locale=en_US&srchid=2114422741459245272924&srchindex=1&srchtotal=56&trk=vsrp_people_res_name&trkInfo=VSRPsearchId%3A2114422741459245272924%2CVSRPtargetId%3A17435917%2CVSRPcmpt%3Aprimary%2CVSRPnm%3Atrue%2CauthType%3ANAME_SEARCH), Director of BI Engineering, Big Fish Games

Data is eating the world… but what does that mean for a company that is data driven, can't afford an army of developers and PhD's, and where (your) data isn't actually the product? We'll talk about how we've used data to help make our company more successful (and maybe some missteps along the way). We'll start with an overview of the Big Fish Games approach to building a Logical Data Warehouse (which includes Netezza, Hadoop, Tableau, and various other technologies), organizational approach, and how we use data and technology to make an impact on the business. We'll then move in to a 'choose your own adventure' style presentation where we'll dive in to areas the audience is interested in (aww… you were probably hoping for a bunch of slides, weren't you?). Topics may include what we use for query federation, the different types of data we capture, how we approach 'BI as a service' internally, integrating data across multiple platforms, and our approach to democratizing data across the organization. Feel free to post anything you'd like to see as a topic in the comments section.

Talk #2: Dataiku Data Science Studio for faster and smarter car parking, Vincent de Stoecklin (https://www.linkedin.com/in/vincentdestoecklin?authType=NAME_SEARCH&authToken=n5As&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A75956606%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245357984%2Ctas%3AVincent%20de%20Stoecklin) at Dataiku

Dataiku will present a client use case detailing all the steps from raw data to innovative data product. By working with DSS, Parkeon (leader of parking payment solutions) developed Path to Park, combining internal data assets with external data sources to create a predictive application that helps city-dwellers find parking spots faster.

  • initial challenge and existing data assets

  • end-to-end data science project - from design to production

  • tools, people and processes - moving toward a data driven organisation

Talk #3: Apache Arrow: A New Era of Columnar In-Memory Analytics, Tomer Shiran (https://www.linkedin.com/in/tshiran?authType=NAME_SEARCH&authToken=6w1y&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A4261256%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245454399%2Ctas%3ATomer%20Shiran), Co-Founder and CEO at Dremio

Apache Arrow introduces a new era of columnar in-memory analytics. Arrow can accelerate Big Data analysis by 10-100x, while also enabling users to combine multiple systems and programming languages without the traditional serialization/deserialization overhead. Over a dozen open source projects are already on board, including Drill, Ibis, Impala, Kudu, Pandas, Parquet and Spark. In this talk we'll provide an overview of Apache Arrow, including both the technology and several example applications.

Talk #4: Apache Flink 1.0: Real-World Use Cases, Slim Baltagi, Director Big Data Engineering at Capital One

This talk explains how Apache Flink 1.0 announced on March 8th, 2016 by the Apache Software Foundation, marks a new era of Big Data analytics and in particular Real-Time streaming analytics. The talk will map Flink's capabilities to real-world use cases that span multiples verticals such as: Financial Services, Healthcare, Advertisement, Oil and Gas, Retail and Telecommunications.

In this talk, you learn more about:

  1. What is Apache Flink Stack?

  2. The movement from Batch Analytics to Streaming Analytics

  3. Key Differentiators of Apache Flink for Streaming Analytics

  4. Real-World Use Cases with Flink for Streaming Analytics

  5. Who is using Flink?

  6. Where do you go from here?

Talk #5

Drilling into Data with Apache Drill, Tug Grall (https://www.linkedin.com/in/tugdualgrall), Technical Evangelist at MapR

Apache Drill is a next-generation SQL engine for Hadoop and NoSQL. Its unique schema-free approach enables self-service data exploration with the agility that organizations need in this new era of rapidly growing and evolving data.

In this talk, based on demonstrations, you will understand the key features and architecture of Apache Drill. You will also see how to get started with Drill; and start query, using SQL, various data sources such as Hive, Parquet, Avro, and NoSQL (HBase, MapR-DB, Mongo and more) but also more complex data structure stored in JSON documents.

You will learn also how you can extend Drill to create user defined function that brings your SQL on Everything to the next level.

SPEAKER BIOS

Speaker No.1: David Darden (https://www.linkedin.com/in/daviddarden?authType=NAME_SEARCH&authToken=1pIY&locale=en_US&srchid=2114422741459245272924&srchindex=1&srchtotal=56&trk=vsrp_people_res_name&trkInfo=VSRPsearchId%3A2114422741459245272924%2CVSRPtargetId%3A17435917%2CVSRPcmpt%3Aprimary%2CVSRPnm%3Atrue%2CauthType%3ANAME_SEARCH), Director of BI Engineering at Big Fish Games:

David has 10 years+ of Technology experience with companies like CTS, Mariner and Microsoft before joining Big Fish Games in 2010 as the BI Manger before becoming the BI Engineering Director in 2015.

Speaker No.2: Vincent de Stoecklin (https://www.linkedin.com/in/vincentdestoecklin?authType=NAME_SEARCH&authToken=n5As&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A75956606%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245357984%2Ctas%3AVincent%20de%20Stoecklin), Technical Partnership and Business Development Specialist at Dataiku:

Vincent is a Big Data Speaker and Business Development expert in big data for some years now. His prior telecoms experience has made his transition into Big Data an effective one where he now speaks with passion about Big Data along with developing Dataiku's business profile in the marketplace.

Speaker No.3: Tomer Shiran (https://www.linkedin.com/in/tshiran?authType=NAME_SEARCH&authToken=6w1y&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A4261256%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245454399%2Ctas%3ATomer%20Shiran), Co-Founder and CEO at Dremio:

Tomer is a highly successful tech entrepreneur and big data professional with over 10+ years in the big data industry and was also a Technical Researcher with HP Labs/Carnagie Mellon University and a Software Developer with Microsoft.

Speaker No.4: Slim Baltagi (https://www.linkedin.com/in/slimbaltagi?authType=NAME_SEARCH&authToken=9FTi&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A21979395%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1459245532636%2Ctas%3ASlim%20Baltagi), Director Big Data Engineering at Capital One:
Slim is currently a director of Big Data engineering at Capital One in Chicago with over 5 years of Big Data experience working on over a dozen Big Data projects. He is internationally recognized as a Thought Leader in the Big Data space and known for public presentations and lectures at conferences and universities. He is evangelizer of Apache Flink by running the New York City, Chicago, Washington DC, Dallas/Fort Worth and Seattle Apache Flink meetups and co-organizing the Paris, Madrid, Sao Paulo and Boston Flink meetups. He is authoring the book ‘Flink in Action’ to be published by Manning and will give at talk at the Hadoop Summit in Dublin on April 13, 2-016 titled 'Overview of Apache Flink: the 4G of Big Data Analytics Frameworks’ http://hadoopsummit.org/dublin/agenda/

Speaker No.5: Tug Grall (https://www.linkedin.com/in/tugdualgrall), Technical Evangelist at MapR:

Tug is a Technical Evangelist and an accomplished Big Data Professional with over 15 years of experience in the technology sector including companies like Couchbase, Oracle and MongoDB.

Photo of Data Engineering and Data Architecture Group (DEDAG) group
Data Engineering and Data Architecture Group (DEDAG)
See more events