Past Meetup

Big Data, AWS & the Data Pipeline. Distributed MPP & Analytics with HPCC

This Meetup is past

132 people went

Bank of Ireland

1 Grand Canal Square · Dublin 2

How to find us

The BOI venue is beside the Bord Gais Energy theatre in Grand Canal Square just off Pearse St by the bridge. The No1, 56a and 77a bus will take you there as will the dart to Grand Canal Docks (left down Barrow St, left up Pearse St to bridge)

Location image of event venue

Details

HUG Ireland is pleased to announce our March event at Bank of Ireland (@boistartups), Grand Canal Square, Dublin 2. The event is two fold around the data pipeline in a use case exploration along with an innovative approach to visual big data platforms via an open source mpp platform called HPCC. The full agenda is as follows:

Big Data, AWS and the Data Pipeline by Martin Peters (https://www.linkedin.com/in/martinbpeters?authType=NAME_SEARCH&authToken=gsmT&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A30184555%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1457348295199%2Ctas%3AMartin%20Peters), BI Manager with DoneDeal (https://www.donedeal.ie/) and Nigel Creighton (https://www.linkedin.com/in/nigelcreighton?authType=NAME_SEARCH&authToken=pIKH&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A13905280%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1456839653222%2Ctas%3A%20Nigel%20Creighton), CTO with DNM (http://www.dnmgroup.com/)

In the session, Richard will cover what Big Data means in AWS featuring a substantial e-Commerce customer. He shall explore how they have used big data technologies for machine learning and predictive analytics to work out items such as time to sell and value. Nigel shall also explore the underlying use case architecture and the data pipeline in the following areas: - Redshift as a data warehouse: - ingesting data streams using Kinesis: - Auto-scaling resources to respond to peaks and troughs of data streaming event volumes: -Datalakes using S3: -EMR and data pipeline for data processing without persistent server resources: - Map Reduce using EMR for data transformation, and aggregation: - loading data to Redshift: - Spice and QuickSight.

Distributed Computing, MPP and Analysis with HPCC Systems by Ignacio Calvo (https://www.linkedin.com/in/ignaciocalvofernandez?authType=NAME_SEARCH&authToken=SDYn&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A32068724%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1455191918156%2Ctas%3Aignacio), Senior Software Engineer with LexisNexis Risk Solutions (https://www.lexisnexis.com/risk/uk/)

LexisNexis HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for Big Data processing and analytics. Ignacio will cover the following on HPCC Systems: Architecture, Use cases, Integration and comparison with other systems.

We hope you can join us on the event with Bank of Ireland @boistartups kindly sponsoring our March event. As always, the event hashtag is #HUGIreland which is a great way to connect with your fellow (technical) big data peers before, through and after the event.