Skip to content

Big Data, AWS & the Data Pipeline. Distributed MPP & Analytics with HPCC

Photo of John Mulhall
Hosted By
John M. and Uli B.
Big Data, AWS & the Data Pipeline.  Distributed MPP & Analytics with HPCC

Details

HUG Ireland is pleased to announce our March event at Bank of Ireland (@boistartups), Grand Canal Square, Dublin 2. The event is two fold around the data pipeline in a use case exploration along with an innovative approach to visual big data platforms via an open source mpp platform called HPCC. The full agenda is as follows:

Big Data, AWS and the Data Pipeline by Martin Peters (https://www.linkedin.com/in/martinbpeters?authType=NAME_SEARCH&authToken=gsmT&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A30184555%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1457348295199%2Ctas%3AMartin%20Peters), BI Manager with DoneDeal (https://www.donedeal.ie/) and Nigel Creighton (https://www.linkedin.com/in/nigelcreighton?authType=NAME_SEARCH&authToken=pIKH&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A13905280%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1456839653222%2Ctas%3A%20Nigel%20Creighton), CTO with DNM (http://www.dnmgroup.com/)

In the session, Richard will cover what Big Data means in AWS featuring a substantial e-Commerce customer. He shall explore how they have used big data technologies for machine learning and predictive analytics to work out items such as time to sell and value. Nigel shall also explore the underlying use case architecture and the data pipeline in the following areas: - Redshift as a data warehouse: - ingesting data streams using Kinesis: - Auto-scaling resources to respond to peaks and troughs of data streaming event volumes: -Datalakes using S3: -EMR and data pipeline for data processing without persistent server resources: - Map Reduce using EMR for data transformation, and aggregation: - loading data to Redshift: - Spice and QuickSight.

Distributed Computing, MPP and Analysis with HPCC Systems by Ignacio Calvo (https://www.linkedin.com/in/ignaciocalvofernandez?authType=NAME_SEARCH&authToken=SDYn&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A32068724%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1455191918156%2Ctas%3Aignacio), Senior Software Engineer with LexisNexis Risk Solutions (https://www.lexisnexis.com/risk/uk/)

LexisNexis HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for Big Data processing and analytics. Ignacio will cover the following on HPCC Systems: Architecture, Use cases, Integration and comparison with other systems.

We hope you can join us on the event with Bank of Ireland @boistartups kindly sponsoring our March event. As always, the event hashtag is #HUGIreland which is a great way to connect with your fellow (technical) big data peers before, through and after the event.

Photo of Data Engineering and Data Architecture Group (DEDAG) group
Data Engineering and Data Architecture Group (DEDAG)
See more events