Big Data Bristol Evening Session (July 2018) **Updated**

Details

Speakers: Luke Merrett and Tim van Wijk

'Million Song Dataset – SQL Server vs Apache Hadoop Challenge'

In the red corner we have Luke, loading & querying a million songs using a traditional ETL approach with SQL Server.

Facing him in the blue corner is Tim, performing the same process in a distributed manner using Apache Hadoop & Hive.

This talk will explore the advantages and drawbacks of these approaches, both for importing data and when querying it, to see how each technology performs against a real data set.

For more info on the Million Song Dataset, please visit: https://labrosa.ee.columbia.edu/millionsong/

Bios:

Luke has 7 years’ experience working with everything from SQL Server to RavenDB, Azure Table Storage to AWS DynamoDB & Neo4J to Redis. He has seen graph databases perform magic, caching services slow down servers & SQL stretched to the edge of sanity.

He is currently a Senior Engineer at Just Eat & the co-organiser of both the SQL Bristol & F# |> Bristol meetups. For more information see: http://lukemerrett.com/

Tim has over 15 years’ experience working with high performance NoSQL data systems such as RRDTool, Redis and Riak. He uses C++ and C# to arrange data effectively and efficiently.

He is currently a Senior Engineer at Just Eat and co-organiser of the Big Data Bristol Meetup. He entertains himself with DIY and computer games.

Big Data Bristol
JUST EAT
2nd Floor Broad Quay House, Prince Street, BS1 4DJ · Bristol