Big Data Bristol Evening Session (July 2018) **Updated**

Details
Speakers: Luke Merrett and Tim van Wijk
'Million Song Dataset – SQL Server vs Apache Hadoop Challenge'
In the red corner we have Luke; loading & querying a million songs using a traditional ETL approach with SQL Server.
Facing him in the blue corner is Tim; performing the same process but in a distributed manner using Apache Hadoop & Hive.
This talk will explore the advantages and drawbacks of these approaches, both for importing data and when querying it, to see how each technology performs against a real data set.
For more info on the Million Song Data Set please visit this site: https://labrosa.ee.columbia.edu/millionsong/
Bios:
Luke has 7 years’ experience working with everything from SQL Server to RavenDB, Azure Table Storage to AWS DynamoDB & Neo4J to Redis. He has seen graph databases perform magic, caching services slowing down servers & SQL stretched to the edge of sanity.
Currently a Senior Engineer at Just Eat & the co-organiser of both the SQL Bristol & F# |> Bristol meetups. For more information see: http://lukemerrett.com/
Tim has over 15 years’ experience working with high performance NoSQL data systems such as RRDTool, Redis and Riak. He uses C++ and C# to arrange data effectively and efficiently.
He is currently a Senior Engineer at Just Eat and co-organiser of the Big Data Bristol Meetup. He entertains himself with DIY and computer games.

Big Data Bristol Evening Session (July 2018) **Updated**