Analysing Enron’s emails
When Enron collapsed, investigators acquired a database of 600,000 emails known as the Enron Corpus (https://en.wikipedia.org/wiki/Enron_Corpus), which has since entered the public domain. This month we'll be learning three different approaches to analysing such a huge dataset.
This session is being led by data scientists Luuk Derksen and Denise Xifara, who both work at tech education startup Decoded (https://decoded.com/en-gb/). They'll be showing how to analyse the Enron Corpus using MySQL, Elasticsearch and Neo4j.
Make sure to bring a laptop along (Mac or Linux machines are preferable, but Windows is OK too). In now-typical Journocoders style, this will be a practical, hands-on workshop. No programming experience is required, but it would be useful to learn the basics of the command line beforehand. Codecademy can help with that (https://www.codecademy.com/learn/learn-the-command-line).
Oh, and make sure to join the collaborative hackpad (https://journocoders.hackpad.com/Journocoders-May-2016-y95YnOpigdK) for the event!
Schedule:
7.00: Doors open
7.30: Show & tell
7.40: Tutorial
9.00: The pub!
