Please join us for an evening of talking about big open data!
There will be two excellent presentations describing projects done with Common Crawl data. As always, there will be lots of smart, interesting people in attendance and ample opportunity to talk with them. After the presentations (at 8pm) we will adjourn to a nearby bar to continue the conversations.
Wednesday July 23rd
22 Battery Street, San Francisco
Speaker: Stephen Merity
Have you ever been curious as to how widely Google Analytics is used across the web? Stop pondering, start coding! Stephen will discuss how he used the Common Crawl dataset to perform wide scale analysis over billions of web pages and what this means for privacy on the web at large.
Speaker: Oskar Singer
Performing text analytics and NLP on Twitter data can be a challenge because of the frequent disregard for standard spelling and semantic difference between homophones (e.g. two, too, to). In this presentation Oskar will discuss his experience addressing this challenge and the creative solution that he developed with his colleague at Lexalytics.
Thanks to RiskIQ for hosting the event! RiskIQ is a super cool security company - learn more about them here: http://www.riskiq.com/company/about-us
Thanks to O'Reilly for sponsoring the meeting! They will be providing delicious food, awesome O'Reilly books for a raffle, and have generously provided a discount code for the Strata conference. Strata is *the* conference for big data and you should definitely attend. Click here for more information and a discount code: http://oreil.ly/UGSHW14