Paul Codding from Hortonworks will be with us to talk about all things Hadoop.
The talk and demo will walk through how to apply the different components in HDP to deal with streaming web log data. The outline of what we’ll see is stock Apache access log data being ingested into Kafka. Storm to parse and route that data to HDFS for raw storage, HBase for low-latency retrieval, and Solr for search and visualization. We’ll take a look at how we can apply simple rules in real-time to the logs, and how we can visualize the results using a custom web UI as well as Banana for OOTB dash-boarding of events.