SolrCloud, Solr + Hadoop 2 & Nutch Integration


Details
[EDIT: Agenda Added]
Agenda:
- Meet Solr for the first time again - Varun Thacker, LucidWorks
Anyone who has tried integrating search in their application knows how good and powerful Solr is but always wished it was simpler to get started and simpler to take it to production. This session will talk about the recent features added to Solr making it easier for users and some of the changes we plan on adding soon to make the experience even better.
- SolrCloud Deep Dive - Saumitra Srivastav, Glassbeam
In this session, we will explore SolrCloud (https://cwiki.apache.org/confluence/display/solr/SolrCloud) in detail. We will start from basic and talk about the strategies used in production to partition your data to achieve high performance.
- Solr + Hadoop 2 - Saumitra Srivastav, Glassbeam
In this session we will see how can we run Solr on HDFS, what are the limitations, etc. We will explore the reasons why folks want Hadoop integration, even though SolrCloud provides fault tolerance, high availability, etc.
- Nutch Integration - Saumitra Srivastav, Glassbeam
Apache Nutch (http://nutch.apache.org/) is a popular tool for web crawling. In this session, we will see how can you index data crawled by Nutch directly in Solr.
Contact:
• Saumitra Srivastav (saumitra.srivastav@glassbeam.com) +91-8553-845-201
• Suraj Atreya (suraj.atreya@glassbeam.com) +91-9880-718-113

SolrCloud, Solr + Hadoop 2 & Nutch Integration