Apache Solr & MADlib (incubating): Enabling Massive Text Analytics In-Database


Details
90% of the world's unstructured data is text. Integrating Text into a data warehouse of structured data provides for an enhanced data warehousing and rich analytics experience.
Pivotal GPText enables processing mass quantities of raw text data (such as social media feeds or e-mail databases) into mission-critical information that guides business and project decisions. GPText joins the Greenplum Database massively parallel-processing database server with Apache SolrCloud enterprise search and the MADlib Analytics Library to provide large-scale analytics processing and business decision support. GPText includes free text search as well as support for text analysis.
Bharath Sitaraman, Product Manager for GPText, will cover:Full Text Search principles, Full Text Search in Greenplum, and a tour of the latest GPText Functionality.
Enjoy pizza and networking! Agenda for the evening:
-
6:00-6:30 Pizza and socializing - 6:30-7:30 GPText discussion & live Demo
-
7:30-8:00 Q&A and interactive session
Attend in person or online: https://www.youtube.com/watch?v=xUUGWl9iCGE

Apache Solr & MADlib (incubating): Enabling Massive Text Analytics In-Database