
#10: Multi-Structured Content in Data Science

Hosted By
Trent M. and Daniel N.

Details

Note: this event takes place on Wednesday, June 3, not Monday, June 1.

This is a special event in collaboration with LucidWorks & Retresco. Here's the schedule:

  1. Food & drink (20 min). LucidWorks & Retresco are supplying this. (Thanks kindly!)

  2. Talk: Solr for Data Science (Grant Ingersoll, 45 min + 10 min Q&A)

Abstract: Search engine technology is rapidly evolving from keyword-based lookups to a highly sophisticated ranking engine capable of incorporating many different features across complex data types. With the release of Solr 5, it is now possible to ask more interesting questions of multi-structured content than ever before. In this talk, we will explore a number of new and interesting features, ranging from incredibly easy data ingest to advanced faceting and statistical capabilities, that make data insight easier than ever.

Bio: Grant Ingersoll is the CTO and co-founder of Lucidworks as well as an active member of the Lucene community: a Lucene and Solr committer, co-founder of the Apache Mahout machine learning project, and a long-standing member of the Apache Software Foundation. Grant is also the co-author of “Taming Text” from Manning Publications.

  3. Talk: PCRF - A small and efficient C++ library for supervised learning (Thomas Hanneforth, 45 min + 10 min Q&A)

Abstract: We report on recent efforts with supervised ML: a C++ library based on an optimised implementation of the averaged perceptron algorithm [Collins 2002]. As a use case, we report on a named entity recognition task based on an annotated newspaper corpus provided by Retresco GmbH, Berlin.

Bio: Thomas Hanneforth is a senior lecturer at the University of Potsdam. His main topics are parsing, weighted automata, and machine learning.

Berlin Machine Learning Group
Retresco GmbH
Grunberger Strasse 44a, 10245 Berlin · Berlin