A gathering of people interested in search and discovery and related fields: information gathering, extraction, and retrieval; natural language processing; text analytics; data mining; sentiment detection; named entity recognition...
Some example technologies that would be nice to cover are Lucene, Solr, Nutch, Mahout, UIMA, GATE, OpenNLP, Hadoop, HBase, Katta, etc.
The plan is to hold more or less regular meetups and share experiences involving any of the target topics. Initial meetups will likely be less structured, but over time, if people are interested in that format, we can organize presentations.
The Probabilistic Retrieval (PR) model uses probability to estimate the odds that a document is relevant to a query. PR helps us estimate how much each term contributes to relevance. In this talk we will discuss how to incorporate probabilistic term knowledge into a Boolean / vector space retrieval environment to improve relevance.
Presented by Alex Lin, Sr. Architect of Intelligent Mining
Presentation: 30 minutes, followed by Q&A and discussion
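To give a rough feel for the topic before the talk, here is a minimal sketch of probabilistic term weighting applied to simple term-overlap scoring. The abstract does not say which formula the talk will use; this sketch assumes a smoothed Robertson/Sparck Jones-style weight computed from document frequencies only (no relevance feedback). All function names and the toy documents are made up for illustration.

```python
import math
from collections import Counter


def term_weight(doc_freq, num_docs):
    """Smoothed probabilistic term weight (assumed formula, not from the talk).

    Rare terms get a high weight, common terms a low one; the 1.0 inside
    the log keeps the weight positive even for very common terms.
    """
    return math.log(1.0 + (num_docs - doc_freq + 0.5) / (doc_freq + 0.5))


def document_frequencies(docs):
    """Count, for each term, how many documents contain it."""
    df = Counter()
    for doc in docs:
        df.update(set(doc))
    return df


def score(query, doc, df, num_docs):
    """Sum the weights of the distinct query terms that occur in the document."""
    present = set(doc)
    return sum(term_weight(df[t], num_docs) for t in set(query) if t in present)


def rank(query, docs):
    """Return (score, doc_index) pairs, best match first."""
    df = document_frequencies(docs)
    scored = [(score(query, d, df, len(docs)), i) for i, d in enumerate(docs)]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))


if __name__ == "__main__":
    # Hypothetical toy corpus: each document is a list of tokens.
    docs = [
        ["lucene", "search", "index"],
        ["solr", "search", "server"],
        ["hadoop", "cluster", "search"],
        ["lucene", "scoring", "relevance"],
    ]
    for doc_score, idx in rank(["lucene", "relevance"], docs):
        print(idx, round(doc_score, 3), docs[idx])
```

In this sketch a plain Boolean match (term present or not) is weighted by how informative the term is, so a document matching a rare query term outranks one matching only a common term.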
Lucene, Solr, Nutch NYC Experts