What we're about

Information retrieval is everywhere. We organize talks around implementations of information retrieval, in search engines, in recommender systems, or in search-oriented conversational agents.

Search Engines Amsterdam Meetups are usually held on the last Friday of the month, at SPUI25 or at Science Park Amsterdam. Two talks in a row, one industrial, the other academic, 25+5 minutes each. No marketing. Just algorithms. Followed by drinks.

Companies and organizations that presented or will present at SEA:

• 904Labs • Amsterdam Data Science • Beeld en Geluid • Blendle • Bol.com • Collaborne • Criteo • CWI • Elasticsearch • Elsevier • Fredhopper • FuelUp • GoDataDriven • Google • Improve Digital • KBResearch • Marktplaats • Microsoft AI • MyDataFactory • Philips • Qualcomm • SDL • Sanoma • Seecr • SourcingThing • Spinque • Technical University Eindhoven • Textkernel • Trivago • University of Amsterdam • University of Glasgow • University of North Carolina at Chapel Hill • UserSat.com • WizeNoze • Yahoo! • Yandex • ZyLab

Upcoming events (5+)

Textual Definition Learning and Unified Semantic Parsing

This Friday we'll have two talks followed by drinks.

16:00 Georgios Tsatsaronis (Elsevier) Topic Pages: From Articles to Answers
16:30 Priyanka Agrawal (Booking.com) Unified Semantic Parsing with Weak Supervision

========================

16:00 Georgios Tsatsaronis (Elsevier)
Topic Pages: From Articles to Answers

Automating the process of learning definitions from unstructured text at scale enables applications with great impact, such as building glossaries, dictionaries, or topic pages that profile scientific concepts and help readers of scientific articles understand the contents faster and in more depth. In this talk we introduce Topic Pages, a publicly available set of automatically created information pages for scientific concepts across 21 domains. We discuss the technical challenges of extracting the relevant information from tens of millions of book chapters and scientific articles, as well as the novel methodologies and architecture that were used, sitting at the borders of Machine Learning, Natural Language Processing and Scalable Data Processing and Management. The focus will be on the best technical practices used to create this large-scale machine learning production pipeline, as well as on the novel methodology used to learn textual definitions from unstructured text, based on Multiview LSTMs.

Bio: Dr. George Tsatsaronis is Vice President Data Science, Research Content Operations, at Elsevier (RELX Group). Prior to joining Elsevier in 2016 he worked in academia for 13 years, doing research and teaching in the fields of machine learning, natural language processing and bioinformatics at universities in the UK, Greece, Norway and Germany. He has published more than 60 scientific articles in high-impact peer-reviewed journals and conference proceedings in various areas of Artificial Intelligence, primarily natural language processing and text mining. His PhD is in the field of text mining, and he also holds a BSc in Informatics from Athens University of Economics and Business, and an MSc in Advanced Computing from Imperial College London, with specialization in Artificial Intelligence and robotics. He is the inventor of several Artificial Intelligence pipelines that support some of the biggest research platforms of Elsevier.

========================

16:30 Priyanka Agrawal (Booking.com)
Unified Semantic Parsing with Weak Supervision

Semantic parsing over multiple knowledge bases enables a parser to exploit structural similarities of programs across the multiple domains. However, the fundamental challenge lies in obtaining the high-quality annotations of (utterance, program) pairs across various domains that are needed to train such models. To address this problem, this talk discusses a novel framework for building a unified multi-domain semantic parser trained only with weak supervision (denotations). Weakly supervised training is particularly arduous because the program search space grows exponentially in a multi-domain setting. To solve this, we incorporate a multi-policy distillation mechanism in which we first train domain-specific semantic parsers (teachers) using weak supervision, and then train a single unified parser (student) from these domain-specific teacher policies. The resulting semantic parser is not only compact but also generalizes better and generates more accurate programs. Furthermore, it does not require the user to provide a domain label while querying. Our experiments demonstrate that the proposed model significantly improves performance compared to baseline techniques.

SEA: Search Engines Amsterdam

Room C1.112

This Friday we'll have two talks followed by drinks. 16:00 Georgios Tsatsaronis (Elsevier) to appear 16:30 Priyanka Agrawal (Booking.com) to appear

Past Events

Elasticsearch and Journalism Content Recommendation
