Doorgaan naar de inhoud

Details

This Friday we'll have two talks followed by drinks. Our industrial speaker is Dyaa Albakour, a data scientist at Signal Media. He will talk about crowdsourcing for entity linking. Our academic speaker is Maarten Marx, Assistant Professor at the University of Amsterdam. He will talk about accessing city councils using exploratory search systems.

Program:

16:00-16:30 Dyaa Albakour

16:30-17:00 Maarten Marx

17:00-18:00 Drinks and snacks

Details:

Dyaa Albakour - Crowdsourcing at scale! The case of Entity Linking

In this presentation, we first review the current state-of-the-art for the EL task and make the case for using supervised learning approaches to tackle EL. These approaches require large amounts of labelled data, which represent a bottleneck for scaling them out to cover large numbers of entities. To mitigate this, we have developed a production-ready solution, powered by Active Learning, to collect high-quality labelled data at scale with Crowdsourcing. In particular, we will discuss the different steps and the challenges in tuning the design parameters of the crowdsourcing task. The design parameters include the qualification of the workers and UI features that help them complete the task. The tuning aims to limit the noise, reduce the cost and maximise the throughput of labelling whilst maintaining the quality of the resulting models for EL.

Bio: Dyaa works as a data scientist at a London-based company called Signal Media. In particular, Dyaa works with a team of scientists and developers to research and develop big-data text analytics services for a large-scale business intelligence platform. The platform supports reputation management, media coverage reporting, content marketing and other use cases. Prior to that, Dyaa was working as a Post-doctorate Researcher in the School of Computing Science at the University of Glasgow. He was working with Dr. Iadh Ounis and Dr. Craig Macdonald on the integrated Multimedia City Data (iMCD) project within Urban Big Data Centre. He also worked on the SMART FP7 EU project. Before coming to Glasgow, he was based in Colchester, Essex. Over there, he worked on a couple of research projects and completed his PhD in 2012 at the University of Essex under supervision of Dr. Udo Kruschwitz. His thesis is titled Adaptive Domain Modelling for Information Retrieval.

Gerelateerde onderwerpen

Misschien vind je ook leuk