Skip to content

This event was canceled

Details

For our April meetup we're so very lucky to have Julia Silge join us! Julia is a data scientist and software engineer at RStudio where she works on open-source modeling tools. She is also the co-author of Tidy Text Mining with R (tidytextmining.com). Julia is keynoting at satRdays Newcastle on April 4, and passing through Edinburgh on her way there. We recommend checking out newcastle2020.satrdays.org for a great, nearby R event as well!

Understand word embeddings using tidy data principles

Modern NLP frameworks often depend on word embeddings, a way of statistically modeling language where words or phrases are mapped to vectors of real numbers. In this talk, we’ll work to understand word embeddings by investigating how we can generate them using count-based statistics and dimensionality reduction, then learn how to make use of pre-trained embeddings based on enormous datasets. Finally, we’ll explore the ethical issues involved in using word embeddings and how they can amplify systemic and historical bias.

Related topics

You may also like