Skip to content

Details

For May's event we'll be hosting a workshop focused on Natural Language Processing going from zero to working sets of informative features derived from unstructured text.

The topics we will cover:

  • Tokenization design (Using SpaCy)
  • Token/word counting (Using scikit-learn)
  • Topic models (scikit-learn)

We will make use of Google Collaboratory to run the code, so no need to install anything! (Just make sure you have a Google account!)

We will announce the Zoom info closer to the event. Looking forward to (virtually) seeing everyone!

Members are also interested in