Details

The workshop will primarily cover R's Text Mining package (tm), as well as a couple of machine learning packages. The main parts of the workshop will include:

  • converting text into a usable form (a matrix of terms/words and their associated documents)

  • performing basic analysis (frequencies and creating wordclouds)

  • applying Classification via a basic Machine Learning algorithm (Bayes)

  • analyzing Twitter data as one of the examples (Twitter data is covered in the earlier API workshop). Sentiment analysis will only be touched on briefly, as it is more of a specialty topic.
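
As a rough preview of the first two parts, here is a minimal sketch of turning text into a term-document matrix with tm and counting term frequencies. The three sample sentences and the variable names (docs, corpus, tdm, freq) are made up for illustration and are not the workshop's own materials; the sketch assumes the tm package is installed.

```r
# Illustrative sketch only: the sample documents below are invented.
library(tm)

docs <- c("R makes text mining easy",
          "Text mining turns text into data",
          "Naive Bayes is a simple classifier")

# Build a corpus and apply some common cleaning steps
corpus <- VCorpus(VectorSource(docs))
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))

# Rows are terms, columns are documents
tdm <- TermDocumentMatrix(corpus)
m <- as.matrix(tdm)

# Total count of each term across all documents, most frequent first
freq <- sort(rowSums(m), decreasing = TRUE)
print(head(freq))
```

A frequency vector like freq is the usual input for a word cloud, and a matrix like m (transposed so that documents are rows) is one common starting point for a classifier such as Naive Bayes; which packages the workshop uses for those steps is not stated here.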
