Intro to Natural Language Text Mining - Short Course
This class will cover machine learning applied to natural language text documents. Applications include text-based search, predicting page reads from text content, sentiment analysis, and automated page and email construction based on user history. We will cover statistical algorithms for accomplishing these machine learning tasks on text.
I. Intro to text mining problems
II. R language background
III. Structuring Text
IV. Document-Term Matrix Processing
-Formation and Basic Manipulations of Document-Term Matrix
-Latent Semantic Indexing - Search
-Topic Modelling - Clustering and Classification
Prerequisites - Programming experience is required. We'll use R code examples to work through the material. You should have both R and RStudio installed. Here are links for those.
There will be a short intro to R for those who haven't used it.
Early Bird registration is $195 until Monday 12/11. Regular registration is $235. Pay by credit card through Eventbrite:
http://textmining.eventbrite.com, or by PayPal (mike at mbowles dot com). You can also pay by check or cash at the first session.
The class will be webcast for those who want to view it remotely. To receive the webcast instructions, you'll need to sign up on Eventbrite at least 24 hours before the start of class.