Text Mining Short Course - in San Francisco

Name: Text Mining Short Course - in San Francisco
Start: 2013-07-02T18:00:00-07:00
End: 2013-07-02T21:00:00-07:00
Location: DocuSign

Hosted By

Mike B.

Details

Intro to Natural Language Text Mining - Short Course

This class covers machine learning applied to natural language text documents. Applications include text-based search, predicting page reads from text content, sentiment analysis, and automated page and email construction based on user history. The focus is on statistical algorithms for carrying out these machine learning tasks on text.

The course starts with an introduction to the subject matter, a comparison of statistical techniques with semantic approaches, definitions of the main problems in text mining, and simple text manipulations. We'll go through some basics of the R language so you can follow the code examples presented. We'll then cover various algorithms for standard text mining problems, such as indexing, automatic classification (e.g. spam filtering), and topic modeling.

Course Outline

I. Intro to text mining problems
II. R language background
III. Structuring Text
-Normalization
-Stop words
-Stemming
IV. Document-Term Matrix Processing
-Formation and Basic Manipulations of Document-Term Matrix
-Latent Semantic Indexing - Search
-Topic Modelling - Clustering and Classification
-Spam Detection
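
To give a feel for the "Structuring Text" and "Document-Term Matrix" items in the outline, here is a minimal sketch of those steps. It is written in Python purely for illustration; the class itself works through R examples, and this is not the course code. The toy corpus, the stop-word list, and the helper names (normalize, crude_stem, document_term_matrix) are all hypothetical, and crude_stem is a simple suffix stripper standing in for a real stemmer such as Porter's.

```python
import re
from collections import Counter

# Hypothetical toy corpus and stop-word list, invented for this illustration.
DOCS = [
    "Win a FREE prize now!",
    "The meeting notes are attached.",
    "Claim free prizes now",
]
STOP_WORDS = {"a", "the", "are", "now"}


def normalize(text):
    """Normalization: lowercase the text and keep only alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def crude_stem(token):
    """Strip one common suffix; a crude stand-in for a real stemming algorithm."""
    for suffix in ("ing", "s"):
        # Only strip when at least three characters of stem would remain.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Normalize, drop stop words, then stem the remaining tokens."""
    return [crude_stem(t) for t in normalize(text) if t not in STOP_WORDS]


def document_term_matrix(docs):
    """Return (vocabulary, matrix) with one row per document, one column per term."""
    counts = [Counter(preprocess(d)) for d in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for term in vocab] for c in counts]
    return vocab, matrix


vocab, matrix = document_term_matrix(DOCS)
print(vocab)
for row in matrix:
    print(row)
```

Each row of the matrix counts how often each vocabulary term appears in one document. Techniques later in the outline, such as latent semantic indexing and spam detection, typically start from a matrix of this shape.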

Prerequisites - Programming experience is required. We'll use R code examples to work through the material. You should have both R and RStudio installed. Here are the download links:

http://cran.us.r-project.org/
http://www.rstudio.com/ide/download/desktop

There will be a short intro to R for those who haven't used it.

Class Registration

Class will meet in two sessions: Wed 6/26 and Tues 7/2, from 6 pm to 9 pm. Registration covers both sessions. Pay by credit card at

http://textmining.eventbrite.com

or by PayPal (mike at mbowles dot com), or by check or cash at the first session. There's a $50 discount if you sign up at least 5 days before the class starts.

Webcast

The class will be webcast for those who want to view it remotely. To receive the webcast instructions, you'll need to sign up on Eventbrite at least 24 hours before the start of class.

Silicon Valley Hands On Programming Events

Tuesday, July 2, 2013
6:00 PM to 9:00 PM PDT

DocuSign

221 Main St, Suite 1000 · San Francisco, CA
