addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Intro to Natural Language Text Mining - Short Course

Intro to Natural Language Text Mining - Short Course

This class will cover machine learning applied to natural language text documents.  We will cover the use of statistical algorithms for accomplishing machine learning tasks on texts.  We won't cover more traditional rule-based semantics, parsing, etc. 

We'll start with some introduction to the subject matter, comparison of statistical techniques to semantic approaches, definition of problems in text mining, and simple text manipulations.  We'll cover various algorithms for dealing with standard text mining problems, such as indexing, automatic classification (e.g. spam filtering) topic modeling, classification etc.

Course Outline

I. Intro to text mining problems
II. R language background
III Basic text manipulations
-Stop words
IV  Document-Term Matrix Processing
-Formation and Basic Manipulations of Document-Term Matrix
-Latent Semantic Indexing - Search
-Topic Modelling - Clustering and Classification
-Spam Detection. 

Prerequisites - Programming experience is required.  We'll use code examples to work through the material.  We'll use R programming language so you should have R installed and R Studio.  There will be a short intro to R for those who haven't used it.  Other than that you'll only need general undergrad level background math.

Class Registration

There's a $100 discount if you sign up at least 5 days before the class starts.

Those who don't register on Eventbrite can register and pay by check or cash the day of class.  In-class registration will go from 9:00 am until 9:30 am.


The class will be webcast for those who want to view remotely.  You'll need to sign up on eventbrite, if you want the to receive the webcast.

Join or login to comment.

  • Ryan W.

    How many of you are attending remotely?

    December 15, 2012

  • Mike B.

    I'll probably run this course again in 6 months or so. I do a variety of different ML courses. The next one will be a course taking 5 successive Saturday mornings where attendees will learn to write their own ML algorithms using map-reduce on hadoop.

    December 14, 2012

    • Jiunjiun M.

      sounds great, please keep us posted

      December 14, 2012

  • ari

    Will you be running this event again in the future?

    I'm sad I won't be able to attend this one as I just found out about it. :/

    December 13, 2012

  • A former member
    A former member

    Anyone able to give a ride from SF?

    December 12, 2012

27 went

Your organizer's refund policy for Intro to Natural Language Text Mining - Short Course

Refunds are not offered for this Meetup.

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy