
Dave Robinson on tidytext

Register: Expedite your registration at Galvanize and register here

Note: The room at Galvanize is LL1+LL2.

Time: Socialize from 6:30 to 7:00, talk from 7:00 to about 7:45, then head to a bar shortly after.

Title: Tidy Text Mining with R

Abstract: Text data is increasingly important in many domains, but it can be challenging to manipulate and visualize within typical R analysis workflows. In this talk, I will introduce the tidytext package and show how tidy data principles and tools can make text mining easier and more effective, by structuring text as one-token-per-row. You'll learn how to manipulate, summarize, and visualize text's characteristics using R packages from the tidy ecosystem such as dplyr, ggplot2, and tidyr. You'll see case studies of sentiment analysis, tf-idf, and topic modeling applied to examples from literature, Twitter, and Stack Overflow questions, and gain the tools to draw conclusions from your own text datasets.

Bio: David Robinson is a Data Scientist at Stack Overflow, where he analyzes data on the world's software developers to help them find answers to their programming questions. He is the co-author with Julia Silge of the tidytext package and of the upcoming book Text Mining with R, to be published by O'Reilly in 2017. He is also the author of the broom, gganimate, and fuzzyjoin packages and of the DataCamp course "Exploratory Data Analysis in R: Case Study." He writes about R, statistics and education on his blog Variance Explained, as well as on Twitter as @drob.


  • Jim R.

    Just to confirm--this is at the Platte St. Galvanize and not the Golden Triangle location, correct?

    2 days ago

  • Matt P.

    Is it worth commenting that Dave's talk is starting 30 minutes earlier than speakers typically start?

    February 15
