Next Meetup

A Tidy Text Analysis of The Simpsons
Abstract

In this presentation, Seamus will illustrate the benefits of adopting "tidy" principles for data analysis, even with unstructured data such as text. As an example, he will use over 500 scripts from the first 27 seasons of The Simpsons to walk through the process of transforming unstructured data into a structured format. Building on the work of others, and primarily using R, the analysis will dig into the sentiment and story structure of individual characters, and the nuanced interactions between characters when they appear together. The presentation is intended to be a fun example of a project using unstructured data, the principles of which might be applied to more practical real-world problems.

About the Speaker

Seamus Wedge is a Data Scientist at Jewelers Mutual Insurance Group in Neenah, WI. He has a B.S. in Chemical Engineering from UW-Madison and an M.S. in Data Science from UW-Eau Claire. His business experience has included several roles in customer-facing product development, marketing, and now data science. His current work projects include using data to model and improve the customer experience across channels. His passion is for finding hidden insights in data and using creative ways to clearly communicate those results.

Food for this meetup is courtesy of American Family Insurance. After-meetup drinks are courtesy of Hortonworks.

Madison Central Public Library

201 West Mifflin St. Room 302 · Madison, WI

    What we're about

    Big Data Madison promotes the understanding and adoption of technologies used to acquire, store, and analyze data in all its forms. This spans everything from data engineering to data science.

    Everyone is encouraged to attend; no level of experience is too basic to join and learn.

    We will focus on some of the technologies used in the Big Data ecosystem (Hadoop, Spark, streaming data and data processing, etc.), as well as topics in Data Science (machine learning, data visualization, analytics, and more). We will try to balance the topics among technology talks, use cases, and demos.

    Members (2,163)
