Skip to content

Details

Exploring Data with Python - Glenn Lehman

In this presentation, we'll review some mathematical foundations needed to perform detailed data analysis, applied in Python. If you know basic Python, you can join us as we explore data, learn how to use statistical analysis to make predictions, and understand the initial steps of exploring a data set to see what might be lurking there. What makes a data set a valid sample instead of meaningless noise? How can we interpret visualizations to determine the quality of data samples?

Mathematical Topics Covered:

  • Population
  • Sample
  • Categorical and Quatitative Data (Probability Distribution)
  • Basic visualizations (Bar Plot, Histograms, Stem and Leaf displays)
  • Cross Tabulation - Mean, Median, Mode
  • Group Frequency
  • Standard Deviation
  • Percentiles, Quartiles, Quintiles, Deciles
  • Plotting (elementary box plot)
  • IQR (interquartile range) - Box Plots (traditional/modified)

Python libraries introduced:

  • numpy
  • pandas
  • matplotlib

Working code will be provided in a Jupyter notebook. This presentation will be recorded.

Data Science
Data Science using Python
Applied Statistics

Sponsors

Sponsor logo
Temporal
Temporal sponsors our May 8th PyCon presentation rehearsals
Sponsor logo
Cambridge Mobile Telematics
CMT has sponsored Presentation Night
Sponsor logo
DataDog
DataDog is a regular host and sponsor of our in-person events

Members are also interested in