Small Big Data: using NumPy and Pandas when your data doesn't fit in memory

Hosted By
Patrick H. and Colin D.
Details

Please join PyData Pittsburgh for a special presentation of the talk "Small Big Data: using NumPy and Pandas when your data doesn't fit in memory" by Itamar Turner-Trauring. We'll gather in person at Code & Supply, and Itamar will join us via video link from Cambridge, MA.

About the talk

Your data is too big to fit in memory—loading it crashes your program. There's no need to switch to a complex Big Data cluster just yet, though! Much of the time you can process your data simply and quickly with your existing tools running on a single computer.

In this talk you’ll learn the basic techniques for dealing with larger-than-memory data on a single computer: money, compression, batching, and indexing. You’ll specifically learn how to apply these techniques to NumPy and Pandas, but you’ll also learn the key concepts, which you can apply to other libraries and to the specifics of your particular data.
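
To give a flavor of two of these techniques, here is a minimal sketch of compression and batching. It is not taken from the talk: the tiny in-memory CSV, the column names `user_id` and `amount`, and the helper `total_amount` are all hypothetical stand-ins for a real file that would be too large to load at once. Money and indexing are not shown.

```python
import io

import numpy as np
import pandas as pd

# Compression: store the same values in a smaller dtype.
# The values 0..99 fit in uint8, which needs 1 byte per element instead of 8.
wide = np.arange(100, dtype=np.int64)
narrow = wide.astype(np.uint8)
bytes_wide = wide.nbytes      # 100 elements * 8 bytes
bytes_narrow = narrow.nbytes  # 100 elements * 1 byte

# Batching: process a CSV in fixed-size chunks instead of loading it whole.
# Hypothetical data: a small in-memory CSV stands in for a huge file on disk.
csv_text = "user_id,amount\n" + "\n".join(f"{i % 3},{i}" for i in range(10))


def total_amount(csv_file, chunksize=4):
    """Sum the 'amount' column while holding only one chunk in memory."""
    total = 0
    # With chunksize set, read_csv yields DataFrames of at most that many rows.
    for chunk in pd.read_csv(csv_file, chunksize=chunksize):
        total += int(chunk["amount"].sum())
    return total


total = total_amount(io.StringIO(csv_text))
```

The smaller dtype is only safe when every value fits in its range, so it is worth checking the minimum and maximum of a column before downcasting. The chunk size trades memory for overhead: larger chunks use more RAM but need fewer passes through the loop.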

COVID-19 safety measures

Masks required
Event will be indoors
Code & Supply requires masks for anyone who is not fully vaccinated. We may have spare masks available if you forget one. We ask that you be fully vaccinated unless you cannot for medical reasons. We don't require proof but we expect you will value and respect the safety of your community members. Please do not come if you feel sick. If you're unsure or have been potentially exposed in an enclosed or indoor space, please test yourself!
PyData Pittsburgh
Code & Supply Coworking
5648 Friendship Ave 3rd Floor · Pittsburgh, PA