Skip to content

dask-sql: Query your (big) data with the power of Python and SQL

Photo of Dan Coyle
Hosted By
Dan C.
dask-sql: Query your (big) data with the power of Python and SQL

Details

Happy new year PyData friends! We're excited to continue our virtual meetups in partnership with PyData Miami and PyData Philly. We'll have breakout sessions for networking with fellow attendees and then a great talk from Nils Braun.

Schedule:
5:00pm - 5:30pm: networking
5:30pm - 6:30pm: talk by Nils Braun

Join here! https://hopin.com/events/pydata-meetup-query-your-big-data-with-the-power-of-python-sql

Title:
dask-sql: Query your (big) data with the power of Python and SQL

Description:
Accessing and working with data is increasingly important for the success of a project or a company. Traditionally, relational (SQL) databases have been used for this and many developers and non-developers working with data are very familiar with the SQL query language. However, the requirements of today's landscape are much more complicated: machine learning, complicated distributed clusters of machines, large amount of data - all of this does not play well with the world of SQL databases.

The Dask ecosystem and the dask-sql library try to bridge this gap. Dask enables its users to easily scale data-intensive Python applications and comes with a large number of extensions, e.g. for machine learning. dask-sql adds a SQL query layer on top of the Dask ecosystem, which allows to query big and small data from SQL and scale the computations via a Dask cluster - without interacting with the complexities of a distributed system.

In this talk, we will cover Dask as the basis for a scalable Python ecosystem and then explore the SQL query engine dask-sql and how it might help to combine the best of both the SQL and Python world. After that, a demo session will demonstrate how you can set up dask-sql on your own cluster and quickly analyze your (big) data.

About the speaker:
Nils is a Data Engineer and core developer of the dask-sql and tsfresh libraries. He did his PhD in particle physics at an experiment in Japan, where he got used to big and complex data architectures. He was working both as a Data Scientist and as a Data Engineer and knows how hard it is, to combine the two worlds. Currently, he is responsible for the Data Engineer training and mentoring at the Bosch Center for Artificial Intelligence in Stuttgart, Germany.

Join here: https://hopin.com/events/pydata-meetup-query-your-big-data-with-the-power-of-python-sql

Photo of PyData NYC group
PyData NYC
See more events
Online event
This event has passed