BDM63: Redshift SQL and Delta Lake / Koalas and MLflow Demo

This is a past event

100 people went

Details

Big Data Montreal would like to invite you to its 63rd meeting!

LOCATION: Hopper - 5795 de Gaspé #100 (near Rosemont métro)

Join us on Tuesday, September 3, 2019 at 6:30 PM to attend the talks and network with other Big Data enthusiasts from Montreal!

All are welcome, whether you already have some experience with Big Data technologies or are simply curious to learn more.

We have 2 presentations scheduled:

* Optimizing SQL queries for AWS Redshift
- Nicolas Marchildon, Team Lead at Plusgrade

"NoSQL has been heavily adopted throughout the industry, but analysts still expect SQL for expressing their queries and expect quick results.
This talk will focus on how Redshift stores data and how SQL queries are executed. The goal is to figure out how to author SQL queries that will be optimal for Redshift execution."

* Productionizing Machine Learning with Delta Lake, Koalas, and MLflow
- Daniel Arrizza, Customer Success Engineer at Databricks

"Databricks is an end-to-end platform for data engineering, data science, and ML. It integrates many open source tools: not only Apache Spark, but also Delta Lake, Koalas, Hyperopt, and MLflow. We'll walk through querying a data lake with Delta Lake, transforming the data with Koalas (distributed pandas dataframes), doing machine learning with hyperparameter tuning (Hyperopt), and logging our experiment results to MLflow."

Please tell your friends and colleagues :) !

=====================================

Big Data Montreal invites you to its 63rd meeting!

Location: Hopper - 5795 de Gaspé #100 (near Rosemont métro)

Join us on Tuesday, September 3, 2019 at 6:30 PM to attend the talks and network with other Big Data enthusiasts from Montreal!

All are welcome, whether you already have experience with Big Data technologies or are simply curious to learn more.

We have 2 presentations on the schedule:

* SQL queries with AWS Redshift
- Nicolas Marchildon, Team Lead at Plusgrade

"Programmers have adopted NoSQL databases, but analysts need SQL to express a wide variety of queries, and they want results within a few seconds.
Let's look at how Redshift organizes data and executes SQL. Our goal will be to understand how to design tables that are optimal for these queries."

* Productionizing Machine Learning with Delta Lake, Koalas, and MLflow
- Daniel Arrizza, Customer Success Engineer at Databricks

"Databricks is an end-to-end platform for data engineering, data science, and ML. It integrates many open source tools: not only Apache Spark, but also Delta Lake, Koalas, Hyperopt, and MLflow. We'll walk through querying a data lake with Delta Lake, transforming the data with Koalas (distributed pandas dataframes), doing machine learning with hyperparameter tuning (Hyperopt), and logging our experiment results to MLflow."

Spread the word and come in large numbers :) !