This time we change the format of PyCoffee to turn it into a full-day workshop, along with Redis Portugal and Porto i/o.
WORKSHOP CHALLENGE. VISUALIZING WIKIPEDIA EDIT STREAMS WITH REDIS AND PLOTLY DASH
Wikipedia is probably the biggest and the most successful crowdsourcing platform. The amount of changes, made by volunteer authors and moderators is staggering! Thanks to the open nature of the platform, we can retrieve and process each update in real time.
Python Porto, Redis Portugal (https://www.meetup.com/Redis-Portugal/) and Porto i/o invite you to a full day workshop. In this workshop you will read and process the stream of Wikipedia edits to visualize the rate of edits for each domain of the Wikimedia family of projects. By the end of it you will create a web application which lets users to choose one or more domains to watch, and provides editing statistics for each of the selected domain.
You will get hands-on experience on how to process and visualize data streams in real time, familiarize yourself with Redis, Python client for Redis, and Dash, an Open Source visualization platform from Plotly. In Redis specifically, you will learn the concept of pub-sub events and familiarize with core Redis data types.
09h30 - 10h00. Check in. Bring your laptops, check your internet connection, etc.
10h00 - 10h30. Outlining the challenge. Introduction to Redis. (Filipe Oliveira, Redis Portugal).
10h30 - 12h30. 1st part of the challenge: processing event stream and storing data to Redis.
12h30 - 13h30. Lunch together somewhere nearby
13h30 - 14h00. Introduction to Plotly Dash. (Roman Imankulov, Python Poprto)
13h30 - 15h30. 2nd part of the challenge: visualizing stored data.
15h30 - 16h00. Wrap up. Proudly demonstrating your work to your peers.
EXPECTED LEVEL OF PRIOR CHALLENGE
- Basic knowledge of Python is necessary (made some useful work with Python before, wrote scripts, Jupyter notebooks, applications, etc).
- Familiarity with Redis and NoSQL concepts in general is welcome, but not necessary.
- Prior experience with Plotly Dash is not necessary.
To avoid wasting time on the event, we ask you to make some preparatory work before.
- Install Python. Version 3.6, 3.7 or 3.8 is necessary.
- Install Redis following instructions from https://redis.io. If you use Windows and familiar with Docker, you can run it from a Docker container.
- Alternatively, sign up for a cloud version of Redis at https://redislabs.com/. They provide a free plan, enough to get started with our experiments.
WHAT TO READ TO LEARN MORE OR TO PREPARE BETTER FOR THE WORKSHOP
- Python tutorial on the official website is a good starting point to learn more about the language or to brush up your knowledge: https://docs.python.org/3/tutorial/index.html
- If you want to get to know more about Redis, read the first chapters of the "Redis in Action" book, which is available online freely at https://redislabs.com/redis-in-action/
- Redis Time Series Patterns https://redislabs.com/redis-best-practices/time-series/sorted-set-time-series/
- RedisTimeSeries. Alternative implementation of time series, working as a Redis plug-in. https://oss.redislabs.com/redistimeseries/
IMPORTANT NOTE ON THE VENUE
We gather together in Porto i/o Santa Catarina, it's not our usual venue!