Hi PyLadies Berlin!
We have an exciting new series of workshops for you from two amazing PyLadies from London and Munich! 🙌 Today we start with the first one: scraping & regex.

🔊 Introduction to Web-Scraping & Regex

Over a months-long journey, Ane and Anglina have been setting up "The Boulder Gym project": a web app that analyzes and predicts the occupancy rate of all bouldering gyms in their cities. Along the way they encountered multiple challenges, ventured into unknown territories of knowledge, and established a friendship across different time zones while working together on this project.

Take this opportunity to follow their journey with us while learning a thing or two about web scraping and exploring this interesting field of Python. This meetup will follow a storyline sprinkled with exercises. Today you will hear the first part of this exciting project, which is in fact the first step of any data science project: data gathering.

🤓 What you will learn:
You will learn to scrape data from a website using Python’s requests library and regex (regular expressions), and how to improve your code iteratively. By the end of this meetup, we hope you will feel confident about scraping any website and will have solid regex skills.
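To give a flavour of what this looks like, here is a minimal sketch and not the speakers' actual code. The HTML snippet, the pattern, and the function names `fetch_html` and `extract_occupancy` are all assumptions made up for illustration; a real gym website will be structured differently.

```python
import re
from typing import Optional

# Hypothetical HTML snippet; a real gym page will look different.
SAMPLE_HTML = '<div class="occupancy">Current occupancy: <span>42%</span></div>'

# Capture the digits in front of the percent sign that follows "occupancy:".
OCCUPANCY_PATTERN = re.compile(
    r"occupancy:\s*<span>(\d{1,3})%</span>", re.IGNORECASE
)


def fetch_html(url: str) -> str:
    """Download a page and return its HTML as text."""
    # requests is a third-party package; it is imported here so that the
    # regex part of this sketch also runs without it installed.
    import requests

    response = requests.get(url, timeout=10)
    response.raise_for_status()  # raise an error on 4xx/5xx responses
    return response.text


def extract_occupancy(html: str) -> Optional[int]:
    """Return the occupancy percentage found in the HTML, or None."""
    match = OCCUPANCY_PATTERN.search(html)
    return int(match.group(1)) if match else None


# Works offline on the sample snippet above.
print(extract_occupancy(SAMPLE_HTML))
```

With a real page you would pass the result of `fetch_html(...)` to `extract_occupancy` and then adjust the pattern step by step until it matches that site's markup.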

🛠 Prerequisite:
We will use Google Colab notebooks, which are prepared for you and hosted online by Google. A Google account is required to play with the notebook. If you don’t want to use your Google account or don’t have one, we will also share the notebook file itself so you can run it locally. In that case, you will need a program that can run notebooks on your computer, for example Jupyter Notebook, which is included in Anaconda.

👩‍💻 About the speakers:

--- Anglina has recently completed a machine learning internship and is now working as a data analyst at a cool FinTech company in London. She is also starting her Data Science MSc this September, and enjoys learning about all things Python. Anglina is a sustainability enthusiast and into almost all types of exercise (including bouldering). You can find her on Twitter: @_AnglinaB

--- Ane has been in Germany for 4 years and studied NLP at Ludwig-Maximilians-University in Munich. With experience in software development, data science, and recently DevOps, she has more project ideas than sunlight hours, so if you want to improve your coding skills but don't have ideas, feel free to reach out 😄 Apart from coding, she likes to read, ride her bike, go bouldering and play music. You can find her on Twitter: @aberasategi

--- Sowmya Guru is a founder at Coder Bee (www.coderbee.de), Product Engineer, mentor and speaker. She will start this evening with a motivational 5-minute Non-Coding-Super-Power talk. You can find her on Twitter: @justabadrobot

--- Heike is a self- and community-taught Python developer and career changer. After working for several years in the film industry as a colourist, she is now developing image processing and analysis tools with Python.

🦸🏻‍♀️ 🦹 Non Coding Super Power Talk
--- Devs Be Brave ---
When you're a developer, the easy part is writing code. With experience I have learned that the hardest thing to do is to be brave and tell it like it is.
It's not easy but it needs to be done!

📆 Agenda
18h00 Community Announcements
18h10 Non Coding Super Powers
18h20 5 Minutes Break
18h25 Introduction to Web-Scraping & Regex
19h00 Web-Scraping & Regex exercise
19h30 See You Next Time! :D

---
• By attending our online event, you agree to the PyLadies Code of Conduct: https://www.pyladies.com/CodeOfConduct/

• Contact
Interested in speaking at one of our events? Have a good idea for a Meetup? Get in touch with us at berlin@pyladies.com

Find us on the PyLadies Global workspace:

  1. Go to https://slackin.pyladies.com, enter your email address, and accept the email invitation
  2. Go to the workspace https://pyladies.slack.com
  3. Join the channels #city-berlin, #germany, #jobs-europe
