How Would You Do It? Data Challenge: Web Scraping Amazon's Best sellers


Details
This month's meetup will focus on solving a data challenge. The objective is for attendees to work on solving a specific data problem on their own (or with others!) and then sharing how they arrived at a solution during the meetup. We will provide a loose structure around the problem, but feel free to go above and beyond. You learn by doing!
____________________________________________
The Challenge:
Scrape data for the Top 100 Best Sellers in Data Processing:
https://www.amazon.com/gp/bestsellers/books/10806588011/ref=pd_zg_hrsr_books
Suggested Data Fields to Capture:
- Title
- Author
- Publication Year
- Price
- Ratings (optional)
- Reviews (optional)
- ISBN (optional)
- Frequently Bought Together (optional)
- Table of Contents (optional)
- Product details (optional)
Suggested Analyses (optional):
- Text analysis on reviews
- Which programming languages or technologies appear in the Top 100?
- Are there price differences among books focused on R or Python?
- How have the Top 100 books changed over time?
- Which books appear the most in the Frequently Bought Together section?
Web Scraping Resources:
https://justrthings.com/2019/03/03/web-scraping-amazon-reviews-march-2019/
https://www.freecodecamp.org/news/an-introduction-to-web-scraping-using-r-40284110c848/
https://martinctc.github.io/blog/vignette-scraping-amazon-reviews-in-r/
https://www.r-bloggers.com/web-scraping-amazon-reviews-march-2019/
https://www.youtube.com/watch?v=smriIBG08ok
You may adjust this data challenge to suit your interests. We look forward to seeing the different solutions you all come up with!
____________________________________________
******* CALL FOR SPEAKERS **********
Please fill out the following form if you would like to present at one of our events.
https://forms.gle/bydCPsy9j6NCbWhs5
First-time speakers are welcome!
********** DON'T BE SHY **************
********* TOPIC SUBMISSION *********
Please use the link below to suggest topics or events for future meetups.
https://forms.gle/vNeHDjuNg2U7C8zv8
*****************************************
*************JOB SEEKERS *************
Use the link below to submit your information.
https://forms.gle/R31QZHEKt1z2sd3x8
*****************************************
*****************SLACK ****************
Join our slack!
http://bit.ly/HRUG-slack-invite
*****************************************

How Would You Do It? Data Challenge: Web Scraping Amazon's Best sellers