Web Scraping with AI
Details
NOTE: Our classes are for any one of any level who is interested in the subject. The first half will be seminar, and the second half will be hands on writing code. (You can leave after the seminar if you like). If you want to build the examples just bring a Windows/Mac/Linux laptop with preferably over 8 GB of RAM.
This is for educational purposes only... ... ahem...
This class will show you how to scrape the web for information that your AI system can use to provide responses. We will scrape websites, RSS Feeds, PDF files and YouTube videos.
The trick to scraping the web for your AI projects is to get text that is in a usable format, and remove unnecessary text so that you do not over pay for tokens. We will be using well known Python modules to get text from websites, PDF's and YouTube videos so that we can then do something based off of that information.
This class will demonstrate beyond a shadow of a doubt why AI will "kill" the internet...
This class will go over:
- The concept of scraping documents
- BeautifulSoup for web page scraping
- FeedParser for RSS feed scraping
- Scraping PDF's
- Scraping YouTube Videos
- What to do with the data once you have it
- How to build a full auto blog
- Legal and ethical considerations
We will explain how these services work. Demonstrate how to use these services in live code, and you will have time to create simple labs to get the feel for how easy AI can be.
The first part of the class will be seminar, and then the second part is where you will write code and I'll be able to help troubleshoot any problems you run into.
Note 2: We don't have any food/ drink sponsors so please eat beforehand, or... bring enough for the class!
Note 3: This will be at American Underground after normal hours. I will send out the key code to get into the building before the class so make sure to look in your email for the code before you come. I will be in the classroom by 5pm. Please feel free to come early to talk or setup your computer.
