About us
Statistics, AI, and ML meetups (lectures, working groups, demos, talks)
Free GPU time on AMD hardware is available with an accepted proposal. Everyone gets $100 free to get started.
https://www.amd.com/en/developer/ai-dev-program.html
We understand that learning can be frustrating, and we try to be patient by providing a free service with access to GPU time and mentors. Everybody here is a volunteer, and the ease of manipulation via social media has led to a growth of unreasonable expectations no longer bounded by common manners. This group is designed to help individuals with sufficient programming skill grow through collaboration. Mr. Wielga's comments are inaccurate. I spent hours preparing rules and examples for numpy/pytorch broadcasting and a strategy for turning probability formulas into pytorch code using ChatGPT. This was put into a form for easier reading. Mr. Wielga is a bootcamp-educated golang programmer. His expectation that others are responsible for educating him so that he can contribute to an RL programming group is absurd. We have removed Stan Wielga permanently.
Upcoming events
56

RL Work Sessions
Online · RL Working Group:
Register your email here if you want to come to the meetups.
https://forms.gle/MkpiRu39xrg62tKCA
We have had issues with Zoom bombing and crazy people in general. These meetups are no longer open to the general public without qualification.
Participants collaborate with others. Projects range from homework assignments to reimplementation of papers. This isn't a class. There is some minimal background you will need to be able to contribute.
Proposal: Imitation learning to improve BrowserGym leaderboard benchmarks for open source models.
Jobs: https://x.com/adcock_brett/status/2018919553963880613
https://x.com/adcock_brett/status/2018417226895028414
Looking for projects? The class websites are good starting points.
We started here a year ago:
Coursera RL
Current techniques for RL:
Kevin Murphy's RL Notes
Multi-agent systems are the next step in LLM applications.
cs234 Spring 2024 YT Videos
cs224r Deep Reinforcement Learning Class website
Create agent apps using web actions
cs224r YT Videos
There are a couple hundred projects at the cs224r website. Practice here with the same format for your projects. You have the luxury of additional time.
- Build some prototypes to get a proof of concept and check feasibility.
- Talk with Professor Huang and see if what you are going to do makes sense.
- Fill out a proposal with AMD for GPU cluster time.
- https://cs224r.stanford.edu/material/CS224R_Custom_Project_Guidelines.pdf
- Overleaf cs224r project template: https://drive.google.com/file/d/1TdXav51fMSQPjT83Ajdz3ZRRMB6xnhjB/view
cs224r projects
vLLM Github
vLLM office hours; you can ask questions here
vLLM Slack channel; you will have to answer a basic technical question to get in. No, we don't give you the answer.
vLLM production stack;
nanovllm for learning:
vLLM is ok for non-distributed models
If you need distributed: SGLang
miniSGLang for learning
https://ma-lab-berkeley.github.io/deep-representation-learning-book
Free GPU Time sponsored by AMD
They give everyone $100 free, no questions asked. Additional time is available after project approval.
2 attendees
Past events
1168
