RLHF with PPO/DPO + Building Direct Preference Optimization on Trainium
Details
RSVP Webinar: https://www.eventbrite.com/e/webinar-generative-ai-on-aws-tickets-45852865154
Talk #0: Introduction
by Chris Fregly (Principal SA, Generative AI) and Antje Barth (Principal Developer Advocate, Generative AI)
Talk #1: Human Alignment with Reinforcement Learning from Human Feedback (RLHF) with both PPO and DPO
by Antje Barth (Principal Developer Advocate, Generative AI)
Talk #2: Building Direct Preference Optimization (DPO) on Trainium/Neuron SDK
by Hunter Carlisle (Senior SA, Annapurna ML)
RSVP Webinar: https://www.eventbrite.com/e/webinar-generative-ai-on-aws-tickets-45852865154
Zoom link: https://us02web.zoom.us/j/82308186562
Related Links
Generative AI Free Course on DeepLearning.ai: https://bit.ly/gllm
O'Reilly Book: https://www.amazon.com/Generative-AWS-Context-Aware-Multimodal-Applications/dp/1098159225
Website: https://generativeaionaws.com
Meetup: https://meetup.generativeaionaws.com
GitHub Repo: https://github.com/generative-ai-on-aws/
YouTube: https://youtube.generativeaionaws.com
Every 3rd Monday of the month until November 17, 2024
RLHF with PPO/DPO + Building Direct Preference Optimization on Trainium