Skip to content

Details

This week Damian will be presenting Direct Preference Optimization: Your Language Model is Secretly a Reward Model by Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn.

If you have the time, please do read the paper ahead of the call as this will aid a more in depth discussion on the day.

We look forward to seeing you all there!

We discuss a different research paper every week. We post each week's paper in our GitHub repo - please read it before the meetup.

All events will be hosted on a Google Meets video call. Once a month we also host an in-person event in our London office - watch this space for updates.

All recorded presentations can be found in our YouTube channel (don't forget to subscribe!).

Related topics

Artificial Intelligence
Artificial Intelligence Applications
Machine Learning
Natural Language Processing
Neural Networks

You may also like