Sveriges mest populära poddar

The Data Exchange with Ben Lorica

Fine-tuning and Preference Alignment in a Single Streamlined Process

36 min • 13 juni 2024

Jiwoo Hong and  Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model

Subscribe to the Gradient Flow Newsletterhttps://gradientflow.substack.com/

Subscribe: AppleSpotify OvercastPocket CastsAntennaPodPodcast AddictAmazon •  RSS.

Detailed show notes can be found on The Data Exchange web site.

Förekommer på
00:00 -00:00