
Microsoft Research Podcast

Abstracts: July 29, 2024

8 min • July 29, 2024

A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input that language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending context windows to more than 2 million tokens.

Read the paper

Get the code
