Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models (AI summary)

Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

Read more

Chain-of-Thought Reasoning Without Prompting (AI sumamry)

Xuezhi Wang, Denny Zhou

Read more

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator (AI sumamry)

Ziru Chen, Michael White, Raymond Mooney, Ali Payani, Yu Su, Huan Sun

Read more

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss (AI sumamry)

Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Dmitry Sorokin, Artyom Sorokin, Mikhail Burtsev

Read more

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs (AI sumamry)

Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker

Read more

LoRA+: Efficient Low Rank Adaptation of Large Models (AI sumamry)

Soufiane Hayou, Nikhil Ghosh, Bin Yu

Read more

Generative Representational Instruction Tuning (AI sumamry)

Niklas Muennighoff, Hongjin Su, Liang Wang, Nan Yang, Furu Wei, Tao Yu, Amanpreet Singh, Douwe Kiela

Read more

Large Language Models for Data Annotation: A Survey (AI summary)

Zhen Tan, Alimohammad Beigi, Song Wang, Ruocheng Guo, Amrita Bhattacharjee, Bohan Jiang, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu

Read more

FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation (AI summary)

Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc Le, Thang Luong

Read more

World Model on Million-Length Video And Language With RingAttention (AI summary)

Hao Liu, Wilson Yan, Matei Zaharia, Pieter Abbeel

Read more
×