The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (AI summary)

Read more

Language models scale reliably with over-training and on downstream tasks (AI summary)

Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Jenia Jitsev, Alexandros G. Dimakis, Gabriel Ilharco, Shuran Song, Thomas Kollar, Yair Carmon, Achal Dave, Reinhard Heckel, Niklas Muennighoff, Ludwig Schmidt

Read more

Sora Generates Videos with Stunning Geometrical Consistency (AI summary)

Xuanyi Li, Daquan Zhou, Chenxu Zhang, Shaodong Wei, Qibin Hou, Ming-Ming Cheng

Read more

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method (AI summary)

Biao Zhang, Zhongtao Liu, Colin Cherry, Orhan Firat

Read more

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization? (AI summary)

Xue-Yong Fu, Md Tahmid Rahman Laskar, Elena Khasanova, Cheng Chen, Shashi Bhushan TN

Read more

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement (AI sumamry)

Tianyu Zheng, Ge Zhang, Tianhao Shen, Xueling Liu, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue

Read more

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models (AI summary)

Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

Read more

Chain-of-Thought Reasoning Without Prompting (AI sumamry)

Xuezhi Wang, Denny Zhou

Read more

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator (AI sumamry)

Ziru Chen, Michael White, Raymond Mooney, Ali Payani, Yu Su, Huan Sun

Read more

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss (AI sumamry)

Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Dmitry Sorokin, Artyom Sorokin, Mikhail Burtsev

Read more
×