In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss (AI summary)

Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Dmitry Sorokin, Artyom Sorokin, Mikhail Burtsev

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs (AI summary)

Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker

LoRA+: Efficient Low Rank Adaptation of Large Models (AI summary)

Soufiane Hayou, Nikhil Ghosh, Bin Yu

Generative Representational Instruction Tuning (AI summary)

Niklas Muennighoff, Hongjin Su, Liang Wang, Nan Yang, Furu Wei, Tao Yu, Amanpreet Singh, Douwe Kiela

Large Language Models for Data Annotation: A Survey (AI summary)

Zhen Tan, Alimohammad Beigi, Song Wang, Ruocheng Guo, Amrita Bhattacharjee, Bohan Jiang, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu

FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation (AI summary)

Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc Le, Thang Luong

World Model on Million-Length Video And Language With RingAttention (AI summary)

Hao Liu, Wilson Yan, Matei Zaharia, Pieter Abbeel

The boundary of neural network trainability is fractal (AI summary)

Jascha Sohl-Dickstein

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement (AI summary)

Zhiyong Wu, Chengcheng Han, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong

ChemLLM: A Chemical Large Language Model (AI summary)

Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Dongzhan Zhou, Shufei Zhang, Mao Su, Hansen Zhong, Yuqiang Li, Wanli Ouyang
