Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention (AI summary)

Tsendsuren Munkhdalai, Manaal Faruqui, Siddharth Gopal

Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models (AI summary)

Wenshan Wu, Shaoguang Mao, Yadong Zhang, Yan Xia, Li Dong, Lei Cui, Furu Wei

Long-context LLMs Struggle with Long In-context Learning (AI summary)

Tianle Li, Ge Zhang, Quy Duc Do, Xiang Yue, Wenhu Chen

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models (AI summary)

David Raposo, Sam Ritter, Blake Richards, Timothy Lillicrap, Peter Conway Humphreys, Adam Santoro

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement (AI summary)

Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions (AI summary)

Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn Lawrie, Luca Soldaini

AIOS: LLM Agent Operating System (AI summary)

Kai Mei, Zelong Li, Shuyuan Xu, Ruosong Ye, Yingqiang Ge, Yongfeng Zhang

Agent Lumos: Unified and Modular Training for Open-Source Language Agents (AI summary)

Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Chandu, Kai-Wei Chang, Yejin Choi, Bill Yuchen Lin

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models (AI summary)

Yanwei Li, Yuechen Zhang, Chengyao Wang, Zhisheng Zhong, Yixin Chen, Ruihang Chu, Shaoteng Liu, Jiaya Jia

A comparison of Human, GPT-3.5, and GPT-4 Performance in a University-Level Coding Course (AI summary)

Will Yeadon, Alex Peach, Craig P. Testrow
