Long-context LLMs Struggle with Long In-context Learning (AI summary)

Tianle Li, Ge Zhang, Quy Duc Do, Xiang Yue, Wenhu Chen

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models (AI summary)

David Raposo, Sam Ritter, Blake Richards, Timothy Lillicrap, Peter Conway Humphreys, Adam Santoro

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement (AI summary)

Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions (AI summary)

Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn Lawrie, Luca Soldaini

AIOS: LLM Agent Operating System (AI summary)

Kai Mei, Zelong Li, Shuyuan Xu, Ruosong Ye, Yingqiang Ge, Yongfeng Zhang

Agent Lumos: Unified and Modular Training for Open-Source Language Agents (AI summary)

Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Chandu, Kai-Wei Chang, Yejin Choi, Bill Yuchen Lin

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models (AI summary)

Yanwei Li, Yuechen Zhang, Chengyao Wang, Zhisheng Zhong, Yixin Chen, Ruihang Chu, Shaoteng Liu, Jiaya Jia

A comparison of Human, GPT-3.5, and GPT-4 Performance in a University-Level Coding Course (AI summary)

Will Yeadon, Alex Peach, Craig P. Testrow

Long-form factuality in large language models (AI summary)

Jerry Wei, Chengrun Yang, Xinying Song, Yifeng Lu, Nathan Hu, Dustin Tran, Daiyi Peng, Ruibo Liu, Da Huang, Cosmo Du, Quoc V. Le

Evolutionary Optimization of Model Merging Recipes (AI summary)

Takuya Akiba, Makoto Shing, Yujin Tang, Qi Sun, David Ha
