In Defense of RAG in the Era of Long-Context Language Models (AI summary)

Tan Yu, Anbang Xu, Rama Akkiraju

Read more

Training Language Models to Self-Correct via Reinforcement Learning (AI summary)

Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust

Read more

A Practitioner's Guide to Continual Multimodal Pretraining (AI summary)

Karsten Roth, Vishaal Udandarao, Sebastian Dziadzio, Ameya Prabhu, Mehdi Cherti, Oriol Vinyals, Olivier Hénaff, Samuel Albanie, Matthias Bethge, Zeynep Akata

Read more

Text2SQL is Not Enough: Unifying AI and Databases with TAG (AI summary)

Asim Biswal, Liana Patel, Siddarth Jha, Amog Kamsetty, Shu Liu, Joseph E. Gonzalez, Carlos Guestrin, Matei Zaharia

Read more

ReMamba: Equip Mamba with Effective Long-Sequence Modeling (AI summary)

Danlong Yuan, Jiahao Liu, Bei Li, Huishuai Zhang, Jingang Wang, Xunliang Cai, Dongyan Zhao

Read more

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model (AI summary)

Chunting Zhou, Lili Yu, Arun Babu, Kushal Tirumala, Michihiro Yasunaga, Leonid Shamis, Jacob Kahn, Xuezhe Ma, Luke Zettlemoyer, Omer Levy

Read more

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling (AI summary)

Hritik Bansal, Arian Hosseini, Rishabh Agarwal, Vinh Q. Tran, Mehran Kazemi

Read more

Persuasion Games using Large Language Models (AI summary)

Ganesh Prasath Ramani, Shirish Karande, Santhosh V, Yash Bhatia

Read more

AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems (AI summary)

Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi

Read more

Agentic Retrieval-Augmented Generation for Time Series Analysis (AI summary)

Chidaksh Ravuru, Sagar Srinivas Sakhinana, Venkataramana Runkana

Read more
×