Discovering Preference Optimization Algorithms with and for Large Language Models (AI summary)

Chris Lu, Samuel Holt, Claudio Fanconi, Alex J. Chan, Jakob Foerster, Mihaela van der Schaar, Robert Tjarko Lange

Read more

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales (AI summary)

Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao

Read more

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models (AI summary)

Ling Yang, Zhaochen Yu, Tianjun Zhang, Shiyi Cao, Minkai Xu, Wentao Zhang, Joseph E. Gonzalez, Bin Cui

Read more

Faithful Logical Reasoning via Symbolic Chain-of-Thought (AI summary)

Jundong Xu, Hao Fei, Liangming Pan, Qian Liu, Mong-Li Lee, Wynne Hsu

Read more

Scalable MatMul-free Language Modeling (AI summary)

Rui-Jie Zhu, Yu Zhang, Ethan Sifferman, Tyler Sheaves, Yiqiao Wang, Dustin Richmond, Peng Zhou, Jason K. Eshraghian

Read more

SimPO: Simple Preference Optimization with a Reference-Free Reward (AI summary)

Yu Meng, Mengzhou Xia, Danqi Chen

Read more

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality (AI summary)

Tri Dao, Albert Gu

Read more

Are Long-LLMs A Necessity For Long-Context Tasks? (AI summary)

Hongjin Qian, Zheng Liu, Peitian Zhang, Kelong Mao, Yujia Zhou, Xu Chen, Zhicheng Dou

Read more

Aya 23: Open Weight Releases to Further Multilingual Progress (AI summary)

Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Jon Ander Campos, Yi Chern Tan, Kelly Marchisio, Max Bartolo, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Aidan Gomez, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

Read more

Attention as an RNN (AI summary)

Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Mohamed Osama Ahmed, Yoshua Bengio, Greg Mori

Read more
×