ODIN: Disentangled Reward Mitigates Hacking in RLHF (AI summary)

Lichang Chen, Chen Zhu, Davit Soselia, Jiuhai Chen, Tianyi Zhou, Tom Goldstein, Heng Huang, Mohammad Shoeybi, Bryan Catanzaro

Direct Language Model Alignment from Online AI Feedback (AI summary)

Shangmin Guo, Biao Zhang, Tianlin Liu, Tianqi Liu, Misha Khalman, Felipe Llinares, Alexandre Rame, Thomas Mesnard, Yao Zhao, Bilal Piot, Johan Ferret, Mathieu Blondel

Scaling Laws for Downstream Task Performance of Large Language Models (AI summary)

Berivan Isik, Natalia Ponomareva, Hussein Hazimeh, Dimitris Paparas, Sergei Vassilvitskii, Sanmi Koyejo

MOMENT: A Family of Open Time-series Foundation Models (AI summary)

Mononito Goswami, Konrad Szafer, Arjun Choudhry, Yifu Cai, Shuo Li, Artur Dubrawski

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model (AI summary)

Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen

Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models (AI summary)

Jianyuan Guo, Hanting Chen, Chengcheng Wang, Kai Han, Chang Xu, Yunhe Wang

LiPO: Listwise Preference Optimization through Learning-to-Rank (AI summary)

Tianqi Liu, Zhen Qin, Junru Wu, Jiaming Shen, Misha Khalman, Rishabh Joshi, Yao Zhao, Mohammad Saleh, Simon Baumgartner, Jialu Liu, Peter J. Liu, Xuanhui Wang

FindingEmo: An Image Dataset for Emotion Recognition in the Wild (AI summary)

Laurent Mertens, Elahe' Yargholi, Hans Op de Beeck, Jan Van den Stock, Joost Vennekens

Repeat After Me: Transformers are Better than State Space Models at Copying (AI summary)

Samy Jelassi, David Brandfonbrener, Sham M. Kakade, Eran Malach

Efficient Exploration for LLMs (AI summary)

Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy
