Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities (AI summary)

Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue

Mathematical Language Models: A Survey (AI summary)

Wentao Liu, Hanglei Hu, Jie Zhou, Yuyang Ding, Junsong Li, Jiayi Zeng, Mengliang He, Qin Chen, Bo Jiang, Aimin Zhou, Liang He

Vision Transformers Need Registers (AI summary)

Timothée Darcet, Maxime Oquab, Julien Mairal, Piotr Bojanowski

A Survey of Reasoning with Foundation Models (AI summary)

Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li

EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty (AI summary)

Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 (AI summary)

Hamish Ivison, Yizhong Wang, Valentina Pyatkin, Nathan Lambert, Matthew Peters, Pradeep Dasigi, Joel Jang, David Wadden, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

Textbooks Are All You Need (AI summary)

Suriya Gunasekar, Yi Zhang, Jyoti Aneja, Caio César Teodoro Mendes, Allie Del Giorno, Sivakanth Gopi, Mojan Javaheripi, Piero Kauffmann, Gustavo de Rosa, Olli Saarikivi, Adil Salim, Shital Shah, Harkirat Singh Behl, Xin Wang, Sébastien Bubeck, Ronen Eldan, Adam Tauman Kalai, Yin Tat Lee, Yuanzhi Li

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models (AI summary)

Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Junwu Zhang, Munan Ning, Li Yuan

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling (AI summary)

Pratyush Maini, Skyler Seto, He Bai, David Grangier, Yizhe Zhang, Navdeep Jaitly

Large Language Models on Graphs: A Comprehensive Survey (AI summary)

Bowen Jin, Gang Liu, Chi Han, Meng Jiang, Heng Ji, Jiawei Han
