YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information (AI summary)

Chien-Yao Wang, I-Hau Yeh, Hong-Yuan Mark Liao

Read more

Neural Network Diffusion (AI summary)

Kai Wang, Zhaopan Xu, Yukun Zhou, Zelin Zang, Trevor Darrell, Zhuang Liu, Yang You

Read more

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling (AI summary)

Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yugang Jiang, Xipeng Qiu

Read more

Reformatted Alignment (AI summary)

Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu

Read more

Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs (AI summary)

Nicolas Boizard, Kevin El Haddad, Céline Hudelot, Pierre Colombo

Read more

Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning (AI summary)

Zhiyang Xu, Chao Feng, Rulin Shao, Trevor Ashby, Ying Shen, Di Jin, Yu Cheng, Qifan Wang, Lifu Huang

Read more

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration (AI summary)

Jun Zhao, Can Zu, Hao Xu, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang

Read more

OneBit: Towards Extremely Low-bit Large Language Models (AI summary)

Yuzhuang Xu, Xu Han, Zonghan Yang, Shuo Wang, Qingfu Zhu, Zhiyuan Liu, Weidong Liu, Wanxiang Che

Read more

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models (AI summary)

Gagan Bhatia, El Moatez Billah Nagoudi, Hasan Cavusoglu, Muhammad Abdul-Mageed

Read more

Recovering the Pre-Fine-Tuning Weights of Generative Models (AI summary)

Eliahu Horwitz, Jonathan Kahana, Yedid Hoshen

Read more
×