MambaByte: Token-free Selective State Space Model (AI summary)

Junxiong Wang, Tushaar Gangavarapu, Jing Nathan Yan, Alexander M Rush

Read more

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All (AI summary)

Mehmet Saygin Seyfioglu, Karim Bouyarmane, Suren Kumar, Amir Tavanaei, Ismail B. Tutar

Read more

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models (AI summary)

Zhihong Shao, Peiyi Wang, Qihao Zhu, Runxin Xu, Junxiao Song, Mingchuan Zhang, Y.K. Li, Y. Wu, Daya Guo

Read more

WARM: On the Benefits of Weight Averaged Reward Models (AI summary)

Alexandre Ramé, Nino Vieillard, Léonard Hussenot, Robert Dadashi, Geoffrey Cideron, Olivier Bachem, Johan Ferret

Read more

A Survey of Resource-efficient LLM and Multimodal Foundation Models (AI summary)

Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu

Read more

Self-Discover: Large Language Models Self-Compose Reasoning Structures (AI summary)

Pei Zhou, Jay Pujara, Xiang Ren, Xinyun Chen, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng

Read more

Red Teaming Visual Language Models (AI summary)

Mukai Li, Lei Li, Yuwei Yin, Masood Ahmed, Zhenguang Liu, Qi Liu

Read more

Lumiere: A Space-Time Diffusion Model for Video Generation (AI summary)

Omer Bar-Tal, Hila Chefer, Omer Tov, Charles Herrmann, Roni Paiss, Shiran Zada, Ariel Ephrat, Junhwa Hur, Yuanzhen Li, Tomer Michaeli, Oliver Wang, Deqing Sun, Tali Dekel, Inbar Mosseri

Read more

More Agents Is All You Need (AI summary)

Junyou Li, Qin Zhang, Yangbin Yu, Qiang Fu, Deheng Ye

Read more

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads (AI summary)

Tianle Cai, Yuhong Li, Zhengyang Geng, Hongwu Peng, Jason D. Lee, Deming Chen, Tri Dao

Read more
×