Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer (AI summary)

Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

Read more

A Comprehensive Survey of Compression Algorithms for Language Models (AI summary)

Seungcheol Park, Jaehyeon Choi, Sojin Lee, U Kang

Read more

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (AI summary)

Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher Ré

Read more

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module (AI summary)

Simian Luo, Yiqin Tan, Suraj Patil, Daniel Gu, Patrick von Platen, Apolinário Passos, Longbo Huang, Jian Li, Hang Zhao

Read more

Large Language Models for Mathematical Reasoning: Progresses and Challenges (AI summary)

Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin

Read more

Internet-augmented language models through few-shot prompting for open-domain question answering (AI summary)

Angeliki Lazaridou, Elena Gribovskaya, Wojciech Stokowiec, Nikolai Grigorev

Read more

Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models (AI summary)

Wenhao Yu, Hongming Zhang, Xiaoman Pan, Kaixin Ma, Hongwei Wang, Dong Yu

Read more

Corrective Retrieval Augmented Generation (AI summary)

Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling

Read more

Proximal Policy Optimization Algorithms (AI summary)

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov

Read more

LLaMA: Open and Efficient Foundation Language Models (AI summary)

Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timothée Lacroix, Baptiste Rozière, Naman Goyal, Eric Hambro, Faisal Azhar, Aurelien Rodriguez, Armand Joulin, Edouard Grave, Guillaume Lample

Read more
×