A Survey on Hallucination in Large Vision-Language Models (AI summary)

Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng

Read more

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models (AI summary)

Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel

Read more

Direct Preference Optimization: Your Language Model is Secretly a Reward Model (AI summary)

Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn

Read more

The Power of Noise: Redefining Retrieval for RAG Systems (AI summary)

Florin Cuconasu, Giovanni Trappolini, Federico Siciliano, Simone Filice, Cesare Campagnano, Yoelle Maarek, Nicola Tonellotto, Fabrizio Silvestri

Read more

QLoRA: Efficient Finetuning of Quantized LLMs (AI summary)

Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer

Read more

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer (AI summary)

Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

Read more

A Comprehensive Survey of Compression Algorithms for Language Models (AI summary)

Seungcheol Park, Jaehyeon Choi, Sojin Lee, U Kang

Read more

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (AI summary)

Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher Ré

Read more

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module (AI summary)

Simian Luo, Yiqin Tan, Suraj Patil, Daniel Gu, Patrick von Platen, Apolinário Passos, Longbo Huang, Jian Li, Hang Zhao

Read more

Large Language Models for Mathematical Reasoning: Progresses and Challenges (AI summary)

Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin

Read more
×