Emergent Abilities of Large Language Models (AI summary)

Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus

Scaling Laws for Neural Language Models (AI summary)

Jared Kaplan, Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, Dario Amodei

A Survey on Hallucination in Large Vision-Language Models (AI summary)

Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models (AI summary)

Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel

Direct Preference Optimization: Your Language Model is Secretly a Reward Model (AI summary)

Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn

The Power of Noise: Redefining Retrieval for RAG Systems (AI summary)

Florin Cuconasu, Giovanni Trappolini, Federico Siciliano, Simone Filice, Cesare Campagnano, Yoelle Maarek, Nicola Tonellotto, Fabrizio Silvestri

QLoRA: Efficient Finetuning of Quantized LLMs (AI summary)

Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer (AI summary)

Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

A Comprehensive Survey of Compression Algorithms for Language Models (AI summary)

Seungcheol Park, Jaehyeon Choi, Sojin Lee, U Kang

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (AI summary)

Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher Ré
