ChatQA: Building GPT-4 Level Conversational QA Models (AI summary)

Zihan Liu, Wei Ping, Rajarshi Roy, Peng Xu, Chankyu Lee, Mohammad Shoeybi, Bryan Catanzaro

Read more

A phase transition between positional and semantic learning in a solvable model of dot-product attention (AI summary)

Hugo Cui, Freya Behrens, Florent Krzakala, Lenka Zdeborová

Read more

Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering (AI summary)

Tal Ridnik, Dedy Kredo, Itamar Friedman

Read more

Self-Rewarding Language Models( AI summary)

Weizhe Yuan, Richard Yuanzhe Pang, Kyunghyun Cho, Sainbayar Sukhbaatar, Jing Xu, Jason Weston

Read more

AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls (AI summary)

Yu Du, Fangyun Wei, Hongyang Zhang

Read more

Consistency Models (AI summary)

Yang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever

Read more

Language Models are Few-Shot Learners (AI summary)

Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei

Read more

Grandmaster-Level Chess Without Search (AI summary)

Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Li Kevin Wenliang, Elliot Catt, John Reid, Tim Genewein

Read more

Leveraging Large Language Models for NLG Evaluation: A Survey (AI summary)

Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Chongyang Tao

Read more

SliceGPT: Compress Large Language Models by Deleting Rows and Columns (AI summary)

Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento, Torsten Hoefler, James Hensman

Read more
×