Controllable Text Generation for Large Language Models: A Survey (AI summary)

Xun Liang, Hanyu Wang, Yezhaohui Wang, Shichao Song, Jiawei Yang, Simin Niu, Jie Hu, Dan Liu, Shunyu Yao, Feiyu Xiong, Zhiyu Li

Read more

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding (AI summary)

Jian Chen, Vashisth Tiwari, Ranajoy Sadhukhan, Zhuoming Chen, Jinyuan Shi, Ian En-Hsu Yen, Beidi Chen

Read more

Graph Retrieval-Augmented Generation: A Survey (AI summary)

Boci Peng, Yun Zhu, Yongchao Liu, Xiaohe Bo, Haizhou Shi, Chuntao Hong, Yan Zhang, Siliang Tang

Read more

Enhancing Robustness in Large Language Models: Prompting for Mitigating the Impact of Irrelevant Information (AI summary)

Ming Jiang, Tingting Huang, Biao Guo, Yao Lu, Feng Zhang

Read more

The Vizier Gaussian Process Bandit Algorithm (AI summary)

Xingyou Song, Qiuyi Zhang, Chansoo Lee, Emily Fertig, Tzu-Kuo Huang, Lior Belenki, Greg Kochanski, Setareh Ariafar, Srinivas Vasudevan, Sagi Perel, Daniel Golovin

Read more

LLM Pruning and Distillation in Practice: The Minitron Approach (AI summary)

Sharath Turuvekere Sreenivas, Saurav Muralidharan, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov

Read more

Automated Design of Agentic Systems (AI summary)

Shengran Hu, Cong Lu, Jeff Clune

Read more

Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation (AI summary)

Junde Wu, Jiayuan Zhu, Yunli Qi

Read more

A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? (AI summary)

Xinyu Liu, Shuyu Shen, Boyan Li, Peixian Ma, Runzhi Jiang, Yuyu Luo, Yuxin Zhang, Ju Fan, Guoliang Li, Nan Tang

Read more

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers (AI summary)

Zhenting Qi, Mingyuan Ma, Jiahang Xu, Li Lyna Zhang, Fan Yang, Mao Yang

Read more
×