Small Language Models: Survey, Measurements, and Insights (AI summary)

Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi, Fangming Liu, Xiwen Zhang, Nicholas D. Lane, Mengwei Xu

Read more

Achieving Peak Performance for Large Language Models: A Systematic Review (AI summary)

Zhyar Rzgar K Rostam, Sándor Szénási, Gábor Kertész

Read more

A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor? (AI summary)

Yunfei Xie, Juncheng Wu, Haoqin Tu, Siwei Yang, Bingchen Zhao, Yongshuo Zong, Qiao Jin, Cihang Xie, Yuyin Zhou

Read more

Theory, Analysis, and Best Practices for Sigmoid Self-Attention (AI summary)

Jason Ramapuram, Federico Danieli, Eeshan Dhekane, Floris Weers, Dan Busbridge, Pierre Ablin, Tatiana Likhomanenko, Jagrit Digani, Zijin Gu, Amitis Shidani, Russ Webb

Read more

Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely (AI summary)

Siyun Zhao, Yuqing Yang, Zilong Wang, Zhiyuan He, Luna K. Qiu, Lili Qiu

Read more

Can Large Language Models Unlock Novel Scientific Research Ideas? (AI summary)

Sandeep Kumar, Tirthankar Ghosal, Vinayak Goyal, Asif Ekbal

Read more

Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models (AI summary)

Tongxuan Liu, Wenjiang Xu, Weizhe Huang, Xingyu Wang, Jiaxing Wang, Hailong Yang, Jing Li

Read more

LLaMA-Omni: Seamless Speech Interaction with Large Language Models (AI summary)

Qingkai Fang, Shoutao Guo, Yan Zhou, Zhengrui Ma, Shaolei Zhang, Yang Feng

Read more

LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench

Karthik Valmeekam, Kaya Stechly, Subbarao Kambhampati

Read more

What is the Role of Small Models in the LLM Era: A Survey (AI summary)

Lihu Chen, Gaël Varoquaux

Read more
×