A Comprehensive Study of Knowledge Editing for Large Language Models (AI summary)

Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, Siyuan Cheng, Ziwen Xu, Xin Xu, Jia-Chen Gu, Yong Jiang, Pengjun Xie, Fei Huang, Lei Liang, Zhiqiang Zhang, Xiaowei Zhu, Jun Zhou, Huajun Chen

Read more

The Unreasonable Effectiveness of Easy Training Data for Hard Tasks (AI summary)

Peter Hase, Mohit Bansal, Peter Clark, Sarah Wiegreffe

Read more

On Layer Normalization in the Transformer Architecture (AI summary)

Ruibin Xiong, Yunchang Yang, Di He, Kai Zheng, Shuxin Zheng, Chen Xing, Huishuai Zhang, Yanyan Lan, Liwei Wang, Tie-Yan Liu

Read more

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning (AI summary)

Hongye Jin, Xiaotian Han, Jingfeng Yang, Zhimeng Jiang, Zirui Liu, Chia-Yuan Chang, Huiyuan Chen, Xia Hu

Read more

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference (AI summary)

Simian Luo, Yiqin Tan, Longbo Huang, Jian Li, Hang Zhao

Read more

Layer Normalization (AI summary)

Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton

Read more

LLaMA Beyond English: An Empirical Study on Language Capability Transfer (AI summary)

Jun Zhao, Zhihao Zhang, Luhui Gao, Qi Zhang, Tao Gui, Xuanjing Huang

Read more

Improving language models by retrieving from trillions of tokens (AI summary)

Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack W. Rae, Erich Elsen, Laurent Sifre

Read more

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models (AI summary)

Asma Ghandeharioun, Avi Caciularu, Adam Pearce, Lucas Dixon, Mor Geva

Read more

A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity (AI summary)

Andrew Lee, Xiaoyan Bai, Itamar Pres, Martin Wattenberg, Jonathan K. Kummerfeld, Rada Mihalcea

Read more
×