SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection (AI summary)

Ke Ye, Heinrich Jiang, Afshin Rostamizadeh, Ayan Chakrabarti, Giulia DeSalvo, Jean-François Kagy, Lazaros Karydas, Gui Citovsky, Sanjiv Kumar

Fast Inference of Mixture-of-Experts Language Models with Offloading (AI summary)

Artyom Eliseev, Denis Mazur

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator (AI summary)

Chengshu Li, Jacky Liang, Andy Zeng, Xinyun Chen, Karol Hausman, Dorsa Sadigh, Sergey Levine, Li Fei-Fei, Fei Xia, Brian Ichter

Rethinking Patch Dependence for Masked Autoencoders (AI summary)

Letian Fu, Long Lian, Renhao Wang, Baifeng Shi, Xudong Wang, Adam Yala, Trevor Darrell, Alexei A. Efros, Ken Goldberg

MentaLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models (AI summary)

Kailai Yang, Tianlin Zhang, Ziyan Kuang, Qianqian Xie, Sophia Ananiadou, Jimin Huang

LLM Augmented LLMs: Expanding Capabilities through Composition (AI summary)

Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar

Self-Alignment with Instruction Backtranslation (AI summary)

Xian Li, Ping Yu, Chunting Zhou, Timo Schick, Luke Zettlemoyer, Omer Levy, Jason Weston, Mike Lewis

pix2gestalt: Amodal Segmentation by Synthesizing Wholes (AI summary)

Ege Ozguroglu, Ruoshi Liu, Dídac Surís, Dian Chen, Achal Dave, Pavel Tokmakov, Carl Vondrick

Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey (AI summary)

Yunpeng Huang, Jingwei Xu, Zixu Jiang, Junyu Lai, Zenan Li, Yuan Yao, Taolue Chen, Lijuan Yang, Zhou Xin, Xiaoxing Ma

Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data? (AI summary)

Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein
