Prompt Engineer

All authors

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models (AI summary)

Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Junwu Zhang, Munan Ning, Li Yuan

• February 8th, 2024

Read more

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling (AI summary)

Pratyush Maini, Skyler Seto, He Bai, David Grangier, Yizhe Zhang, Navdeep Jaitly

• February 7th, 2024

Read more

Large Language Models on Graphs: A Comprehensive Survey (AI summary)

Bowen Jin, Gang Liu, Chi Han, Meng Jiang, Heng Ji, Jiawei Han

• February 7th, 2024

Read more

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal

• February 7th, 2024

Read more

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization (AI summary)

Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami

• February 6th, 2024

Read more

Magicoder: Source Code Is All You Need (AI summary)

Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang

• February 6th, 2024

Read more

YaRN: Efficient Context Window Extension of Large Language Models (AI summary)

Bowen Peng, Jeffrey Quesnelle, Honglu Fan, Enrico Shippole

• February 5th, 2024

Read more

Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation (AI summary)

Shaopeng Zhai, Jie Wang, Tianyi Zhang, Fuxian Huang, Qi Zhang, Ming Zhou, Jing Hou, Yu Liu

• February 5th, 2024

Read more

AppAgent: Multimodal Agents as Smartphone Users (AI summary

Chi Zhang, Zhao Yang, Jiaxuan Liu, Yucheng Han, Xin Chen, Zebiao Huang, Bin Fu, Gang Yu

• February 3rd, 2024

Read more

ChipNeMo: Domain-Adapted LLMs for Chip Design (AI summary)

Mingjie Liu, Teodor-Dumitru Ene, Robert Kirby, Chris Cheng, Nathaniel Pinckney, Rongjian Liang, Jonah Alben, Himyanshu Anand, Sanmitra Banerjee, Ismet Bayraktaroglu, Bonita Bhaskaran, Bryan Catanzaro, Arjun Chaudhuri, Sharon Clay, Bill Dally, Laura Dang, Parikshit Deshpande, Siddhanth Dhodhi, Sameer Halepete, Eric Hill, Jiashang Hu, Sumit Jain, Brucek Khailany, George Kokai, Kishor Kunal, Xiaowei Li, Charley Lind, Hao Liu, Stuart Oberman, Sujeet Omar, Sreedhar Pratty, Jonathan Raiman, Ambar Sarkar, Zhengjiang Shao, Hanfei Sun, Pratik P Suthar, Varun Tej, Walker Turner, Kaizhe Xu, Haoxing Ren

• February 2nd, 2024

Read more

47/55

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55

ML and AI papers

Prompt Engineer

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models (AI summary)

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling (AI summary)

Large Language Models on Graphs: A Comprehensive Survey (AI summary)

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization (AI summary)

Magicoder: Source Code Is All You Need (AI summary)

YaRN: Efficient Context Window Extension of Large Language Models (AI summary)

Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation (AI summary)

AppAgent: Multimodal Agents as Smartphone Users (AI summary

ChipNeMo: Domain-Adapted LLMs for Chip Design (AI summary)