Publication TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Ronen Eldan, Yuanzhi Li May 2023 Project
Publication Automatic Prompt Optimization with “Gradient Descent” and Beam Search Reid Pryzant, Dan Iter, Jerry Li, Yin Tat Lee, Chenguang Zhu, Michael Zeng May 2023
Publication Logical Transformers: Infusing Logical Structures into Pre-Trained Language Models Borui Wang, Qiuyuan Huang, Budhaditya Deb, Aaron L. Halfaker, Liqun Shao, Daniel McDuff, Ahmed Awadallah, Dragomir Radev, Jianfeng Gao Proceedings of ACL 2023 | May 2023 Project Project
Publication AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers Ganesh Jawahar, Subhabrata (Subho) Mukherjee, Xiaodong Liu, Young Jin Kim, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Ahmed Awadallah, Sébastien Bubeck, Jianfeng Gao ACL | May 2023 Project
Publication Relational Attention: Generalizing Transformers for Graph-Structured Tasks Cameron Diao, Ricky Loynd ICLR 2023 | May 2023 Spotlight
Publication Benchmarking Spatial Relationships in Text-to-Image Generation Tejas Gokhale, Hamid Palangi, Besmira Nushi, Vibhav Vineet, Eric Horvitz, Ece Kamar, Chitta Baral, Yezhou Yang MSR-TR-2023-44 | May 2023 Published by Microsoft Github
Publication Derivative Based Nonbacktracking Real-World Regex Matching with Backtracking Semantics Dan Moseley, Mario Nishio, Jose Perez Rodriguez, Olli Saarikivi, Stephen Toub, Margus Veanes, Tiki Wan, Eric Xu MSR-TR-2023-15 | April 2023 Published by Microsoft Extended version of paper that appears in PLDI 2023. Project
Publication A Large-scale Robustness Analysis of Video Action Recognition Models Madeline Chantry Schiappa, Naman Biyani, Prudvi Kamtam, Shruti Vyas, Hamid Palangi, Vibhav Vineet, Yogesh Rawat April 2023
Publication What do Compressed Large Language Models Forget? Robustness Challenges in Model Compression Mengnan Du, Subhabrata (Subho) Mukherjee, Yu Cheng, Milad Shokouhi, Xia Hu, Ahmed Awadallah EACL | April 2023 Project Project Project
Publication Sparks of Artificial General Intelligence: Early experiments with GPT-4 Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang March 2023 Video Project