Microsoft Research Blog
LLM profiling guides KV cache optimization
| Liyuan Liu and Jianfeng Gao
LLMs rely on memory-intensive mechanisms like the key-value (KV) cache, which stores the attention keys and values of previously generated tokens so they don't have to be recomputed at every decoding step. FastGen profiles a model's attention heads and tailors the KV cache policy to each one, reducing LLM memory demands by up to 50% while maintaining performance.
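To make the KV cache concrete, here is a minimal sketch of a per-head cache with one hypothetical eviction policy, "keep only the most recent entries." FastGen's actual method profiles each head and chooses among several policies; the class, names, and window size below are illustrative, not FastGen's API.

```python
# Minimal sketch (not FastGen's code): a per-head KV cache with a simple
# "recent window" eviction policy, one hypothetical example of the kind
# of per-head policy FastGen selects via profiling.
import numpy as np

class KVCache:
    """Stores keys/values for one attention head across decoding steps."""

    def __init__(self, head_dim: int, window: int | None = None):
        self.keys = np.empty((0, head_dim))
        self.values = np.empty((0, head_dim))
        self.window = window  # None = keep everything (full cache)

    def append(self, k: np.ndarray, v: np.ndarray) -> None:
        self.keys = np.vstack([self.keys, k[None, :]])
        self.values = np.vstack([self.values, v[None, :]])
        if self.window is not None and len(self.keys) > self.window:
            # Hypothetical locality policy: evict the oldest entries.
            self.keys = self.keys[-self.window:]
            self.values = self.values[-self.window:]

    def attend(self, q: np.ndarray) -> np.ndarray:
        """Single-query attention over whatever is still cached."""
        scores = self.keys @ q / np.sqrt(q.shape[-1])
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()
        return weights @ self.values

# Usage: a head flagged as "local" gets a small window, cutting its
# memory from O(sequence length) to O(window).
rng = np.random.default_rng(0)
cache = KVCache(head_dim=64, window=128)
for _ in range(1000):
    cache.append(rng.normal(size=64), rng.normal(size=64))
out = cache.attend(rng.normal(size=64))
print(cache.keys.shape)  # (128, 64): memory bounded by the window
```

The point of per-head profiling is that a fixed window is wrong for some heads (e.g., those attending to special tokens) and wasteful for others; assigning each head the cheapest policy that preserves its attention pattern is what yields the memory savings.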
LoftQ: Reimagining LLM fine-tuning with smarter initialization
| Nikos Karampatziakis, Chen Liang, Weizhu Chen, Yixiao Li, Yifan Yu, and Tuo Zhao
LoftQ boosts LLM efficiency by quantizing a pretrained model and initializing its low-rank adapters jointly, so fine-tuning starts from a close approximation of the full-precision weights. This reduces computational demands while preserving high performance, and innovations like this can help make AI technology more energy-efficient.
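A minimal sketch of an alternating initialization in the spirit of LoftQ, assuming a simple uniform quantizer as a stand-in for the low-bit quantization used in practice; `quantize` and `loftq_init` are illustrative names, not a library API. The scheme alternates between quantizing the residual weight and fitting rank-r SVD factors, so that Q + AB approximates the original weight W before fine-tuning begins.

```python
# Sketch of LoftQ-style alternating initialization (illustrative, with a
# uniform quantizer standing in for low-bit quantization).
import numpy as np

def quantize(w: np.ndarray, bits: int = 4) -> np.ndarray:
    """Hypothetical stand-in quantizer: uniform, per-tensor."""
    levels = 2 ** bits - 1
    lo, hi = w.min(), w.max()
    scale = (hi - lo) / levels
    return np.round((w - lo) / scale) * scale + lo

def loftq_init(w: np.ndarray, rank: int = 8, bits: int = 4, iters: int = 5):
    """Alternate quantization with low-rank fitting of the residual."""
    a = np.zeros((w.shape[0], rank))
    b = np.zeros((rank, w.shape[1]))
    for _ in range(iters):
        q = quantize(w - a @ b, bits)           # quantize the residual
        u, s, vt = np.linalg.svd(w - q, full_matrices=False)
        a = u[:, :rank] * s[:rank]              # rank-r factors of W - Q
        b = vt[:rank]
    return q, a, b

# Usage: compare against the naive baseline of quantizing W directly and
# zero-initializing the adapters.
rng = np.random.default_rng(0)
w = rng.normal(size=(256, 256))
q, a, b = loftq_init(w, rank=16)
naive_err = np.linalg.norm(w - quantize(w))
loftq_err = np.linalg.norm(w - q - a @ b)
print(f"naive error {naive_err:.2f} vs LoftQ-style error {loftq_err:.2f}")
```

Because the adapters start by absorbing the quantization error rather than starting at zero, fine-tuning begins closer to the full-precision model, which is the "smarter initialization" the title refers to.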
Abstracts: May 6, 2024
| Michel Galley and Gretchen Huizinga
Researcher Michel Galley explores how he and fellow researchers combined new and existing data to create MathVista, an open-source benchmark for measuring the mathematical reasoning capabilities of foundation models in scenarios that involve text and images.