Building Next-Gen Multimodal Foundation Models for General-Purpose Assistants

LLaVA is an open-source project that collaborates with the research community to advance the state of the art in AI. LLaVA is the first end-to-end trained large multimodal model (LMM) to achieve impressive chat capabilities, mimicking the spirit of multimodal GPT-4. The LLaVA family continues to grow, supporting more modalities, capabilities, and applications.

People

An open research collaboration spanning universities and multiple Microsoft teams, pushing the state of the art in new capabilities, scale, and applications.

Hao Cheng

Principal Researcher

Michel Galley

Senior Principal Researcher

Jianfeng Gao

Distinguished Scientist & Vice President

Yong Jae Lee

Associate Professor, University of Wisconsin-Madison

Lars Liden

Principal Research Software Engineer Manager

Haotian Liu

Ph.D. Student, University of Wisconsin-Madison

Xiaodong Liu

Senior Principal Researcher

Yadong Lu

Researcher, Microsoft Azure AI

Matt Mazzola

Senior Research Software Engineer

Tristan Naumann

Principal Researcher

Hoifung Poon

General Manager, Health Futures

Yelong Shen

Principal Researcher, Microsoft Azure AI

Swadheen Shukla

Principal Program Manager

Irina Spiridonova

Senior Software Engineer

Andrea Tupini

Research Software Engineer

Naoto Usuyama

Principal Researcher

Cliff Wong

Principal Data Scientist

Jianwei Yang

Principal Researcher

Sheng Zhang

Principal Researcher