Computer Vision

Career Opportunity

Simulation Engineer – Microsoft’s Cloud Operations & Innovation (CO+I)

Posted: November 4, 2024

Location: Redmond, WA, US

Research Area(s): Computer vision, Data platforms and analytics, Mathematics, Programming languages and software engineering

Microsoft’s Cloud Operations & Innovation (CO+I) is the engine that powers our cloud services. As a datacenter Simulation Expert, you will perform a key role in delivering the core infrastructure and foundational technologies for Microsoft’s…

Career Opportunity

Research Internships at Spatial AI Lab Zurich, Switzerland

Posted: November 4, 2024

Location: Zürich, Switzerland

Research Area(s): Artificial intelligence, Computer vision

We are seeking Research Interns in Computer Vision, Machine Learning and Robotics, broadly defined, for our Spatial AI Lab in Zurich. As an intern, you will collaborate with one or more mentors and use the…

Career Opportunity

Research Intern – IMAIS Group: Situated Intelligence and Multimodal Interaction in the Physical World

Posted: November 1, 2024

Location: Redmond, WA, US

Research Area(s): Artificial intelligence, Computer vision, Human-computer interaction, Social sciences

The Interactive Multimodal AI Systems (IMAIS) group at Microsoft Research seeks a Research Intern to work on a project related to Situated Intelligence. The Situated Intelligence research effort aims to enable computers to reason about the physical everyday world,…

Career Opportunity

Research Scientist – Spatial AI Lab

Posted: October 31, 2024

Location: Zürich, Switzerland

Research Area(s): Artificial intelligence, Computer vision

The Microsoft Spatial AI Lab in Zurich, Switzerland, is a research and development team building the future of spatial computing. We are looking for computer vision and machine learning scientists who share our passion for…

Career Opportunity

Scientist Action Recognition – Spatial AI Lab

Posted: October 31, 2024

Location: Zürich, Switzerland

Research Area(s): Artificial intelligence, Computer vision

The Microsoft Spatial AI Lab in Zurich, Switzerland, is a research and development team building the future of spatial computing. We are looking for computer vision and machine learning scientists who share our passion for…

Career Opportunity

Research Intern – Applied Sciences Group (Audio/Vision/NLP/Multimodal)

Posted: October 28, 2024

Location: Redmond, WA, US

Research Area(s): Audio and Acoustics, Computer vision, Human language technologies

The Microsoft Applied Sciences Group incubates disruptive technologies for Microsoft’s next-gen Windows and Surface products. Operating as a startup within the company, this team works closely with several research and product teams to bring compelling…

Career Opportunity

Research Intern – Applied Sciences Group (Computer Agent)

Posted: October 28, 2024

Location: Redmond, WA, US

Research Area(s): Algorithms, Artificial intelligence, Computer vision, Human language technologies

The Microsoft Applied Sciences Group incubates disruptive technologies for Microsoft’s next-gen Windows and Surface products. Operating as a startup within the company, this team works closely with several research and product teams to bring compelling…

Video

Hairmony: Fairness-aware hairstyle classification

October 17, 2024 | James Clemoes

We present a method for prediction of a person’s hairstyle from a single image. Despite growing use cases in user digitization and enrollment for virtual experiences, available methods are limited, particularly in the range of…

04:30

Video

Look Ma, no markers: holistic performance capture without the hassle

October 17, 2024 | Charlie Hewitt

We tackle the problem of highly-accurate, holistic performance capture for the face, body and hands simultaneously. Motion-capture technologies used in film and game production typically focus only on face, body or hand capture independently, involve…

03:25

Dataset Source Code

OmniParser

OmniParser is a comprehensive method for parsing user interface screenshots into structured and easy-to-understand elements, which significantly enhances the ability of GPT-4V to generate actions that can be accurately grounded in the corresponding regions of…

GitHub Publication

Microsoft at CVPR 2024: Innovations in computer vision and AI research

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Exploring how context, culture, and character matter in avatar research

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

Simulation Engineer – Microsoft’s Cloud Operations & Innovation (CO+I)

Research Internships at Spatial AI Lab Zurich, Switzerland

Research Intern – IMAIS Group: Situated Intelligence and Multimodal Interaction in the Physical World

Research Scientist – Spatial AI Lab

Scientist Action Recognition – Spatial AI Lab

Research Intern – Applied Sciences Group (Audio/Vision/NLP/Multimodal)

Research Intern – Applied Sciences Group (Computer Agent)

Hairmony: Fairness-aware hairstyle classification

Look Ma, no markers: holistic performance capture without the hassle

OmniParser

Computer Vision

Highlights