Besmira Nushi

Principal Research Manager

A summary of insights extracted by using the Eureka framework, shown via two radar charts for multimodal (left) and language (right) capabilities respectively. The radar charts show the best and worst performance observed for each capability.

Microsoft Research Blog

Eureka: Evaluating and understanding progress in AI

September 17, 2024 | Vidhisha Balachandran, Jingya Chen, Neel Joshi, Besmira Nushi, Hamid Palangi, Eduardo Salinas, Vibhav Vineet, James Woffinden-Luey, and Safoora Yousefi

How can we rigorously evaluate and understand state-of-the-art progress in AI? Eureka is an open-source framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. Learn more about the extended findings.

Microsoft Research Blog

Understanding social biases through the text-to-image generation lens

September 8, 2023 | Ranjita Naik and Besmira Nushi

Gender, race, and age disparities in AI-generated images persist. This AIES 2023 study on text-to-image models shows that even basic prompts can lead to underrepresentation, calling for responsible bias mitigation strategies.

Microsoft Research Blog

Creating better AI partners: A case for backward compatibility

January 25, 2019 | Besmira Nushi and Ece Kamar

Artificial intelligence technologies hold great promise as partners in the real world. They’re in the early stages of helping doctors administer care to their patients and lenders determine the risk associated with loan applications, among other examples. But what happens…

In the news | Microsoft Research Blog

Creating better AI partners: A case for backward compatibility

January 25, 2019

Traditional metrics on performance of the AI component are not sufficient when the AI technology is used by people to accomplish tasks.

Contact Besmira Nushi

AI Frontiers

Microsoft Research Lab – Redmond