{"id":1160590,"date":"2026-01-20T21:38:57","date_gmt":"2026-01-21T05:38:57","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&p=1160590"},"modified":"2026-01-28T13:25:24","modified_gmt":"2026-01-28T21:25:24","slug":"ai-for-low-resource-languages","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/ai-for-low-resource-languages\/","title":{"rendered":"AI for Low-Resource Languages"},"content":{"rendered":"
\n\t
\n\t\t
\n\t\t\t\t\t<\/div>\n\t\t\n\t\t
\n\t\t\t\n\t\t\t
\n\t\t\t\t\n\t\t\t\t
\n\t\t\t\t\t\n\t\t\t\t\t
\n\t\t\t\t\t\t
\n\t\t\t\t\t\t\t\n\t\t\t\t\t\t\t\n\n

Bridging the language gap<\/h1>\n\n\n\n

<\/a><\/p>\n\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n\n\n\n\n\n

\n

Giving every language a place in the digital world<\/h2>\n\n\n\n

Language is more than communication\u2014it sustains culture, identity, and opportunity<\/strong>. Yet most of the world\u2019s languages remain underrepresented in today\u2019s AI systems because they have limited digital content, few high-quality datasets, and little benchmark data to measure progress.<\/p>\n\n\n\n

Our work focuses on practical, partner-driven pathways to make modern AI usable and safer in low-resource settings\u2014combining data stewardship, evaluation benchmarks, translation tools, and adaptable training workflows that can be reused across languages and contexts.<\/p>\n<\/div>

<\/figure><\/div>\n\n\n\n
<\/figure>
\n

Microsoft AI for Good Lab LINGUA awardees announced<\/h2>\n\n\n\n

The Microsoft AI for Good Lab has announced the awardees of LINGUA: Expanding Europe\u2019s Voices in AI<\/strong>, an open call supporting ethical, open dataset creation for European languages underrepresented in digital spaces and AI systems.<\/p>\n\n\n\n

The selected projects span 16 languages and dialects across 10 countries, representing a diverse mix of low\u2011resource, vulnerable, and underrepresented linguistic communities. Led by universities, nonprofits, a government language center, and a public broadcaster, the awardees are advancing multilingual AI by expanding access to speech and text data and strengthening Europe\u2019s linguistic diversity.<\/p>\n\n\n\n

\n
Program overview & awardees<\/a><\/div>\n<\/div>\n<\/div><\/div>\n\n\n\n
\n\n\n\n
\n
\n
\n