{"id":999750,"date":"2024-01-30T05:22:13","date_gmt":"2024-01-30T13:22:13","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&p=999750"},"modified":"2024-06-10T09:57:52","modified_gmt":"2024-06-10T16:57:52","slug":"kahani-visual-storytelling-through-culturally-nuanced-images","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/kahani-visual-storytelling-through-culturally-nuanced-images\/","title":{"rendered":"Kahani: Visual Storytelling through Culturally Nuanced Images"},"content":{"rendered":"\n

Presented by Sameer Segal<\/a> at Microsoft Research Forum, January 2024<\/strong><\/em><\/p>\n\n\n\n
\n
\n

\u201c[Project Kahani is] trying to bring not only visually stunning images but also bring in cultural nuances to it. Past work has shown that diffusion models tend to stereotype and fail to understand local words, but they don\u2019t provide ways to overcome these shortcomings without modifying the model or using fine-tuning.\u201d<\/p>\n\u2013<\/em> Sameer Segal, Principal Research Software Development Engineer<\/cite><\/blockquote>\n<\/div><\/div>\n\n\n\n

\n