关于
I am a Research Fellow in Microsoft Research India, working on Retrieval models.
Previously, I completed my MS (by Research) from Indian Institute of Technology, Madras where my thesis was based on Multilingual Neural Machine Translation.
精选内容
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the…