{"version":"1.0","provider_name":"Microsoft Research","provider_url":"https:\/\/www.microsoft.com\/en-us\/research","author_name":"Zhe Gan","author_url":"https:\/\/www.microsoft.com\/en-us\/research\/people\/zhgan\/","title":"SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning - Microsoft Research","type":"rich","width":600,"height":338,"html":"
SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning<\/a><\/blockquote>