Microsoft Research blog
Microsoft and NVIDIA introduce parameter-efficient multimodal transformers for video representation learning
| Yale Song
Understanding video is one of the most challenging problems in AI, and an important underlying requirement is learning multimodal representations that capture information about objects, actions, sounds, and their long-range statistical dependencies from audio-visual signals. Recently, transformers have been successful in…
Conversations with data: Advancing the state of the art in language-driven data exploration
| Alex Polozov, Chris Meek, and Ahmed Awadallah
One key aspiration of AI is to develop natural and effective task-oriented conversational systems. Task-oriented conversational systems use a natural language interface to collaborate with and support people in accomplishing specific goals and activities, going beyond open-ended chitchat. For…
Factorized layers revisited: Compressing deep networks without playing the lottery
| Misha Khodak, Neil Tenenholtz, Lester Mackey, and Nicolo Fusi
From BiT (928 million parameters) to GPT-3 (175 billion parameters), state-of-the-art machine learning models are rapidly growing in size. With the greater expressivity and easier trainability of these models come skyrocketing training costs, deployment difficulties, and even climate impact. As…
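The excerpt above concerns compressing networks with factorized layers. As a rough illustration of the general idea (a minimal sketch of classic low-rank factorization, not necessarily the exact method in the post): an m×n weight matrix can be replaced by the product of an m×r and an r×n matrix, cutting parameters from m·n to r·(m+n) when r is small.

```python
import numpy as np

def factorize_linear(W, rank):
    """Approximate an (m x n) weight matrix W as U @ V, with
    U of shape (m, rank) and V of shape (rank, n), via truncated SVD.
    Illustrative sketch of low-rank factorization, not the paper's method."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    U_r = U[:, :rank] * s[:rank]   # fold singular values into the left factor
    V_r = Vt[:rank, :]
    return U_r, V_r

rng = np.random.default_rng(0)
W = rng.standard_normal((1024, 1024))
U_r, V_r = factorize_linear(W, rank=64)

full_params = W.size                    # 1024 * 1024 = 1,048,576
factored_params = U_r.size + V_r.size   # 64 * (1024 + 1024) = 131,072
print(full_params, factored_params)     # the factorized layer is 8x smaller
```

Training the factors U and V directly (rather than factorizing after training) is one way such methods keep accuracy while shrinking the model.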