{"id":989523,"date":"2023-12-11T07:00:00","date_gmt":"2023-12-11T15:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=989523"},"modified":"2024-01-04T07:08:27","modified_gmt":"2024-01-04T15:08:27","slug":"neurips-2023-highlights-breadth-of-microsofts-machine-learning-innovation","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/neurips-2023-highlights-breadth-of-microsofts-machine-learning-innovation\/","title":{"rendered":"NeurIPS 2023 highlights breadth of Microsoft’s machine learning innovation"},"content":{"rendered":"\n
\"Research<\/figure>\n\n\n\n

Microsoft is proud to sponsor the 37th Conference on Neural Information Processing Systems<\/a> (NeurIPS 2023). This interdisciplinary forum brings together experts in machine learning, neuroscience, statistics, optimization, computer vision, natural language processing, life sciences, natural sciences, social sciences, and other adjacent fields. We are pleased to share that Microsoft has over 100 accepted papers and is offering 18 workshops at NeurIPS 2023.\u00a0<\/p>\n\n\n\n

This year\u2019s conference includes three papers from Microsoft that were chosen for oral presentations, which feature groundbreaking concepts, methods, or applications, addressing pressing issues in the field. Additionally, our spotlights posters, also highlighted below, have been carefully curated by conference organizers, exhibiting novelty, technical rigor, and the potential to significantly impact the landscape of machine learning. This blog post celebrates those achievements.<\/p>\n\n\n\n

Oral Presentations<\/h2>\n\n\n\n

Bridging Discrete and Backpropagation: Straight-Through and Beyond<\/a><\/h3>\n\n\n\n

Gradient computations are pivotal in deep learning’s success, yet they predominantly depend on backpropagation, a technique limited to continuous variables. The paper Bridging Discrete and Backpropagation: Straight-Through and Beyond<\/a>, tackles this limitation. It introduces ReinMax, extending backpropagation’s capability to estimate gradients for models incorporating discrete variable sampling. Within extensive experiments of this study, ReinMax demonstrates consistent and significant performance gain over the state of the art. More than just a practical solution, the paper sheds light on existing deep learning practices. It elucidates that the ‘Straight-Through’ method, once considered merely a heuristic trick, is actually a viable first-order approximation for the general multinomial case. Correspondingly, ReinMax achieves second-order accuracy in this context without the complexities of second-order derivatives, thus having negligible computation overheads. <\/p>\n\n\n\n\t

\n\t\t\n\n\t\t

\n\t\tSpotlight: Microsoft research newsletter<\/span>\n\t<\/p>\n\t\n\t

\n\t\t\t\t\t\t
\n\t\t\t\t\n\t\t\t\t\t\"\"\n\t\t\t\t<\/a>\n\t\t\t<\/div>\n\t\t\t\n\t\t\t
\n\n\t\t\t\t\t\t\t\t\t

Microsoft Research Newsletter<\/h2>\n\t\t\t\t\n\t\t\t\t\t\t\t\t

Stay connected to the research community at Microsoft.<\/p>\n\t\t\t\t\n\t\t\t\t\t\t\t\t

\n\t\t\t\t\t
\n\t\t\t\t\t\t\n\t\t\t\t\t\t\tSubscribe today\t\t\t\t\t\t<\/a>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t<\/div>\n\t<\/div>\n\t<\/div>\n\t\n\n\n

The MineRL BASALT Competition on Learning from Human Feedback<\/a><\/h3>\n\n\n\n

The growth of deep learning research, including its incorporation into commercial products, has created a new challenge: How can we build AI systems that solve tasks when a crisp, well-defined specification is lacking? To encourage research on this important class of techniques, researchers from Microsoft led The MineRL BASALT Competition on Learning from Human Feedback (opens in new tab)<\/span><\/a>, an update to a contest first launched in 2021 (opens in new tab)<\/span><\/a> by researchers at the University of California-Berkeley and elsewhere. The challenge of this competition was to complete fuzzy tasks from English language descriptions alone, with emphasis on encouraging different ways of learning from human feedback as an alternative to a traditional reward signal. <\/p>\n\n\n\n

The researchers designed a suite of four tasks in Minecraft for which writing hardcoded reward functions would be difficult. These tasks are defined by natural language: for example, “create a waterfall and take a scenic picture of it”, with additional clarifying details. Participants must train a separate agent for each task. Agents are then evaluated by humans who have read the task description.<\/p>\n\n\n\n

The competition aimed to encourage development of AI systems that do what their designers intended, even when the intent cannot be easily formalized. Besides allowing AI to solve more tasks, this can also enable more effective regulation of AI systems, as well as making progress on value alignment problems, in which the specified objectives of an AI agent differ from those of its users.<\/p>\n\n\n\n

Related<\/h4>\n\n\n\n
\n\t