Machine Learning & AI | NYC

新闻与深度文章

photo of Ida Momennejad for the AI Frontiers Microsoft Research Podcast series

微软研究院播客

AI Frontiers: Rethinking intelligence with Ashley Llorens and Ida Momennejad

2024年3月28日 | Ashley Llorens 和 Ida Momennejad

Principal Researcher Ida Momennejad brings her expertise in cognitive neuroscience and computer science to this in-depth conversation about general intelligence and what the evolution of the brain across species can teach us about building AI.

Microsoft Research Podcast - Abstracts hero with a microphone icon

微软研究院播客

Abstracts: January 25, 2024

2024年1月25日 | Gretchen Huizinga, Jordan Ash, 和 Dipendra Misra

On “Abstracts,” Jordan Ash & Dipendra Misra discuss the parameter reduction method LASER. Tune in to learn how selective removal of stored data alone can boost LLM performance, then sign up for Microsoft Research Forum for more on LASER &…

微软研究院博客

NeurIPS 2023 highlights breadth of Microsoft’s machine learning innovation

2023年12月11日

We’re proud to have 100+ accepted papers At NeurIPS 2023, plus 18 workshops. Several submissions were chosen as oral presentations and spotlight posters, reflecting groundbreaking concepts, methods, or applications. Here’s an overview of those submissions.

MSR Podcast - AI Frontiers with Hanna Wallach

微软研究院播客

AI Frontiers: Measuring and mitigating harms with Hanna Wallach

2023年9月28日 | Hanna Wallach 和 Ashley Llorens

Powerful large-scale AI models like GPT-4 are showing dramatic improvements in reasoning, problem-solving, and language capabilities. This marks a phase change for artificial intelligence—and a signal of accelerating progress to come.   In this Microsoft Research Podcast series, AI scientist and…

A diagram in which five newspaper icons are lined up in the middle, the first of which is labeled a. An arrow points from the newspaper to an icon of a person above it. The person is labeled x and has a mouse click icon next to it and a thought bubble with the words “I like this!” that’s labeled r. An arrow points from the mouse click icon to a box labeled “recommender system” under the newspapers.

微软研究院博客

Inferring rewards through interaction

2023年5月4日 | Jessica Maghakian, Akanksha Saran, Cheng Tan, 和 Paul Mineiro

In reinforcement learning, handcrafting reward functions is difficult and can yield algorithms that don’t generalize well. IGL-P, an interaction-grounded learning strategy, learns personalized rewards for different people in recommender system scenarios.

新闻报道 | Machine Learning (Theory)

HOMER: Provable Exploration in Reinforcement Learning

2020年7月21日

Last week at ICML 2020, Mikael Henaff, Akshay Krishnamurthy, John Langford and Dipendra Misra had a paper on a new reinforcement learning (RL) algorithm that solves three key problems in RL: (i) global exploration, (ii) decoding latent dynamics, and (iii) optimizing a given…

新闻报道 | Medium | Machine Learning

HOMER: Provable Exploration in Reinforcement Learning

2020年7月14日

At ICML 2020, Mikael Henaff, Akshay Krishnamurthy, John Langford and Dipendra Misra published a paper presenting a new reinforcement learning (RL) algorithm called HOMER that addresses three main problems in real-world RL problem: (i) exploration, (ii) decoding latent dynamics, and (iii) optimizing…

微软研究院播客

Provably efficient reinforcement learning with Dr. Akshay Krishnamurthy

2020年6月3日

MSR’s New York City lab is home to some of the best reinforcement learning research on the planet but if you ask any of the researchers, they’ll tell you they’re very interested in getting it out of the lab and…

微软研究院博客

Provably efficient reinforcement learning with rich observations

2019年6月3日 | Akshay Krishnamurthy

Reinforcement learning, a machine learning paradigm for sequential decision making, has stormed into the limelight, receiving tremendous attention from both researchers and practitioners. When combined with deep learning, reinforcement learning (RL) has produced impressive empirical results, but the successes to…