{"id":1152001,"date":"2025-10-16T00:29:08","date_gmt":"2025-10-16T07:29:08","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-academic-program&#038;p=1152001"},"modified":"2026-01-25T19:04:58","modified_gmt":"2026-01-26T03:04:58","slug":"microsoft-research-asia-starleap-program","status":"publish","type":"msr-academic-program","link":"https:\/\/www.microsoft.com\/en-us\/research\/academic-program\/microsoft-research-asia-starleap-program\/","title":{"rendered":"Microsoft Research Asia \u2014 StarLeap Program"},"content":{"rendered":"\n\n<p><\/p>\n\n\n\n\n\n\n<p>The <strong>StarLeap Program<\/strong>, launched by Microsoft Research Asia (MSRA), is designed to provide exceptional students with the opportunity to collaborate with multiple research teams at MSRA and to address real-world, frontier research challenges. Since its establishment in January 2021, the program has received enthusiastic responses and widespread attention from students both in China and abroad.<\/p>\n\n\n\n<p>Participants in the StarLeap Program will conduct impactful research in an <strong>international, inclusive, and intellectually stimulating environment<\/strong>, under the mentorship of <strong>world-class researchers<\/strong> at MSRA.<\/p>\n\n\n\n<p><strong>Program Highlights<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Global and inclusive research culture:<\/strong> Through both online and offline collaboration, participants will gain an in-depth understanding of Microsoft\u2019s open, international research environment and its culture of diversity and inclusion.<\/li>\n\n\n\n<li><strong>Collaborative mentorship across multiple labs:<\/strong> Participants will work under the joint supervision of leading researchers from various MSRA laboratories, engaging in substantive academic exchange with experts from diverse research disciplines.<\/li>\n\n\n\n<li><strong>Focus on real-world, cutting-edge problems:<\/strong> The program emphasizes research topics derived from industrial practice, encouraging participants to produce outcomes that advance both academic knowledge and industry innovation.<\/li>\n<\/ul>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n\n\n<p>The StarLeap Program consists of several collaborative research projects, covering areas such as natural language processing, data intelligence, computer systems and networks, intelligent cloud, image scaling, computer vision, behavior detection, and social computing.<\/p>\n\n\n\n<p><strong>Starleap &#8211; Open for Application<\/strong><\/p>\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>We are focused on exploring cutting-edge technologies in Large Language Models, Data Intelligence, and AI-driven productivity tools. The team works on translating state-of-the-art research into impactful technologies that enhance user efficiency and creativity in open-ended tasks.<\/p>\n\n\n\n<p><strong>\u3010Responsibilities\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Model Optimization: Research and implement fine-tuning techniques for LLMs to improve performance in domain-specific code generation tasks.<\/li>\n\n\n\n<li>Instruction Tuning: Design strategies to construct high-quality datasets that enhance the model&#8217;s ability to follow complex instructions and handle structural constraints.<\/li>\n\n\n\n<li>Evaluation & Analysis: Develop benchmarks to assess the accuracy, syntax correctness, and executability of generated code in real-world scenarios.<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010Research Exploration\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Collaborate with the team to explore new possibilities in LLM-based tool use and planning, identifying gaps in current technologies and proposing novel solutions.<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010Qualifications Required Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Currently enrolled in a PhD program in Computer Science, AI, or a related field.<\/li>\n\n\n\n<li>Strong background in Deep Learning, NLP, or Code Generation.<\/li>\n\n\n\n<li>Proficiency in Python and PyTorch.<\/li>\n\n\n\n<li>Experience with LLM training, fine-tuning, or advanced prompting strategies.<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010Preferred Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Publication record in top-tier conferences (ACL, ICLR, NeurIPS, etc.).<\/li>\n\n\n\n<li>General interest in intelligent agents.<\/li>\n\n\n\n<li>Strong problem-solving skills and ability to work independently.<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010How to Apply\u3011<\/strong><\/p>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\https://www.microsoft.com/jsj.top\/f\/Y7QxZq\">Please submit your application materials online<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n\n\n\n<p>\u3010Application Materials\u3011<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>CV in both English and Chinese combined into one PDF file, naming format: Name_ Interested Research Group<\/li>\n\n\n\n<li>Interested research direction and personal statement, naming format: Name_PS<\/li>\n\n\n\n<li>Application form, click the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/view.officeapps.live.com\/op\/view.aspx?src=https%3A%2F%2Fwww.microsoft.com%2Fen-us%2Fresearch%2Fwp-content%2Fuploads%2F2024%2F09%2FInternship_application_form.xlsx&wdOrigin=BROWSELINK\">link<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> to download and complete, naming format: Name_ApplicationForm<\/li>\n<\/ul>\n\n\n\n<p>If you have any questions, please email: <a href=\"mailto:msraih@microsoft.com\">msraih@microsoft.com<\/a><\/p>\n\n\n\n<p><\/p>\n\n\n\n\n\n<p><strong>Research Projects<\/strong><\/p>\n\n\n\n\n\n<p><strong>\u3010<\/strong><strong>Introduction<\/strong><strong>\u3011<\/strong><strong><\/strong><\/p>\n\n\n\n<p>We are developing an AI-assisted hardware design framework that leverages large language models to accelerate design exploration. This project focuses on enabling agents to interact with GPU simulators, understand performance bottlenecks, and guide design exploration efficiently. The goal is to build AI systems that can reason about hardware behavior and co-design future architectures with human experts.<\/p>\n\n\n\n<p><strong>\u3010<\/strong><strong>Research Areas<\/strong><strong>\u3011<\/strong><strong><\/strong><\/p>\n\n\n\n<p>GPU Architecture, Hardware Simulation, AI for Chip Design, Machine Learning<\/p>\n\n\n\n<p><strong>\u3010<\/strong><strong>Qualifications<\/strong><strong>\u3011<\/strong><strong><\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Passionate about research in GPU design and simulation<\/li>\n\n\n\n<li>Strong programming and analytical skills (Python\/C++)<\/li>\n\n\n\n<li>Familiarity with GPU simulators (e.g., AccelSim) is a plus<\/li>\n\n\n\n<li>Prior experience in machine learning or large language models is preferred<\/li>\n<\/ul>\n\n\n\n<p><strong>How to Apply<\/strong><\/p>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\https://www.microsoft.com/jsj.top\/f\/Y7QxZq\">Please submit your application materials online<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n\n\n\n<p><strong>\u3010<\/strong><strong>Application Materials<\/strong><strong>\u3011<\/strong><strong><\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>CV in both English and Chinese combined into one PDF file, naming format: Name_ Interested Research Group<\/li>\n\n\n\n<li>Interested research direction and personal statement, naming format: Name_PS<\/li>\n\n\n\n<li>Application form, click the<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/view.officeapps.live.com\/op\/view.aspx?src=https%3A%2F%2Fwww.microsoft.com%2Fen-us%2Fresearch%2Fwp-content%2Fuploads%2F2024%2F09%2FInternship_application_form.xlsx&wdOrigin=BROWSELINK\"> link <span class=\"sr-only\"> (opens in new tab)<\/span><\/a>to download and complete, naming format: Name_ApplicationForm<\/li>\n<\/ul>\n\n\n\n<p>If you have any questions, please email: <a href=\"mailto:msraih@microsoft.com\">msraih@microsoft.com<\/a><\/p>\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>The goal of this project is to develop game-changing techniques for next-gen large pre-trained language models, including\uff1a<\/p>\n\n\n\n<p>(1) Beyond UniLM\/InfoXLM: novel pre-training frameworks and self-supervised tasks for monolingual and multilingual pre-training to support language understanding, generation and translation tasks;<\/p>\n\n\n\n<p>(2) Beyond Transformers: new model architectures and optimization algorithms for improving training effectiveness and efficiency of extremely large language models;<\/p>\n\n\n\n<p>(3) Knowledge Fusion: new modeling frameworks to fuse massive pre-compiled knowledge into pre-trained models;<\/p>\n\n\n\n<p>(4) Lifelong Self-supervised Learning: mechanisms and algorithms for lifelong (incremental) pre-training. This project extends our existing research and aims to advance SOTA on NLP and AI in general.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Natural Language Computing, MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing\">https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing<\/a><\/p>\n\n\n\n<p>Deep Learning, MSR Redmond<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/deep-learning-group\">https:\/\/www.microsoft.com\/en-us\/research\/group\/deep-learning-group<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Major in computer science or equivalent areas<\/li>\n\n\n\n<li>One+ year research experience in deep learning for NLP, CV or related areas<\/li>\n\n\n\n<li>Experience with open-source tools such as PyTorch,\u202fTensorflow, etc.<\/li>\n\n\n\n<li>Background knowledge of language model pre-training is preferred<\/li>\n\n\n\n<li>Track record of publications in related top conferences (e.g., ACL, EMNLP, NAACL, ICML, NeurIPS, ICLR) is preferred<\/li>\n\n\n\n<li>Excellent communication and writing skills<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Join our pioneering research team to work on harnessing the power of Large Language Models (LLMs) to address complex real-world optimization problems requiring long-term planning and dynamic information gathering from environments. Traditional optimization techniques often struggle with the high dimensionality, dynamic nature, and intricate dependencies inherent in real-world settings.<\/p>\n\n\n\n<p>Addressing these challenges, our research aims to push the boundaries of LLM capabilities to automate the decision-making processes, improve reliability, and provide innovative solutions to both existing and classical optimization challenges. The successful candidate will have the opportunity to collaborate with world-class researchers and engineers from diverse backgrounds and expertise, access to state-of-the-art computational resources, and contribute to the advancement of LLM research and its impact on real-world optimization problems.<\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Conduct cutting-edge research on the application of LLMs to real-world optimization problems.<\/li>\n\n\n\n<li>Develop and implement novel methodologies to improve the performance of LLMs in dynamic and complex environments.<\/li>\n\n\n\n<li>Collaborate with cross-functional teams to integrate advanced AI models with traditional optimization techniques.<\/li>\n\n\n\n<li>Design experiments and simulations to test new hypotheses and validate the effectiveness of LLM-driven solutions.<\/li>\n\n\n\n<li>Publish research findings in top-tier conferences and journals, and present results to both technical and non-technical audiences.<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010Required Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Currently enrolled in a master&#8217;s, or PhD program in CS, EE, ML, Mathematics, or a related field.<\/li>\n\n\n\n<li>Proficient analytical and problem-solving skills<\/li>\n\n\n\n<li>Proficiency in Python, C\/C++, and other programming languages.<\/li>\n\n\n\n<li>Experience with Linux and development on Linux platforms.<\/li>\n\n\n\n<li>Excellent communication and presentation skills.<\/li>\n\n\n\n<li>Ability to work independently and collaboratively in a dynamic research environment.<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010Preferred Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Familiarity with optimization techniques and models.<\/li>\n\n\n\n<li>Experience with machine learning frameworks (e.g., PyTorch, TensorFlow).<\/li>\n\n\n\n<li>Knowledge of multi-agent systems and Active Learning.<\/li>\n\n\n\n<li>Experience with LLMs and their applications in dynamic and complex environments.<\/li>\n\n\n\n<li>Strong publication record in top-tier conferences and journals.<\/li>\n\n\n\n<li>Active contribution to open-source projects on platforms like GitHub.<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010How to Apply\u3011<\/strong><\/p>\n\n\n\n<p>Interested candidates should submit their resume along with a cover letter detailing their relevant experience and research interests.<\/p>\n\n\n\n<p>**Join us and contribute to groundbreaking research that integrates advanced AI models with optimization techniques, driving impactful decision-making across various domains.**<\/p>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Knowledge is essential for identifying issues, accelerating remediation, and enhancing existing infrastructure in large-scale systems. However, there is a knowledge gap due to the lack of easily consumable, vast infrastructure data. Because the data is immense and dynamically evolving. Large-language and multi-modal models have created opportunities to better support knowledge production and consumption, from gleaning new insights to extracting entities and generating signatures from unstructured data at scale, as demonstrated in recent research. In this project, we aim to leverage these models to automate and accelerate raw data processing, build knowledge graphs, and connect them to gain a deeper understanding of system infrastructure.<\/p>\n\n\n\n<p>We\u2019ll work with scientists who are at the forefront of system and network research, leveraging the world-leading platforms to solve the challenges problems in this area. The current project team members, from both MSRA Vancouver and MSR Redmond labs, have rich experience contributing to both industry and academic community through transferring innovations that support production systems and publications at top conferences.<\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Major in computer science, electrical engineering, or equivalent field<\/li>\n\n\n\n<li>Solid knowledge of data structure\/algorithm<\/li>\n\n\n\n<li>Familiarity with Python, C\/C++ and other programming languages, familiar with Linux and development on Linux platform<\/li>\n\n\n\n<li>Good communication and presentation skills<\/li>\n\n\n\n<li>Good English reading and writing ability, capable of system implementing based on academic papers in English, capable of writing English documents<\/li>\n<\/ul>\n\n\n\n<p><strong>\u3010Preferred Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Rich knowledge of machine learning and machine learning models<\/li>\n\n\n\n<li>Have some basic security knowledge and participated in one security-related projects.<\/li>\n\n\n\n<li>Familiarity with engineering process as a strong plus<\/li>\n\n\n\n<li>Active on GitHub, used or participated in well-known open-source projects<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>We are developing a suite of smaller language models (SLMs) that are similar to LLMs but use less computing power. This project focuses on studying advanced training techniques that can better align the capabilities of SLMs with various aspects of different product scenarios, including but not limited to instruction following and task planning.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Language Model, Machine Learning<\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Passionate about research in language models<\/li>\n\n\n\n<li>Solid coding and communication skills<\/li>\n\n\n\n<li>Prior experience in language model and\/or machine learning research is preferred<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Large language models (LLMs) have revolutionized various fields with technologies like Retrieval Augmented Generation (RAG), In-Context Learning (ICL), Chain of Thought (CoT), and Agent-based models. These advancements, while groundbreaking, often result in lengthy prompts that lead to increased computational and financial costs, higher latencies, and added redundancy. Moreover, the intrinsic position bias of LLMs and redundancy within prompt will impact their performance, leading to the &#8220;lost in the middle&#8221; issue.<\/p>\n\n\n\n<p>Previous studies have introduced prompt compression methods such as LLMLingua and LongLLMLingua, which address these issues and show promising results in generic scenarios. This project aims to explore research questions around complex scenarios, such as agent-related prompts and the compression of LLM responses. Furthermore, it seeks to investigate the effects of such compression techniques on adversarial attacks, security, and other critical aspects.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Large Language Models, Agent-based, Efficient Method<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/llmlingua\/overview\/\">https:\/\/www.microsoft.com\/en-us\/research\/project\/llmlingua\/overview\/<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Passionate about research in LLMs and AGI<\/li>\n\n\n\n<li>Solid coding and communication skills<\/li>\n\n\n\n<li>Prior experience in LLM and\/or Machine Learning research is preferred<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>While there have been significant efforts on leveraging LLMs as an evaluator, it is not quite there yet. It is only useful in English and in certain tasks, which severely limits its useability and trustworthiness across language. Join a groundbreaking project at Microsoft Research Asia, focusing on answering fundamental questions around LLM-based evaluation, but with direct production impact. This project aims to surpass the current capabilities of LLMs in certain tasks, emphasizing accuracy, reliability, robustness, and generalizability. The intern will be instrumental in creating a production-deployed system that adapts to needs serving hundreds of millions of users; and answer fundamental questions around the capabilities, limitations, and usages of LLMs and beyond.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Large Language Models, LLM-based Evaluation, LLM for low-resource language<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing\/<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Passionate about research in LLMs and AGI<\/li>\n\n\n\n<li>Solid coding and communication skills<\/li>\n\n\n\n<li>Graduate student working towards a Ph.D<\/li>\n\n\n\n<li>Prior experience in LLM and\/or Machine Learning research is preferred<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Retrieval-augmented generation (RAG) is a technique for enhancing the quality of responses generated by large language models (LLMs) by using external sources of knowledge to supplement the LLM&#8217;s internal representation of information. RAG allows LLMs to access the most up-to-date and reliable facts from a knowledge base or internal information storage. It can be used for various natural language generation tasks, such as question answering, summarization, and chat. However, the documents retrieved might be redundant and noisy. This project aims to develop efficient and robust RAG methods, which leverage shorter context length by removing contradiction and redundancy, reduce hallucination and are robust in different domains.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Large Language Models, Retrieval-Augmented Generation<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing\/<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Passionate about research in LLMs and AGI<\/li>\n\n\n\n<li>Solid coding and communication skills<\/li>\n\n\n\n<li>Prior experience in LLM and\/or Machine Learning research is preferred<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>For the new Designer app and Designer in Edge, we need to resize templates to different sizes, since different social media platforms require different target dimensions of the media, e.g., Facebook Timeline Post for personal accounts and business pages (1200 x 628), LinkedIn timeline post (1200 x 1200), Twitter timeline post (1600 x 900), etc. Image is the center of a template design. We need an ML-powered technique to automatically resize (including aspect ratio change, crop, zoom in\/out) an image and put it into a resized template (more specifically speaking, resized image placeholder) for the target platform, so that the image placement looks good (i.e., maintaining the aesthetic values).<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Computer Vision and Machine Learning<\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Ph.D. students majoring in computer science, applied mathematics, electrical engineering or related technical discipline<\/li>\n\n\n\n<li>Relevant experience in the development and application of computer vision and\/or machine learning algorithms to solve challenging image understanding problems<\/li>\n\n\n\n<li>Strong scientific programming skills, including C\/C++, MATLAB, Python<\/li>\n\n\n\n<li>Independent analytical problem-solving skills<\/li>\n\n\n\n<li>Experience collaborating within research teams to develop advanced research concepts, prototypes, and systems<\/li>\n\n\n\n<li>Strong communication skills<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Pretrained language models such as BERT and UniLM have achieved huge success in many natural language processing scenarios. In many recommendation scenarios such as news recommendation, video recommendation, and ads CTR\/CVR prediction, user models are very important to infer user interest and intent from user behaviors. Previously, user models are trained in a supervised task-specific way, which cannot achieve a global and universal understanding of users and may limit they capacities in serving personalized applications.<\/p>\n\n\n\n<p>In this project, inspired by the success of pretrained language models, we plan to pretrain universal user models from large-scale unlabeled user behaviors using self-supervision tasks. The pretrained user models aim to better understand the characteristics, interest and intent of users, and can empower different downstream recommendation tasks by finetuning on their labeled data. Our recent work can be found at<\/p>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/scholar.google.co.jp\/citations?hl=zh-CN&user=0SZVO0sAAAAJ&view_op=list_works&sortby=pubdate\">https:\/\/scholar.google.co.jp\/citations?hl=zh-CN&user=0SZVO0sAAAAJ&view_op=list_works&sortby=pubdate<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Recommender Systems and Natural Language Processing<\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Ph.D. students majoring in computer science, electronic engineering, or related areas<\/li>\n\n\n\n<li>Self-motivated and passionate in research<\/li>\n\n\n\n<li>Solid coding skills<\/li>\n\n\n\n<li>Experienced in Recommender Systems and Natural Language Processing<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Learning visual representation by vision-language pair data has shown highly competitive compared to previous supervised and self-supervised approaches, pioneered by CLIP and DALL-E. Such vision-language learning approaches have also demonstrated strong performance on some pure vision and vision-language applications. The aim of this project is to continually push forward the boundary of this research direction.<\/p>\n\n\n\n<p><strong>\u3010Research Areas \u3011<\/strong><\/p>\n\n\n\n<p>Computer vision<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/visual-computing\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/visual-computing\/<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/hanhu\">https:\/\/www.microsoft.com\/en-us\/research\/people\/hanhu<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Currently enrolled oversea Ph. D. students with promised or deferred offer, and is now staying in China<\/li>\n\n\n\n<li>Major in computer vision, natural language processing, or machine learning<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Are you excited to apply deep neural networks to solve practical problems? Would you like to help secure enterprise computer systems and users across the globe? Cyber-attacks on enterprises are proliferating and oftentimes causing damage to essential business operations. Adversaries may steal credentials of valid users and use their accounts to conduct malicious activities, which abruptly deviate from valid user behavior. We aim to prevent such attacks by detecting abrupt user behavior changes.<\/p>\n\n\n\n<p>In this project, you will leverage deep neural networks to model behaviors of a large number of users, detect abrupt behavior changes of individual users, and determine if changed behaviors are malicious or not. You will be part of a joint initiative between Microsoft Research and the Microsoft Defender for Endpoint (MDE). During your internship, you will get to collaborate with some of the world\u2019s best researchers in security and machine learning.<\/p>\n\n\n\n<p><strong>You would be expected to:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Closely work with researchers in China and Israel towards the research goals of the project.<\/li>\n\n\n\n<li>Develop and implement research ideas and conduct experiments to validate them.<\/li>\n\n\n\n<li>Report and present findings.<\/li>\n<\/ul>\n\n\n\n<p>Microsoft is an equal opportunity employer.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Software Analytics, MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/software-analytics\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/software-analytics\/<\/a><\/p>\n\n\n\n<p>Microsoft Defender for Endpoint (MDE)<\/p>\n\n\n\n<p>This is a Microsoft engineering and research group that develops the Microsoft Defender for Endpoint, an enterprise endpoint security platform designed to help enterprise networks prevent, detect, investigate, and respond to advanced threats<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/security\/business\/threat-protection\/endpoint-defender\">https:\/\/www.microsoft.com\/en-us\/security\/business\/threat-protection\/endpoint-defender<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Must have at least 1 year of experience applying machine learning\/deep learning to real world\/ research problems<\/li>\n\n\n\n<li>Demonstrated hands on the experience with Python through previous projects<\/li>\n\n\n\n<li>Familiarity with Deep Learning frameworks like PyTorch, Tensorflow, etc<\/li>\n\n\n\n<li>Keen ability for attention to detail and a strong analytical mindset<\/li>\n\n\n\n<li>Excellent in English reading and reasonably good in English communications<\/li>\n\n\n\n<li>Advisor\u2019s permission<\/li>\n<\/ul>\n\n\n\n<p><strong>Those with the following conditions are preferred:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Prior experience in behavior modeling<\/li>\n\n\n\n<li>Prior experience in anomaly detection<\/li>\n\n\n\n<li>Security knowledge a plus<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>While PLMs have been widely used to generate high-quality texts in a supervised manner (by imitating texts written by humans), they lack a mechanism for generating texts that directly optimize a given reward, e.g., given user feedback like user clicks or a criterion that cannot be directly optimized by using gradient descent. In real-world applications, we usually wish to achieve more than just imitating existing texts. For example, we may wish to generate more attractive texts that lead to increased user clicks, more diversified texts to improve user experience, and more personalized texts that are better tailored to user tastes. Combing RL with PLMs provides a unified solution for all these scenarios, and is the core for machines to achieve human parity in text generation. Such a method has the potential to be applied in a wide range of products, e.g., Microsoft Advertising (text ad generation), Microsoft News (news headline generation), and Microsoft Stores and Xbox (optimizing the description for recommended items).<\/p>\n\n\n\n<p>In this project, we aim to study how pretrained language models (PLMs) can be enhanced by using deep reinforcement learning (RL) to generate attractive and high-quality text ads. While finetuning PLMs have been shown to be able to generate high-quality texts, RL additionally provides a principled way to directly optimize user feedback (e.g., user clicks) for improving attractiveness. Our initial RL method UMPG is deployed in Dynamic Search Ads and published in KDD 2021. We wish to extend the method so that it can work for all pretrained language models (in addition to UNILM) and study how the technique can benefit other important Microsoft Advertising products and international markets.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Social Computing (SC), MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/social-computing-beijing\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/social-computing-beijing\/<\/a><\/p>\n\n\n\n<p>Microsoft Advertising, Microsoft Redmond<\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Ph.D. students majoring in computer science, electrical engineering, or equivalent areas<\/li>\n\n\n\n<li>Experience with deep NLP and Transformers a strong plus<\/li>\n\n\n\n<li>Background knowledge of language model pre-training and\/or reinforcement learning<\/li>\n\n\n\n<li>Capable of system implementing based on academic papers in English<\/li>\n<\/ul>\n\n\n\n<p><strong>Those with the following conditions are preferred:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Good English reading and writing ability and communication skills, capable of writing English papers and documents<\/li>\n\n\n\n<li>Active on GitHub, used or participated in well-known open source projects<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>The parallel and distributed systems are the solution to address the ever-increasing complexity problem of deep learning trainings. However, existing solutions still leave efficiency and scalability on the table by missing optimization opportunities on various environments at industrial scale.<\/p>\n\n\n\n<p>In this project, we\u2019ll work with scientists who are at the forefront of system and network research, leveraging the world-leading platforms to solve system and networking problems in parallel and distributed deep learning area. The current project team members, from both MSR Asia and MSR Redmond labs, have rich experience contributing to both industry and academic community through transferring innovations that support production systems and publications at top conferences.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>System and Networking, MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/systems-and-networking-research-group-asia\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/systems-and-networking-research-group-asia\/<\/a><\/p>\n\n\n\n<p>Research in Software Engineering, MSR Redmond<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/research-software-engineering-rise\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/research-software-engineering-rise\/<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Major in computer science, electrical engineering, or equivalent field<\/li>\n\n\n\n<li>Solid knowledge of data structure\/algorithm<\/li>\n\n\n\n<li>Familiarity with Python, C\/C++ and other programming languages, familiar with Linux and development on Linux platform<\/li>\n\n\n\n<li>Good communication and presentation skills<\/li>\n\n\n\n<li>Good English reading and writing ability, capable of system implementing based on academic papers in English, capable of writing English documents<\/li>\n<\/ul>\n\n\n\n<p><strong>Those with the following conditions are preferred:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Familiarity with deep learning systems, e.g., PyTorch TensorFlow, GPU programming and networking<\/li>\n\n\n\n<li>Familiarity with NCCL, MPI communication protocols such as OpenMPI and MVAPICH<\/li>\n\n\n\n<li>Rich knowledge of machine learning and machine learning models<\/li>\n\n\n\n<li>Familiarity with engineering process as a strong plus<\/li>\n\n\n\n<li>Active on GitHub, used or participated in well-known open-source projects<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Tabular data such as Excel spreadsheets and databases are one of the most important assets in large enterprises today, which however are often plagued with data quality issues. Intelligent data cleansing focuses on novel ways to detect and fix data quality issues in tabular data, which can assist the large class of less-technical\u202fand non-technical users in enterprises.<\/p>\n\n\n\n<p>We are interested in a variety of topics in this area, including data-driven and intelligent techniques to detect data quality issues and suggest possible fixes, leveraging inferred constraints and statistical properties based on existing data assets and software artifacts.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Data, Knowledge, and Intelligence (DKI), MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/data-knowledge-intelligence\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/data-knowledge-intelligence\/<\/a><\/p>\n\n\n\n<p>Exploration and Mining (DMX), MSR Redmond<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/data-management-exploration-and-mining-dmx\">https:\/\/www.microsoft.com\/en-us\/research\/group\/data-management-exploration-and-mining-dmx<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Graduate-level students in Computer Science or related\u202fSTEM\u202ffields.\u202fPhD\u202fstudents are preferred<\/li>\n\n\n\n<li>Students with research\u202fbackground\u202fin database, data mining,\u202fstatistics, software engineering,\u202fand visualization\u202fare preferred<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>As one of the world-leading cloud service providers, Microsoft Azure manages tens of millions of virtual machines every day. Within such a large-scale cloud system, how to efficiently allocate virtual machines on servers is critical and has been a hot research topic for years. Previously, teams from MSR-Asia and MSR-Redmond have made significant contributions in this area that resulted in production impact and publication of academic papers at top-tier conferences (e.g., IJCAI, AAAI, OSDI, NSDI). In this project we intend to unify the strength of MSR-Asia and MSR-Redmond for performing forward-looking and collaborative research on power management in datacenters, including power-aware virtual machine allocation. The project involves developing power prediction models by leveraging the start-of-the-art machine learning methods, as well as building efficient and reliable allocation systems in large-scale distributed environments.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Data, Knowledge, and Intelligence (DKI), MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/data-knowledge-intelligence\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/data-knowledge-intelligence\/<\/a><\/p>\n\n\n\n<p>System, MSR Redmond<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/systems-research-group-redmond\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/systems-research-group-redmond\/<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Currently enrolled in a graduate program in computer science or equivalent field<\/li>\n\n\n\n<li>Good research track record in related areas<\/li>\n\n\n\n<li>Able to carry out research tasks with\u202fhigh quality<\/li>\n\n\n\n<li>Good communication and presentation skills in written and oral English<\/li>\n\n\n\n<li>Knowledge and experience in machine learning, data mining and data analytics are preferred<\/li>\n\n\n\n<li>Familiarity with AIOps or AI for systems\u202fis a strong\u202fplus<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>In today\u2019s real-time video applications, a key component for optimizing the user\u2019s quality of experience is bandwidth estimation and rate control. It estimates the network capacity based on congestion signals observed on the path and adapts the video bitrate accordingly through the codec. However, existing handcrafted bandwidth estimators have failed to accommodate a wide range of complex network conditions, calling for a data-driven approach.<\/p>\n\n\n\n<p>Motivated by the recent success in applying reinforcement learning (RL) to video streaming and congestion control, we have made an initial attempt at designing an RL-based bandwidth estimator for one-on-one video calls. Going forward, we are working to optimize the performance of our current neural network model, as well as extending the research study of bandwidth estimation and rate control to multiparty videoconferencing.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>System and Networking, MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/systems-and-networking-research-group-asia\/\">https:\/\/www.microsoft.com\/en-us\/research\/group\/systems-and-networking-research-group-asia\/<\/a><\/p>\n\n\n\n<p>Mobility and Networking, MSR Redmond<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/mobility-and-networking-research\">https:\/\/www.microsoft.com\/en-us\/research\/group\/mobility-and-networking-research<\/a><\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Major in computer science or a related field<\/li>\n\n\n\n<li>Strong programming skills in Python or C++<\/li>\n\n\n\n<li>Excellent English communication skills<\/li>\n\n\n\n<li>Experience with deep reinforcement learning or related areas is preferred<\/li>\n\n\n\n<li>Knowledge of computer networks is preferred<\/li>\n\n\n\n<li>Background in AI for systems and networking is a strong plus<\/li>\n\n\n\n<li>Track record of publications in top systems, networking, or AI conferences is strongly preferred<\/li>\n<\/ul>\n\n\n\n\n\n\n\n<p><strong>\u3010Introduction\u3011<\/strong><\/p>\n\n\n\n<p>Our cross-lab, inter-disciplinary research team develops AI technology for interactive coding assistance for data science, data analytics, and business process automation. It allows the user to specify their data processing intent in the middle of their workflow using a combination of natural language, input-output examples, and multi-modal UX \u2013 and translates that intent into the desired source code. The underlying AI technology integrates our state-of-the-art research in program synthesis, semantic parsing, and structure-grounded natural language understanding. It has the potential to improve productivity of millions of data scientists and software developers, as well as establish new scientific milestones for deep learning over structured data, grounded language understanding, and neuro-symbolic AI.<\/p>\n\n\n\n<p>The research project involves collecting and establishing a novel benchmark dataset for data science program generation, developing novel neuro-symbolic semantic parsing models to tackle this challenge, adapting large-scale pretrained language models to new domains and knowledge bases, as well as publishing in top-tier AI\/NLP conferences. We expect the benchmark dataset and the new models to be used in academia as well as in Microsoft products.<\/p>\n\n\n\n<p><strong>\u3010Research Areas\u3011<\/strong><\/p>\n\n\n\n<p>Natural Language Computing, MSR Asia<\/p>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing\">https:\/\/www.microsoft.com\/en-us\/research\/group\/natural-language-computing<\/a><\/p>\n\n\n\n<p>Neuro-Symbolic Learning, MSR Redmond<\/p>\n\n\n\n<p><strong>\u3010Qualifications\u3011<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Masters or Ph.D. students, majoring in computer science or equivalent areas<\/li>\n\n\n\n<li>Background in deep NLP, semantic parsing, sequence-to-sequence learning, Transformers required<\/li>\n\n\n\n<li>Experience with PyTorch and HuggingFace Transformers<\/li>\n\n\n\n<li>Fluent English speaking, listening, and writing skills<\/li>\n\n\n\n<li>Background in deep learning over structured data (graphs\/trees\/programs) and program synthesis preferred<\/li>\n\n\n\n<li>Students with papers published at top-tier AI\/NLP conferences are preferred<\/li>\n<\/ul>\n\n\n\n\n\n<p><\/p>\n\n\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr_hide_image_in_river":null,"footnotes":""},"msr-opportunity-type":[],"msr-region":[],"msr-locale":[268875],"msr-program-audience":[],"msr-post-option":[269148,269142],"msr-impact-theme":[],"class_list":["post-1152001","msr-academic-program","type-msr-academic-program","status-publish","hentry","msr-locale-en_us","msr-post-option-approved-for-river","msr-post-option-include-in-river"],"msr_description":"","msr_social_media":[],"related-researchers":[],"tab-content":[],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program\/1152001","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-academic-program"}],"version-history":[{"count":11,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program\/1152001\/revisions"}],"predecessor-version":[{"id":1160873,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program\/1152001\/revisions\/1160873"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1152001"}],"wp:term":[{"taxonomy":"msr-opportunity-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-opportunity-type?post=1152001"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=1152001"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1152001"},{"taxonomy":"msr-program-audience","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-program-audience?post=1152001"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=1152001"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=1152001"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}