{"id":835453,"date":"2022-04-27T09:00:00","date_gmt":"2022-04-27T16:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=835453"},"modified":"2022-08-18T08:26:18","modified_gmt":"2022-08-18T15:26:18","slug":"moler-creating-a-path-to-more-efficient-drug-design","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/moler-creating-a-path-to-more-efficient-drug-design\/","title":{"rendered":"MoLeR: Creating a path to more efficient drug design"},"content":{"rendered":"\n
Drug discovery has come a long way from its roots in serendipity. It is now an increasingly rational process, in which one important phase, called lead optimization<\/em>, is the stepwise search for promising drug candidate compounds in the lab. In this phase, expert medicinal chemists work to improve \u201chit\u201d molecules\u2014compounds that demonstrate some promising properties, as well as some undesirable ones, in early screening. In subsequent testing, chemists try to adapt the structure of hit molecules to improve their biological efficacy and reduce potential side effects. This process combines knowledge, creativity, experience, and intuition, and often lasts for years. Over many decades, computational modelling techniques have been developed to help predict how the molecules will fare in the lab, so that costly and time-consuming experiments can focus on the most promising compounds.<\/p>\n\n\n\n The Microsoft Generative Chemistry team<\/a> is working with Novartis to improve these modelling techniques with a new model called MoLeR. <\/p>\n\n\n\n “MoLeR illustrates how generative models based on deep learning can help transform the drug discovery process and enable our colleagues at Novartis to increase the efficiency in finding new compounds.”<\/em><\/p>Christopher Bishop, Technical Fellow and Laboratory Director, Microsoft Research Cambridge<\/em><\/cite><\/blockquote>\n\n\n\n We recently focused on predicting molecular properties using machine learning methods in the FS-Mol project<\/a>. To further support the drug discovery process, we are also working on methods that can automatically design compounds that better fit project requirements than existing candidate compounds. This is an extremely difficult task, as only a few promising molecules exist in the vast and largely unexplored chemical space\u2014estimated to contain up to 1060<\/sup> drug-like molecules. Just how big is that number? It would be enough molecules to reproduce the Earth billions of times. Finding them requires creativity and intuition that cannot be captured by fixed rules or hand-designed algorithms. This is why learning <\/em>is crucial not only for the predictive <\/em>task, as done in FS-Mol, but also for the generative <\/em>task of coming up with new structures. <\/p>\n\n\n\n In our earlier work<\/a>, published at the 2018 Conference on Neural Information Processing Systems (NeurIPS) (opens in new tab)<\/span><\/a>, we described a generative model of molecules called CGVAE. While that model performed well on simple, synthetic tasks, we noted then that further improvements required the expertise of drug discovery specialists. In collaboration with experts at Novartis, we identified two issues limiting the applicability of the CGVAE model in real drug discovery projects: it cannot be naturally constrained to explore only molecules containing a particular substructure (called the scaffold<\/em>), and it struggles to reproduce key structures, such as complex ring systems, due to its low-level, atom-by-atom generative procedure. To remove these limitations, we built MoLeR, which we describe in our new paper, “Learning to Extend Molecular Scaffolds with Structural Motifs<\/a>,” published at the 2022 International Conference on Learning Representations (ICLR)<\/a>. <\/p>\n\n\n\n
<\/a>