{"id":980151,"date":"2023-11-07T09:00:00","date_gmt":"2023-11-07T17:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/toward-developing-faster-algorithms-for-minimizing-submodular-functions\/"},"modified":"2023-11-06T10:18:54","modified_gmt":"2023-11-06T18:18:54","slug":"toward-developing-faster-algorithms-for-minimizing-submodular-functions","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/toward-developing-faster-algorithms-for-minimizing-submodular-functions\/","title":{"rendered":"Toward developing faster algorithms for minimizing submodular functions"},"content":{"rendered":"\n

This research paper was presented at the 64th IEEE Symposium on Foundations of Computer Science (FOCS) 2023, a premier forum for the latest research in theoretical computer science.

\"FOCS<\/figure>\n\n\n\n

Submodular functions are versatile mathematical tools, finding diverse applications in real-world scenarios and guiding solutions across complex domains. From dissecting the intricate networks of graphs to deciphering the complexities of economic landscapes through utility functions, and even navigating the enigmatic world of random variables via entropy functions, they offer valuable insights into challenging problems. Their wide-ranging applicability has made them pivotal tools for modeling and optimization in various theoretical computer science domains, including operations research and game theory. In recent years, submodular functions have gained prominence in solving optimization problems within machine learning (ML) applications. These tasks encompass vital areas such as feature selection and clustering, as illustrated in Figure 1. Additionally, submodular functions are instrumental in applications like sensor placement and graphical models. For further exploration, comprehensive resources are available in Bilmes' insightful survey and Bach's standard textbook on this subject.

\"Two
Figure 1. Application of submodular function optimization to feature selection, on the left, and clustering on the right.<\/figcaption><\/figure>\n\n\n\n

Algorithm design for submodular function minimization

In a joint paper with researchers from Stanford University, “Sparse Submodular Function Minimization,” presented at FOCS 2023, we investigate the problem of minimizing a submodular function in the standard model. Here, we assume that the submodular function can be accessed through an evaluation oracle that returns the value \( f(S) \) in response to a query with a set \( S \). This is the most classical and well-studied model for designing algorithms that minimize submodular functions.
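To make the evaluation-oracle model concrete, here is a minimal sketch in Python (the class name, the cut-function example, and the query counter are illustrative choices, not from the paper). The algorithm may learn about \( f \) only by asking for values \( f(S) \), and the number of such calls is exactly what query complexity counts.

```python
class EvaluationOracle:
    """Wraps a hidden set function so it can only be accessed through value queries."""

    def __init__(self, f):
        self._f = f        # the hidden submodular function, mapping a set to a number
        self.queries = 0   # how many times the oracle has been queried

    def value(self, S):
        """Return f(S) for a subset S of the ground set, counting one query."""
        self.queries += 1
        return self._f(frozenset(S))


# Example: the cut function of a 4-cycle, a standard submodular function.
edges = [(0, 1), (1, 2), (2, 3), (0, 3)]

def cut(S):
    return sum(1 for u, v in edges if (u in S) != (v in S))

oracle = EvaluationOracle(cut)
print(oracle.value({0, 1}), oracle.queries)   # prints "2 1": f({0,1}) = 2 after one query
```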

Before we discuss our study, it's important to bear in mind that a submodular function \( f \) is defined on subsets of a finite set of elements \( V \) that satisfy a diminishing marginal difference property. That is, for any two subsets \( S \subseteq T \) and any element \( e \in V \setminus T \), the marginal value of \( e \) when added to the smaller set, \( f(S \cup \{e\}) - f(S) \), is at least the marginal value of \( e \) when added to the bigger set, \( f(T \cup \{e\}) - f(T) \).
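As a quick sanity check of this definition, the sketch below (a hypothetical example built on a coverage function, a standard family of submodular functions; none of it comes from the paper) exhaustively verifies the diminishing-marginal-returns inequality on a four-element ground set.

```python
from itertools import combinations

# Coverage function: f(S) = number of items covered by the sets indexed by S.
# Coverage functions are a classical example of submodular functions.
sets = {0: {1, 2}, 1: {2, 3}, 2: {3, 4, 5}, 3: {1, 5}}
V = set(sets)

def f(S):
    covered = set()
    for i in S:
        covered |= sets[i]
    return len(covered)

def marginal(e, S):
    """Marginal value f(S ∪ {e}) - f(S) of adding element e to set S."""
    return f(S | {e}) - f(S)

# Check f(S ∪ {e}) - f(S) >= f(T ∪ {e}) - f(T) for all S ⊆ T ⊆ V and e ∉ T.
for r in range(len(V) + 1):
    for T in map(set, combinations(sorted(V), r)):
        for s in range(len(T) + 1):
            for S in map(set, combinations(sorted(T), s)):
                for e in V - T:
                    assert marginal(e, S) >= marginal(e, T)

print("Diminishing marginal returns holds for this coverage function.")
```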

In the 1980s, foundational work revealed that submodular functions could be minimized in polynomial time, marking a significant breakthrough. Since then, researchers have made substantial progress in the quest for faster algorithms for submodular function minimization (SFM). Despite these efforts, fundamental questions persist, such as determining the minimum number of queries required to minimize any given submodular function, a concept referred to as the problem's query complexity.

Currently, the most advanced algorithm needs to make \( \widetilde{O}(n^2) \) queries for any given submodular function, while the best lower bound is only \( \widetilde{\Omega}(n) \), where \(n\) is the size of the ground set on which the submodular function is defined. This leaves a substantial, roughly \(n\)-fold gap between the existing upper and lower bounds.

Given this considerable difference, a natural question arises: What additional structural assumptions could potentially pave the way for faster algorithms for SFM? One prevalent assumption is sparsity, which posits that the size of the set minimizing the submodular function is small. This holds particular relevance in diverse applications, including signal processing, feature selection, and compressed sensing. In these scenarios, solutions are expected to have few non-zero entries, making it important to understand how algorithmic complexity depends on sparsity, as it provides insights into the intricate combinatorial and geometric structures of the problems.

Interestingly, existing algorithmic techniques developed over the past four decades for SFM do not yield improved runtimes even when the solution is sparse. Therefore, it is imperative to develop innovative techniques that can drive advancements in sparse SFM and bridge the existing gap between upper and lower bounds.


Parallel algorithms for submodular function minimization

Exploring beyond SFM's query complexity, recent research has shed light on the importance of sparse SFM, particularly in understanding the inherent adaptivity of parallel algorithms (known as parallel complexity) designed to solve the problem. Research has shown that any parallel algorithm for SFM requires a number of adaptive rounds that is at least polynomial in the size of the ground set.

Our results improve both parallel and sequential algorithms for SFM. For example, consider a scenario where the minimizer of the given submodular function is \(\widetilde{O}(1)\)-sparse. In this context, our parallel algorithm runs in a nearly constant number of rounds, while our sequential algorithm makes a nearly linear number of queries. This achievement stands in stark contrast with the previous best parallel upper bound of \(\widetilde{O}(n)\) and the best query complexity upper bound of \(\widetilde{O}(n^2)\).

Fast first-order methods for exact submodular function minimization

Current fast algorithms for SFM rely on cutting-plane methods, a standard class of convex optimization techniques applied to the Lovász extension, a natural continuous extension of the given submodular function. However, restricting the optimization domain to sparse solutions does not significantly expedite cutting-plane methods beyond a logarithmic factor. To address this, we shifted our approach and employed first-order methods, including stochastic mirror descent, to minimize the Lovász extension. These methods, non-Euclidean generalizations of stochastic gradient descent, are more attuned to the geometry of the problem. Unlike cutting-plane methods, whose runtime depends only polylogarithmically on the additive error relative to the optimal solution, first-order methods have a runtime that grows polynomially as this error shrinks.
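To make the role of the Lovász extension concrete, here is a simplified Python sketch of the textbook first-order approach this paragraph alludes to: compute a subgradient of the extension via the greedy ordering, run plain projected subgradient descent over the box \([0,1]^n\) (the Euclidean, deterministic special case of mirror descent), and round the final point by thresholding. All names are illustrative; this sketch only yields approximate minimization and is not the algorithm from the paper.

```python
import numpy as np

def lovasz_subgradient(f, x):
    """Subgradient of the Lovász extension of f at x, via the greedy ordering."""
    order = np.argsort(-x)              # coordinates sorted in decreasing order of x
    g = np.zeros(len(x))
    S, prev = set(), f(set())
    for i in order:
        S.add(int(i))
        cur = f(S)
        g[i] = cur - prev               # marginal value of element i along the ordering
        prev = cur
    return g

def approx_minimize(f, n, steps=2000, lr=0.05, seed=0):
    """Projected subgradient descent on the Lovász extension over [0,1]^n,
    followed by threshold rounding of the final iterate."""
    x = np.random.default_rng(seed).uniform(0, 1, n)
    for _ in range(steps):
        g = lovasz_subgradient(f, x)
        x = np.clip(x - lr * g, 0.0, 1.0)   # project back onto the box [0,1]^n
    # Round by thresholding: for an exact minimizer of the extension, almost every
    # superlevel set {i : x_i >= t} minimizes f, so we try all thresholds induced by x.
    candidates = [set(np.where(x >= t)[0].tolist()) for t in np.append(x, 1.1)]
    best = min(candidates, key=f)
    return best, f(best)

# Example: minimize the cut function of the path 0-1-2-3 (its minimum value is 0).
edges = [(0, 1), (1, 2), (2, 3)]
cut = lambda S: sum((u in S) != (v in S) for u, v in edges)
print(approx_minimize(cut, 4))          # returns a set with cut value 0, e.g. (set(), 0)
```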

This rate of convergence indicates that first-order methods are better suited for approximate submodular function minimization, while our goal is to solve it exactly. Using the sparsity assumption, we developed a new algorithmic framework for SFM based on a new concept of duality. We used this framework to demonstrate how first-order methods, with substantially reduced accuracy requirements, can be applied to solve SFM exactly.

Toward faster algorithms for SFM and its applications

These techniques not only promise advancements for sparse SFM but also provide a foundation for tackling other fundamental problems in SFM theory. Our algorithms for sparse SFM serve as valuable starting points for designing improved algorithms for related problems. They offer potential insights into developing polynomial-time algorithms for SFM with lower query and parallel complexity, opening avenues for future research.

Over the past four decades, research on submodular function minimization has traditionally focused on the global properties of the problem. Sparse SFM, in contrast, enables us to explore local and more refined structures of submodular functions. Our work introduces new algorithmic tools that better exploit these structural properties, a vital aspect for applications in ML and operations research, where special structure is common. Beyond advancing sparse SFM, our paradigm paves the way for the development of enhanced algorithms for SFM and its diverse applications.
