Unsupervised Object Detection Using the Azure Cognitive Services on Spark | Spark Summit Europe 2018

We present HTTP on Spark, a novel integration between Spark and the widely used Hypertext Transfer Protocol (HTTP). This library can be used to integrate any framework that is capable of communicating through HTTP into the Spark ecosystem. Furthermore, HTTP on Spark enables distributed and fault-tolerant microservice architectures that work with Spark’s dynamic allocation and Streaming capabilities. We build upon this work and release a library of idiomatic Spark bindings for a wide array of Microsoft Cognitive Services. These bindings allow users to easily add *any* cognitive service to their existing Spark and SparkML machine learning pipelines. Finally, we demonstrate how to use these services to create a large class of custom image classification and object detection systems that can learn without requiring human-labeled training examples. We demonstrate the power of these new releases with an automated Snow Leopard Detection system.
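
To make the core idea concrete, the following is a minimal sketch of the pattern the abstract describes: each row of a data partition is turned into an HTTP request, and the service's reply is attached to that row. It is not the HTTP on Spark API. The names `post_json`, `call_service`, and `fake_send`, as well as the URL, are hypothetical and chosen only for illustration, and the sketch uses the Python standard library rather than Spark so that it runs on its own.

```python
import json
from typing import Callable, Dict, Iterable, Iterator
from urllib import request


def post_json(url: str, payload: Dict) -> Dict:
    """Send one JSON POST request and decode the JSON reply (hypothetical helper)."""
    req = request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req, timeout=30) as resp:
        return json.loads(resp.read().decode("utf-8"))


def call_service(
    rows: Iterable[Dict],
    url: str,
    send: Callable[[str, Dict], Dict] = post_json,
) -> Iterator[Dict]:
    """Process one partition: send each row to the service and attach its reply."""
    for row in rows:
        yield {**row, "response": send(url, row)}


def fake_send(url: str, payload: Dict) -> Dict:
    """Stand-in for a real service so the sketch runs without a network (assumption)."""
    return {"length": len(payload["text"])}


# One "partition" of rows; the URL is a placeholder and is never contacted here.
partition = [{"text": "snow leopard"}, {"text": "rock"}]
results = list(call_service(partition, "http://localhost:8080/score", send=fake_send))
```

Because `call_service` consumes and produces an iterator of rows, a function of this shape could in principle be applied to each partition of an RDD of dictionaries with PySpark's `mapPartitions`, which is one way to distribute the requests across a cluster. The library presented in the talk may organize this differently.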
