{"id":486102,"date":"2018-08-14T09:49:27","date_gmt":"2018-08-14T16:49:27","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&p=486102"},"modified":"2023-07-10T07:52:57","modified_gmt":"2023-07-10T14:52:57","slug":"project-brainwave","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/project-brainwave\/","title":{"rendered":"Project Brainwave"},"content":{"rendered":"

Project Brainwave is a deep learning platform for real-time AI inference in the cloud and on the edge. A soft Neural Processing Unit (NPU), based on a high-performance field-programmable gate array (FPGA), accelerates deep neural network (DNN) inferencing, with applications in computer vision and natural language processing.\u202fProject Brainwave is transforming computing by augmenting CPUs with an interconnected and configurable compute layer composed of programmable silicon.<\/p>\n

For example, this FPGA configuration achieved more than an order of magnitude improvement in latency and throughput on RNNs for Bing, with no batching. By delivering real-time AI and ultra-low latency without batching required, software overhead and complexity are reduced.<\/p>\n

Learn more about Project Brainwave on:<\/p>\n