{"id":9514,"date":"2022-05-25T09:09:23","date_gmt":"2022-05-25T16:09:23","guid":{"rendered":"https://www.microsoft.com\/en-us\/translator/blog\/?p=9514"},"modified":"2022-05-27T14:06:47","modified_gmt":"2022-05-27T21:06:47","slug":"translate-scanned-pdf-documents-with-document-translation","status":"publish","type":"post","link":"https://www.microsoft.com\/en-us\/translator/blog\/2022\/05\/25\/translate-scanned-pdf-documents-with-document-translation\/","title":{"rendered":"Translate scanned PDF documents with Document translation"},"content":{"rendered":"

Today, the Document translation feature of Translator, a Microsoft Azure Cognitive Service, adds the ability to translate PDF documents containing scanned image content, eliminating the need for customers to preprocess them through an OCR engine before translation.<\/p>\n

Document translation became generally available on May 25, 2021, allowing customers to translate entire documents and batches of documents into more than 110 languages and dialects while preserving the layout and formatting of the original file. It supports a variety of file types, including Word, PowerPoint and PDF, and customers can use either pre-built or custom machine translation models. Document translation is also enterprise-ready: Azure Active Directory authentication provides secured access between the service and storage through Managed Identity.<\/p>\n

Translating PDFs with scanned image content has been a highly requested feature among Document translation customers. Customers find it difficult to automatically separate PDF documents that contain regular text from those that contain scanned image content. This creates workflow issues, because PDF documents with scanned image content must first be routed to an OCR engine before they are sent to Document translation.<\/p>\n

Document translation services now have the intelligence<\/p>\n