微軟翻譯博客 http://approjects.co.za/?big=en-us/translator/blog/ 星期五, 03 五月 2024 18:16:58 +0000 en-美國 小時 1 Azure AI 自定義翻譯器神經詞典:提供更高的術語翻譯品質  http://approjects.co.za/?big=en-us/translator/blog/2023/12/06/azure-ai-custom-translator-neural-dictionary-delivering-higher-terminology-translation-quality/ 星期三, 06 十二月 2023 14:00:56 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9839 Today, we are super excited to announce the release of neural dictionary, a significant translation quality improvement to our platform. In this blog post, we will explore the neural dictionary feature. Introduction   Neural dictionary is an extension to our dynamic dictionary and phrase dictionary features in Azure AI Translator. Both allow our users to customize the translation output by providing their....

The post Azure AI Custom Translator Neural Dictionary: Delivering Higher Terminology Translation Quality  appeared first on Microsoft Translator Blog.

]]>

Today, we are super excited to announce the release of neural dictionary, a significant translation quality improvement to our platform. In this blog post, we will explore the neural dictionary feature.

Introduction  

Neural dictionary is an extension to our dynamic dictionary and phrase dictionary features in Azure AI Translator. Both allow our users to customize the translation output by providing their own translations for specific terms or phrases. Our previous method used verbatim dictionary, which was an exact find-and-replace operation. Neural dictionary improves translation quality for sentences which may include one or more term translations by letting the machine translation model adjust both the term and the context to produce more fluent translation. At the same time, it preserves the high term translation accuracy.  

The following English-German example demonstrates differences in translation outputs between both methods when a custom terminology translation is requested: 

Input:   Basic Knowledge of <mstrans:dictionary translation=”regelmäßiges Testen”>Periodic Maintenance</mstrans:dictionary>   
Verbatim dictionary:   Grundkenntnisse der regelmäßiges Testen 
Neural dictionary:   Grundkenntnisse des regelmäßigen Testens 

Quality improvement 

The chart below illustrates the significant improvements the new feature brings on common publicly available terminology test sets in Automotive (https://aclanthology.org/2021.eacl-main.271), Health (https://aclanthology.org/2021.emnlp-main.477) and Covid-19 domains (https://aclanthology.org/2021.wmt-1.69) using our general translation models. 

We also conducted a series of customer evaluations on Custom Translator platform and neural dictionary models. We measured the translation quality gains on customer data between models with and without the neural dictionary extension. Five customers participated, covering German, Spanish, and French in different business domains.

The chart below shows the average improvement of COMET in the education domain for English-German, English-Spanish, and English-French; for general models on the left, and for customized models on the right. BLUE color bars represent general translation quality without neural dictionary and ORANGE color bars represent translation quality using neural dictionary. These are overall average improvements on the entire test sets. For segments including one or more customer’s dictionary entries (between 19% and 63%), the improvement is as high as +6.3 to +12.9 COMET points. 

 Supported languages  

  • Currently available (as of December 6, 2023): Chinese simplified, French, German, Italian, Japanese, Korean, Polish, Russian, Spanish and Swedish – to and from English.  
  • We are adding more in the future. For updates, refer to Custom Translator release notes 

How neural dictionary works 

Neural dictionary does not employ the exact find-and-replace operation when handling custom terminology translation. Instead, it translates terms or phrases from the dictionary in a way that fits best the entire context. This means that the term can be inflected or have different casing, or that the surrounding words can be adjusted, producing a more fluent and coherent translation.  

Let’s say, for example, we have the following input sentence in English and its translation into Polish without any dictionary phrases is as follows:  

Input:   We need a fast solution that will be understandable.  
Standard translation:   Potrzebujemy szybkiego rozwiązania, które będzie zrozumiałe.  

If you want to make sure that “solution” is translated as “alternatywa” (“an alternative” in English), you can add a dynamic dictionary annotation to achieve that:  

Input:   We need a fast <mstrans:dictionary translation=”alternatywa“>solution</mstrans:dictionary> that will be understandable.  
Verbatim dictionary:   Potrzebujemy szybkiego alternatywa, który będzie zrozumiały.  
Neural dictionary:   Potrzebujemy szybkiej alternatywy, która będzie zrozumiała.  

The output produced by the previous method is not fluent as grammatical gender consistency is violated. The neural dictionary produces fluent output by a) inflecting the requested replacement and b) changing the surrounding words where needed. It can also change the casing in some cases, as in the following example:  

Input:   This company’s <mstrans:dictionary translation=”akcje“>stock</mstrans:dictionary> is cheap.  
Verbatim dictionary:   akcje tej firmy jest tani.  
Neural dictionary:   Akcje tej firmy tanie.   

Neural dictionary expects that the requested translation of a term is provided in its base grammatical form. Multi-word terms are also supported and should be provided as noun phrases, i.e., words should not be lemmatized independently (for example, “Estonian parliamentary election” will be better than “Estonia parliament election”). 

How to enable neural dictionary 

For all supported languages listed above, neural dictionary is immediately available for all customers using Custom Translator platform with phrase dictionaries. Full (or dictionary only) custom model retraining is required to enable neural dictionary. 

 Recommendations 

  1. If you want to ensure that the phrase dictionary entry is used more often when working with neural dictionary, you may consider adding the phrase entry with the source part in various forms. In the above example, next to “solution _ alternatywa”, you may want to add the following entries as well: “Solution _ alternatywa”, “solutions _ alternatywy”, “Solutions _ alternatywy”.  
  2. If the goal is to ensure that a specific word or phrase is copied “as is” from the input text to the output translation when using phrase dictionary, consider enforcing verbatim dictionary as it may be more consistent.   
  3. Avoid adding translations of common or frequent words or phrases to the phrase dictionary.  

To learn more about Custom Translator and how it can help your business thrive in the global marketplace, start with the Custom Translator beginner’s guide. 

What you can do with Microsoft Custom Translator 

Build custom models with your domain specific terminology and translate real-time using the Microsoft Translator API. 

Use Microsoft Custom Translator with your translation solutions to help globalize your business and improve customer interactions. 

For more information, visit Microsoft Translator business solutions and Custom Translator release notes. 

 

 

 

 

 

The post Azure AI Custom Translator Neural Dictionary: Delivering Higher Terminology Translation Quality  appeared first on Microsoft Translator Blog.

]]>
自定義翻譯器直接模型自定義 http://approjects.co.za/?big=en-us/translator/blog/2023/11/15/custom-translator-direct-model-customization/ 2023 年 11 月 15 日星期三 17:10:30 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9813 Custom Translator has been a key translation enabler capability in Azure AI Translator for almost a decade, with customers across industries such as transportation, healthcare and financial services using it to develop custom translator models. Custom Translator launched the Neural Machine Translation (NMT) back in 2016 and has made it available through the Microsoft Translator Text REST API ever since,....

The post Custom Translator Direct Model Customization appeared first on Microsoft Translator Blog.

]]>
Custom Translator has been a key translation enabler capability in Azure AI Translator for almost a decade, with customers across industries such as transportation, healthcare and financial services using it to develop custom translator models. Custom Translator launched the Neural Machine Translation (NMT) back in 2016 and has made it available through the Microsoft Translator Text REST API ever since, incrementally adding support for languages over the years.  

Today, Microsoft is thrilled to announce the release of direct model customization to all customers. Customers have been required to create a model where one language had to be English. That changes today. This new capability allows customers to build Custom Translator models to translate documents from one non-English language to another non-English language. This method allows for greater accuracy, maintaining cultural sensitivity in the translation, as well as a better understanding of the nuances and idiomatic expressions of the source language. Direct custom translation engines can also save time and resources by eliminating the need for additional editing and proofreading, resulting in a more efficient and cost-effective translation process. 

 

Watch a 2-minute video on how to train and publish a direct custom model using Azure AI Translator.

“Transitioning through English as an intermediary in translations has shown its limitations, particularly when nuances and context, such as ‘sex’, become lost in translation. This not only risks accuracy but accumulates mistakes throughout the process. By adopting the direct model, we address these translation concerns.  For Belgium, a nation with official languages being Dutch, French and German, the business value of such an approach is huge. The tests we performed at CrossLang with the direct model have demonstrated that with only a handful of additional segments, the new engine surpasses previous benchmarks in performance,” said Tom Francis, CrossLang CEO. 

The direct model customization feature is easy to use and available to all users signed up for Translator in an Azure subscription. Users can select from a list of supported languages published in the Custom Translator release notes.

To learn more about Custom Translator and how it can help your business thrive in the global marketplace, start with the Custom Translator beginner’s guide.

 

What you can do with the Custom Translator feature in Azure AI Translator

The post Custom Translator Direct Model Customization appeared first on Microsoft Translator Blog.

]]>
微軟自定義翻譯器平台升級:提供更高的自定義翻譯品質 http://approjects.co.za/?big=en-us/translator/blog/2023/06/06/microsoft-custom-translator-platform-upgrade-delivering-higher-custom-translation-quality/ 週二, 06 六月 2023 15:52:50 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9761 As businesses increasingly look to expand their global reach, the demand for high-quality, accurate translations has never been higher. At Microsoft, we’re committed to providing the most powerful and effective translation tools available, and we’re excited to announce the latest upgrade to our Custom Translator platform. In recent years, Custom Translator was upgraded to state-of-the-art Transformer architecture as part of....

The post Microsoft Custom Translator Platform Upgrade: Delivering Higher Custom Translation Quality appeared first on Microsoft Translator Blog.

]]>

As businesses increasingly look to expand their global reach, the demand for high-quality, accurate translations has never been higher. At Microsoft, we’re committed to providing the most powerful and effective translation tools available, and we’re excited to announce the latest upgrade to our Custom Translator platform.

In recent years, Custom Translator was upgraded to state-of-the-art Transformer architecture as part of our ongoing effort to improve our custom translation and provide the best possible translation quality to our customers. The upgrade was done in two phases, first phase was completed in November 2020 and second phase was completed in August 2022.

And today, we unveiled a significant upgrade to our platform. In this blog post, we will explore the model quality trained on the new Custom Translator platform.

Custom Translator platform’s latest upgrade is still based on the Transformer architecture; however, our team of experts have been hard at work developing a new set of algorithms and techniques, grounded by customer feedback and testing, to improve the accuracy and quality of Custom Translator models when adapted to customer terminology and domain data. Thanks to these enhancements, customers can now expect more accurate translations than previous models.

To put it to the test, we conducted a series of customer evaluations on Custom Translator’s upgraded models. Five customers participated, covering German, Spanish, French, Japanese and Chinese in different business domains. The results were impressive, with significant improvements in translation quality.

The chart below shows the average improvement from the customer evaluations.

The blue solid bars show the average improvement across language pairs when comparing our strong general models to the new Custom Translator platform adapted to the customer domain terminology and data. Across the five customers and five languages, there is a 4.3 BLEU improvement of the custom models over our strongest general models. For customers with parallel data, using Custom Translator provides even higher quality translation than our general systems.

The striped green bars show the average improvement across language pairs when comparing the new Custom Translator system adapted to the customer data to the previous version of Custom Translator. Across the five customers and five languages, there is a 4.5 BLEU improvement. For customers already using Custom Translator, the new platform upgrade provides an even larger improvement.

We’re thrilled to offer this upgrade to customers and we’re confident it will help businesses around the world communicate more effectively and efficiently. Custom Translator is now even more powerful and effective than ever before.

To learn more about Custom Translator and how it can help your business thrive in the global marketplace, start with the Custom Translator beginner’s guide.

What you can do with Microsoft Custom Translator

Build custom models with your domain specific terminology and translate real-time using the Microsoft Translator API.

Use Microsoft Custom Translator with your translation solutions to help globalize your business and improve customer interactions.

For more information, visit Microsoft Translator business solutions and Custom Translator release notes.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

The post Microsoft Custom Translator Platform Upgrade: Delivering Higher Custom Translation Quality appeared first on Microsoft Translator Blog.

]]>
微軟內部版本 2023 中的翻譯器公告 http://approjects.co.za/?big=en-us/translator/blog/2023/05/23/translator-announcements-in-microsoft-build-2023/ 週二, 23 五月 2023 16:23:05 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9754 Seattle—May 23, 2023—Today during Microsoft Build 2023 we are happy to announce several tools and solutions for you to adopt Translator service with ease. Announcing Document Translation feature in Language Studio, a ready-to-use consumption experience to translate documents in batches. Enterprise can adopt this solution across their organization by simply authorizing specific employees or employee groups to use the translator’s....

The post Translator announcements in Microsoft Build 2023 appeared first on Microsoft Translator Blog.

]]>
Microsoft Build
Image having text ‘Microsoft Build’

Seattle—May 23, 2023—Today during Microsoft Build 2023 we are happy to announce several tools and solutions for you to adopt Translator service with ease.

Announcing Document Translation feature in Language Studio, a ready-to-use consumption experience to translate documents in batches. Enterprise can adopt this solution across their organization by simply authorizing specific employees or employee groups to use the translator’s resource behind it. This feature would allow them to translate documents from their local or network storage hiding the complexity of uploading source documents to a cloud storage before translation.

Learn more: Technical blog on Document Translation feature in Language Studio

Announcing Microsoft Translator V3 connector for Power Automate platform as public preview, which enables workflow automation to translate text and documents. The new translator connector enables you to automate translation of documents in various clouds, local network, or local storage. You could also automate translation of text and document attachments on a support call, emails, etc.

Learn more: Technical blog on using Microsoft Translator V3 connector

Announcing Text Translator SDK as public preview in four programming languages – .Net, Python, Java, and JS. The SDK simplifies developer experience and reduces the need to assimilate API definitions.

Learn more: Technical blog on using Translator SDK

Announcing ‘Microsoft Translator’ Plugin for AOAI & Tools for PromptFlow as private preview. The plugin will allow you to translate a source text between any of the 125+ languages and dialects supported by Azure Cognitive Services Translator service using both general and custom translation models. Whenever the user sends a prompt request to AOAI, the GPT model will decide and invoke the Microsoft Translator plugin as required. Translator and Language Detect tools for PromptFlow helps user to get better responses for various prompts submitted to GPT in non-English languages.

Learn more: Blog on Translator Plugin for AOAI & Tools for PromptFlow

The post Translator announcements in Microsoft Build 2023 appeared first on Microsoft Translator Blog.

]]>
下索布語簡介 http://approjects.co.za/?big=en-us/translator/blog/2023/04/20/introducing-lower-sorbian/ 週四, 20 四月 2023 13:00:44 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9745 Microsoft Translator now supports 129 languages  Today, we are introducing a new language to the ever-growing list of Translator languages – welcome Lower Sorbian!  Adding Lower Sorbian to Translator was made possible by the Sorbian language community, in particular the Foundation for the Sorbian People, the Sorbian Institute and the Witaj Language Centre  which donated data to create the new language system, following up....

The post Introducing Lower Sorbian appeared first on Microsoft Translator Blog.

]]>
Microsoft Translator now supports 129 languages 
Altmarkt and St. Nikolai Church in Cottbus/Chóśebuz, Brandenburg, Germany, illuminated at night
Altmarkt and St. Nikolai Church in Cottbus/Chóśebuz, Brandenburg, Germany, illuminated at night

Today, we are introducing a new language to the ever-growing list of Translator languages welcome Lower Sorbian! 

Adding Lower Sorbian to Translator was made possible by the Sorbian language community, in particular the Foundation for the Sorbian People, the Sorbian Institute and the Witaj Language Centrewhich donated data to create the new language system, following up on our collaboration to build language support for Upper Sorbian last year. 

Beate Brězan, Head of the WITAJ Language Centre, states that “the addition of Lower Sorbian to Microsoft Translator is a valuable contribution to the preservation of our cultural identity. The ability to translate between Sorbian and more than 100 languages enables our speech community to write in our own language and be understood by others, and vice-versa.” 

Bernd Melcher, Head of the WITAJ Department in Lower Lusatia, highlights the positive impact of the language’s integration into Translator: “Now we can create much-needed textbooks, study materials, and other publications at a larger scale. And our students can use their mobile devices for quick look-up of words and even save the translations – very motivating!” 

Text translation for Lower Sorbian is available now in the Microsoft Translator apps, Office, and Translator for Bing. Using Translator, a Microsoft Azure Cognitive Service, you can add Lower Sorbian text translation to your apps, websites, workflows, and tools; or use Translator’s Document Translation feature to translate entire documents, or volumes of documents, in a variety of different file formats preserving their original formatting. You can also use Translator with Cognitive Services, such as Speech or Vision, to add additional capabilities such as speech and image translation into your apps. 

About the Lower Sorbian language 

Lower Sorbian (dolnoserbšćina) is a West Slavic minority language spoken in eastern Germany in the historical province of Lower Lusatia, today part of Brandenburg. Signs in this region are typically bilingual, and the city of Cottbus (Chóśebuz) has a Lower Sorbian High School where one language of instruction is Lower Sorbian. It is a heavily endangered language with less than 7,000 speakers. 

English  Lower Sorbian 
Good morning  Dobre zajtšo 
My name is…  Mójo mě jo… 
I’m from…  Pśidu z… 

 Learn more about Lower Sorbian on Bing. 

 What you can do with Microsoft Translator 

At home
Translate real-time conversations, menus and street signs, websites, documents, and more using the Microsoft Translator app for iOS and Android.  Learn more 

At work
Globalize your business and customer interactions with customizable text and document translation using Azure Cognitive Services Translator.  Learn more 

In the classroom
Create a more inclusive classroom for both students and parents with live captioning and cross-language understanding.  Learn more 

The post Introducing Lower Sorbian appeared first on Microsoft Translator Blog.

]]>
宣佈四種新語言:孔卡尼語、邁蒂利語、信德語和僧伽羅語  http://approjects.co.za/?big=en-us/translator/blog/2023/04/18/announcing-four-new-languages-konkani-maithili-sindhi-and-sinhala/ 週二, 18 四月 2023 20:00:36 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9730 Microsoft Translator now supports a total of 16 Indian languages  Today, we are delighted to announce that we have added three new Indian languages to Microsoft Translator – Konkani, Maithili, and Sindhi. Along with this, we are also adding support for Sinhala, the official language of Sri Lanka. With this latest release, Microsoft is further democratizing access to information in....

The post Announcing four new languages: Konkani, Maithili, Sindhi and Sinhala  appeared first on Microsoft Translator Blog.

]]>
Microsoft Translator now supports a total of 16 Indian languages 

Today, we are delighted to announce that we have added three new Indian languages to Microsoft Translator – Konkani, Maithili, and Sindhi. Along with this, we are also adding support for Sinhala, the official language of Sri Lanka. With this latest release, Microsoft is further democratizing access to information in native languages for India. This brings our total number of supported languages to 128 and adds language support for 16 Indian languages including Assamese, Bengali, Gujarati, Hindi, Kannada, Konkani, Maithili, Malayalam, Marathi, Nepali, Odia, Punjabi, Sindhi, Tamil, Telugu, and Urdu. 

Text translation for the four new languages is available now in the Microsoft Translator apps, Office, and Translator for Bing. Using Translator, a Microsoft Azure Cognitive Service, you can add Indian languages’ text translation to your apps, websites, workflows, and tools; or use Translator’s Document Translation feature to translate entire documents, or volumes of documents, in a variety of different file formats preserving their original formatting. You can also use Translator with Cognitive Services, such as Speech or Vision, to add additional capabilities such as speech and image translation into your apps. 

About the Konkani language 

Konkani is primarily spoken by over 2 million people in the states of Goa, Maharashtra, and Karnataka. It is also spoken by a significant number of people in other parts of India, such as Kerala and Gujarat. 

English  Konkani 
Hello  हॅलो 
My name is…  म्हजें नांव आसा 
I’m from…  हांव हांगाच्यान… 

 Learn more about Konkani on Bing. 

About the Maithili language 

Maithili is spoken by over 75 million people in India and Nepal. Popularly written in Devanagari script, it has its own script variously called Mithilakshara, Tirhuta, and Maithili. Maithili is mainly spoken in the northern regions of Indian state of Bihar, in parts of the state of Jharkhand and in the Terai regions of Nepal. The development of Maithili was championed by Dr. Girish Nath Jha, professor of Computational Linguistics at the School of Sanskrit & Indic Studies, Jawaharlal Nehru University(New Delhi, India), Dr. Ritu Nidhi and their team of research students.  We collaborated closely with the JNU community who helped with their expertise in machine translation and resource creation. 

English  Maithili 
Hello  नमस्कार 
My name is…  हमर नाम थिक 
I’m from…  हम अहिठाम सँ छी 

 Learn more about Maithili on Bing. 

About the Sindhi language 

Sindhi is spoken by over 20 million people in Pakistan, India, and Afghanistan. 

English  Sindhi 
Hello  سلام 
My name is…  منهنجو نالو آهي 
I’m from…  مان کان آهيان 

 Learn more about Sindhi on Bing. 

About the Sinhala language 

Sinhala is spoken by over 16 million people in Sri Lanka, as well as in other countries such as Malaysia and Singapore.  

English  Sinhala 
Hello  ආයුබෝවන් 
My name is…  මගේ නම 
I’m from…  මමසිට 

 Learn more about Sinhala on Bing. 

 What you can do with Microsoft Translator 

At home
Translate real-time conversations, menus and street signs, websites, documents, and more using the Microsoft Translator app for iOS and Android.  Learn more 

At work
Globalize your business and customer interactions with customizable text and document translation using Azure Cognitive Services Translator.  Learn more 

In the classroom
Create a more inclusive classroom for both students and parents with live captioning and cross-language understanding.  Learn more 

 

The post Announcing four new languages: Konkani, Maithili, Sindhi and Sinhala  appeared first on Microsoft Translator Blog.

]]>
文件翻譯現已在語言工作室中提供 http://approjects.co.za/?big=en-us/translator/blog/2023/03/29/document-translation-in-language-studio/ 週三, 29 三月 2023 15:30:36 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9721 We are excited to announce the release of a new UI feature to translate documents in language studio without writing a single line of code. Enterprises can deploy this solution across their organization, enabling employees to translate documents on demand. The feature is powered by Azure Cognitive Services (ACS) Translator document translation API, which can translate documents in variety of....

The post Document Translation now available in the Language Studio appeared first on Microsoft Translator Blog.

]]>
A translated document
Paper tagged as ‘Translation’ on wooden table with books and mug

We are excited to announce the release of a new UI feature to translate documents in language studio without writing a single line of code. Enterprises can deploy this solution across their organization, enabling employees to translate documents on demand. The feature is powered by Azure Cognitive Services (ACS) Translator document translation API, which can translate documents in variety of formats preserving the original structure and format as in the source document.

Document translation experience in the language studio enables customers to easily evaluate and adopt the service by simply configuring their Azure Translator and Storage resources. Customers could scale and control the usage through Azure Active Directory (AAD) authentication.

This new experience leverages the full capabilities of the document translation service and offers more. You could translate documents from either your local system or Azure blob storage. Likewise, the translated documents could be either downloaded to your local system or stored in Azure blob storage. Optionally, you could specify the glossary or custom models to be used during translation.

Why should I use the language studio to translate documents?

    • Enterprise ready UI solution to translate documents securely.
    • Language studio does not store customer data.
    • Scale and control the usage through AAD authentication.
    • You can use your existing Azure account and translator resources.
    • Since it is a web app, you can access the feature from any operating system.

How do I use this feature?

    1. Sign In into the language studio using your Azure credentials.
    2. Click on ‘Document translation (Preview)’ tile.
    3. Configure your setup by choosing your Translator resource and Azure Storage account.
    4. Select source and target language(s).
    5. Choose the source document(s) from your local system or blob storage.
    6. Choose the destination for translated files as either blob storage upload or local system download.
    7. Translate!

You can monitor the status of current and previously submitted jobs through the ‘Job History’ page.

References:

The post Document Translation now available in the Language Studio appeared first on Microsoft Translator Blog.

]]>
介紹13種新的非洲語言 http://approjects.co.za/?big=en-us/translator/blog/2023/03/08/introducing-13-new-african-languages/ 週三, 08 3月 2023 17:30:42 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9697 Microsoft Translator increases number of supported languages to 124! Today, we are excited to announce that we have added 13 new African languages to Microsoft Translator! Following last year’s release of Somali and Zulu, this release highlights our continued efforts of empowering African language communities to achieve more. Microsoft Translator now supports: chiShona (7 million speakers) Hausa (72 million speakers)....

The post Introducing 13 New African Languages appeared first on Microsoft Translator Blog.

]]>
The image shows the English phrase “Hello” and its translation into the set of African languages described in this blog post.Microsoft Translator increases number of supported languages to 124!

Today, we are excited to announce that we have added 13 new African languages to Microsoft Translator! Following last year’s release of Somali and Zulu, this release highlights our continued efforts of empowering African language communities to achieve more. Microsoft Translator now supports:

  • chiShona (7 million speakers)
  • Hausa (72 million speakers)
  • Igbo (40 million speakers)
  • Kinyarwanda (10 million speakers)
  • Lingala (65 million speakers)
  • Luganda (21 million speakers)
  • Nyanja (1 million speakers)
  • Rundi (12 million speakers)
  • Sesotho (14 million speakers)
  • Sesotho sa Leboa (15 million speakers)
  • Setswana (13 million speakers)
  • Xhosa (10 million speakers)
  • Yoruba (55 million speakers)

This brings our total number of supported languages to 124 and adds language support for more than 335 million people in Africa and worldwide.

Text translation for the 13 new African languages is available now in the Microsoft Translator apps, Office, and Translator for Bing. Using Translator, a Microsoft Azure Cognitive Service, you can add African languages’ text translation to your apps, websites, workflows, and tools; or use Translator’s Document Translation feature to translate entire documents, or volumes of documents, in a variety of different file formats preserving their original formatting. You can also use Translator with Cognitive Services such as Speech or Computer Vision to add additional capabilities such as speech-to-text and image translation into your apps.

About the chiShona language

The chiShona language is spoken by 7 million people in Zimbabwe and Mozambique. The language is in the Bantu language family. Here are some useful phrases in chiShona:

English chiShona
Hello Mhoro
My name is… Zita randu ndi…
I’m from… Ndinobva…

Learn more about chiShona on Bing.

About the Hausa language

The Hausa language is spoken by 72 million people in Niger, Nigeria, and Benin, as well as throughout other West African countries. The language is in the Chadic branch of the Afroasiatic language family. Here are some useful phrases in Hausa:

English Hausa
Hello Sannu
My name is… Sunana…
I’m from… Ni dan…

Learn more about Hausa on Bing.

About the Igbo language

The Igbo language is spoken by 40 million people in Nigeria, Equatorial Guinea, and Cameroon. The language is in the Niger-Congo language family. Here are some useful phrases in Igbo:

English Igbo
Hello Ndewo
My name is… Aha m bụ…
I’m from… Abụ m onye…

Learn more about Igbo on Bing.

About the Kinyarwanda language

The Kinyarwanda language is spoken by 10 million people in Rwanda, Uganda, DR Congo, and Tanzania. The language is in the Bantu language family. It is officially known as Ikinyarwanda. Here are some useful phrases in Kinyarwanda:

English Kinyarwanda
Hello Muraho
My name is… Nitwa…
I’m from… Nturuka…

Learn more about Kinyarwanda on Bing.

About the Lingala language

The Lingala language is spoken by 65 million people in DR Congo and the Republic of the Congo. The language is in the Bantu language family. Here are some useful phrases in Lingala:

English Lingala
Hello Mbote
My name is… Kombo nanga eza…
I’m from… Naza mutu ya ekolo…

Learn more about Lingala on Bing.

About the Luganda language

The Luganda language is spoken by 21 million people in Uganda. The language is in the Bantu language family. Here are some useful phrases in Luganda:

English Luganda
Hello Nkulamusizza
My name is… Erinnya lyange nze…
I’m from… Nva…

Learn more about Luganda on Bing.

About the Nyanja language

The Nyanja language is spoken by 1 million speakers in Malawi, Mozambique, and Zambia. The language is in the Bantu language family. It is also known as Chewa. Here are some useful phrases in Nyanja:

English Nyanja
Hello Moni
My name is… Dzina langa ndi…
I’m from… Ndimachokera ku…

Learn more about Nyanja on Bing.

About the Rundi language

The Rundi language is spoken by 12 million speakers in Burundi, Rwanda, Tanzania, DR Congo, Uganda, and Kenya. The languages is in the Bantu family of languages. Here are some useful phrases in Rundi:

English Rundi
Hello Ni gute
My name is… Amazina yanje ni…
I’m from… Aho navukiye…

Learn more about Rundi on Bing.

About the Sesotho language

The Sesotho language is spoken by 14 million speakers in Lesotho, South Africa, and Zimbabwe. The language is in the Bantu family of languages. It is also known as Sotho or Souther Sotho. Here are some useful phrases in Sesotho:

English Sesotho
Hello Dumela
My name is… Lebitso la ka ke…
I’m from… Ke tswa…

Learn more about Sesotho on Bing.

About the Sesotho sa Leboa language

The Sesotho sa Leboa language is spoken by 15 million speakers in South Africa. The language is in the Bantu family of languages. It is also known as Northern Sotho or Sepedi. Here are some useful phrases in Sesotho sa Leboa:

English Sesotho sa Leboa
Hello Dumela
My name is… Leina laka ke…
I’m from… Ketswa…

Learn more about Sesotho sa Leboa on Bing.

About the Setswana language

The Setswana language is spoken by 13 million people in Botswana, South Africa, Zimbabwe, and Namibia. Here are some useful phrases in Setswana:

English Setswana
Hello Dumela
My name is… Leina lame ke…
I’m from… Ke tswa kwa…

Learn more about Setswana on Bing.

About the Xhosa language

The Xhosa language is spoken by 10 million people in South Africa, Zimbabwe, and Botswana. It is in the Bantu family of languages. Here are some useful phrases in Xhosa:

English Xhosa
Hello Molo
My name is… Igama lam ngu…
I’m from… Ndisuka e…

Learn more about Xhosa on Bing.

About the Yoruba language

The Yoruba language is spoken by 55 million people in Nigeria, Benin, and Togo. Here are some useful phrases in Yoruba:

English Yoruba
Hello Ẹ pẹ̀lẹ́ o
My name is… Orúkọ mi ni…
I’m from… Mo wá láti…

Learn more about Yoruba on Bing.

What you can do with Microsoft Translator

At home
Translate real-time conversations, menus and street signs, websites, documents, and more using the Microsoft Translator app for iOS and Android.  Learn more

At work
Globalize your business and customer interactions with customizable text and document translation using Azure Cognitive Services Translator.  Learn more

In the classroom
Create a more inclusive classroom for both students and parents with live captioning and cross-language understanding.  Learn more

 

The post Introducing 13 New African Languages appeared first on Microsoft Translator Blog.

]]>
必應的性別翻譯解決了翻譯中的偏見 http://approjects.co.za/?big=en-us/translator/blog/2023/03/08/bings-gendered-translations-tackle-bias-in-translation/ 週三, 08 三月 2023 08:00:07 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9679 We’re excited to announce that, as of today, masculine and feminine alternative translations are available for when translating from English to Spanish, French, or Italian. You can try out this new feature in both Bing Search and Bing Translator verticals. Over the last few years, the field of Machine Translation (MT) has been revolutionized by the advent of transformer models,....

The post Bing’s gendered translations tackle bias in translation appeared first on Microsoft Translator Blog.

]]>
Gender de-bias
3D rendering of gender symbols.

We’re excited to announce that, as of today, masculine and feminine alternative translations are available for when translating from English to Spanish, French, or Italian. You can try out this new feature in both Bing Search and Bing Translator verticals.

Over the last few years, the field of Machine Translation (MT) has been revolutionized by the advent of transformer models, leading to tremendous improvements in quality. However, models optimized to capture the statistical properties of data collected from the real world inadvertently learn or even amplify social biases found in that data.

Our latest release is a step towards reducing one of these biases, specifically gender bias that is prevalent in MT systems. Bing Translator has always produced a single translation for an input sentence even when the translations could have had other gender variations including feminine and masculine variants. In accordance with the Microsoft responsible AI principles, we want to ensure we provide correct alternative translations and are more inclusive to all genders. As part of this journey our first step is to provide feminine and masculine translation variants.

Gender is expressed differently across different languages. For example, in English, the word lawyer could refer to either a male or female individual, but in Spanish, abogada would refer to a female lawyer, while abogado would refer to a male one. In the absence of information about the gender of a noun like ‘lawyer’ in a source sentence, MT models may resort to selecting an arbitrary gender for the noun in the target language. Often, these arbitrary gender assignments align with stereotypes, perpetuating harmful societal bias (Stanovsky et al., 2019; Ciora et al., 2021) and leading to translations that are not fully accurate.

In the example below, you notice that while translating gender-neutral sentences from English to Spanish, the translated text follows the stereotypical gender role, i.e., lawyer is translated as being male.

Translation with gender bias
Screenshot of translation of English text “Let’s get our lawyer’s opinion on this issue.” into Spanish language having gender bias.

As there is no context in the source sentence that implies the gender of the lawyer, producing a translation with the assumption of either a male or female lawyer would both be valid. Now, Bing Translator produces translations with both feminine and masculine forms.

Translation of gender ambiguous English Text into Spanish
Screenshot of translation of English text “Let’s get our lawyer’s opinion on this issue.” into Spanish language having gender specific translations.

System design

We aimed to design our system to meet the following key criteria for providing gendered alternatives:

  1. The feminine and masculine variants should have minimal differences except for those needed to convey gender.
  2. We wanted to cover a wide range of sentences where multiple gendered alternatives are possible.
  3. We wanted to ensure that the translations preserve the meaning of the original source sentence.

Detecting gender ambiguity

In order to accurately detect gender ambiguity in source text, we utilize a coreference model to analyze inputs containing animate nouns. For instance, if a given input text contains a gender-neutral profession word, we only want provide gendered alternatives for it when its gender can’t be determined by other information in the sentence. For example: On translating an English sentence “The lawyer met her driver at the hotel lobby.” into French we can determine that the lawyer is female, while the gender of the driver is unknown.

Translation of gender ambiguous English Text into French
Screenshot of translation of English text “The lawyer met her driver at the hotel lobby.” into French language.

Generating alternate translation

When the source sentence is ambiguously gendered, we examine our translation system’s output to decide if an alternative gender interpretation is possible. If so, we proceed to determine the best way to revise the translation. We begin by constructing a set of candidate target translations by rewriting the original translation. We apply linguistic constraints based on dependency relations to ensure consistency in the proposed alternatives and prune the erroneous candidates.

However, in many cases, even after applying our constraints, we are left with multiple candidate rewrites for the gendered alternative translation. To determine the best option, we evaluate each candidate by scoring it with our translation model. By leveraging the fact that a good gender rewrite will also be an accurate translation of the source sentence, we are able to ensure high accuracy in our final output.

System design of gender re-inflection
A diagram showing system design of gender re-inflection.

Leveraging managed online endpoints in Azure Machine Learning

The gendered alternative feature in Bing is hosted on managed online endpoints in Azure Machine Learning. Managed online endpoints provide a unified interface to invoke and manage model deployments on Microsoft-managed compute in a turnkey manner. They enable us to take advantage of scalable and reliable endpoints without being concerned about infrastructure management. This inference environment also enables the processing of large numbers of requests with low latency. Our ability to create and deploy the gender debias service with the latest frameworks and technologies has been greatly improved through the use of managed inference features in Azure Machine Learning. By leveraging these features, we have been able to maintain low COGS (Cost of Goods Sold) and ensure straightforward security and privacy compliance.

How can you contribute?

To facilitate progress in gender bias reduction in MT, we are releasing a test corpus containing gender-ambiguous translation examples from English into Spanish, French and Italian. Each English source sentence is accompanied by multiple translations, covering each possible gender variation.

Our test set is constructed to be challenging, morphologically rich and linguistically diverse. This corpus has been instrumental in our development process. It was developed with the help of a bilingual linguists with significant translation experience. We are also releasing a technical paper that discusses the test corpus in detail and the methodology and tools for evaluation.

GATE: A challenge set for Gender-Ambiguous Translation Examples – Paper

GATE: A challenge set for Gender-Ambiguous Translation Examples – Test set

Path forward

Through this work we aim to improve the quality of MT output in cases of ambiguous source gender, as well as facilitate the development of better and more inclusive natural language processing (NLP) tools in general. Our initial release focuses on translating from English to Spanish, French, and Italian. Going forward, we plan to expand to new language pairs, as well as cover additional scenarios and types of biases.

Credits:

Ranjita Naik, Spencer Rarrick, Sundar Poudel, Varun Mathur, Jeshwanth Kumar Chandrala, Charan Mohan, Lee Schwartz, Steven Nguyen, Amit Bhagwat, Vishal Chowdhary.

The post Bing’s gendered translations tackle bias in translation appeared first on Microsoft Translator Blog.

]]>
推出重新設計的適用於iOS的微軟翻譯應用程式 http://approjects.co.za/?big=en-us/translator/blog/2023/02/28/introducing-the-redesigned-microsoft-translator-app-for-ios/ 週二, 28 二月 2023 19:00:49 +0000 http://approjects.co.za/?big=en-us/translator/blog/?p=9673 We are excited to announce the redesign of the Microsoft Translator app for iOS devices. Microsoft Translator is here to help you and connect with people from around the globe – whether you are travelling and or navigating unfamiliar streets. We have worked hard to design a user-friendly, accessible and intuitive interface and upgraded it with advanced AI technologies. Discover....

The post Introducing the redesigned Microsoft Translator app for iOS appeared first on Microsoft Translator Blog.

]]>
We are excited to announce the redesign of the Microsoft Translator app for iOS devices. Microsoft Translator is here to help you and connect with people from around the globe – whether you are travelling and or navigating unfamiliar streets. We have worked hard to design a user-friendly, accessible and intuitive interface and upgraded it with advanced AI technologies. Discover what’s new in this free app: Microsoft Translator on the App Store (apple.com)

Language centric translation with rolling history

With an intuitive interface built on top of upgraded Azure Cognitive Services providing real-time translations, Microsoft Translator app allows you to translate text, speech, and images in more than 100 languages without having to worry about changing languages as you hop from speech translation to image translation or to text translation. Start by selecting your translation languages and the app will indicate which modes are supported for the language pair, such as speech translation, image translation and/or text translation. View your translation history on the same page as you continue translating new phrases.

Improved speech translation experience with continuous Language Identification (LID)

Our most exciting new advancement is an improved speech translation experience which lets AI auto-detect your spoken language. We have incorporated a new AI technology in Cognitive Services, continuous Language Identification (LID), into speech translation where two people can have natural and continuous conversations without the interruption of selecting source/target audio languages or tapping the microphone each time someone speaks. We have added a new microphone with stars icon which appears when both selected languages support continuous LID.

Improved image translation experience

We transitioned from an offline OCR (Optical Character Recognition) engine to the best-in-class online OCR engine powered by Azure Cognitive Services. This new experience offers high quality image translation with expanded language coverage.

Enriched multi-device conversation with transcript and auto-generated meeting topics

Want to have long conversations with someone or a group who is not fluent in your language? Use the conversation feature by tapping on the icon located on top-right corner of the app to transcribe a single person’s speech or translate multiple users within a group. Stay on top of conversations by accessing the transcripts and using meeting topics to navigate within a transcript.

Don’t let language barriers hold you back. With support for more than 100 languages (https://aka.ms/applanguages), you’ll be able to communicate with ease. Download Microsoft Translator app now and see how you can connect with people from different cultures and backgrounds. We will release a similar redesigned experience in Android soon!

Check out the Microsoft Translator app video.

The post Introducing the redesigned Microsoft Translator app for iOS appeared first on Microsoft Translator Blog.

]]>