Neural Text to Speech extends support to 15 more languages with state-of-the-art AI quality

Neural Text to Speech extends support to 15 more languages with state-of-the-art AI quality

This article is contributed. See the original author and article here.

Neural Text to Speech extends support to 15 more languages with state-of-the-art AI quality

 

This post was co-authored by Sheng Zhao, Anny Dow, Garfield He and Lei He.  

 

Neural Text to Speech, part of Speech in Azure Cognitive Services, enables you to convert text to lifelike speech for more natural interfaces. Neural Text to Speech (Neural TTS) enables a wide range of scenarios, from audio content creation to natural-sounding voice assistants. Companies like the BBC and Motorola Solutions are using Text to Speech in Azure to develop conversational interfaces for their voice assistants.

 

To make it possible for more developers to add natural-sounding voices to their applications and solutions, today, we’re building on our language support with 15 new Neural TTS voices along with significant voice quality improvements.

 

Language support extended with 15 new voices

Our new Neural TTS voices include: Salma in Arabic (Egypt), Zariyah in Arabic (Saudi Arabia), Alba in Catalan (Spain), Christel in Danish (Denmark), Neerja in English (India), Noora in Finnish (Finland), Swara in Hindi (India), Colette in Dutch (Netherland), Zofia in Polish (Poland), Fernanda in Portuguese (Portugal), Dariya in Russian (Russia), Hillevi in Swedish (Sweden), Achara in Thai (Thailand), HiuGaai in Chinese (Cantonese, Traditional) and HsiaoYu in Chinese (Taiwanese Mandarin).

 

Hear samples of the voices, or try them with your own text in our demo.

 

Locale code

Language

Voice name

Audio sample

ar-EG

Arabic (Egypt)

“ar-EG-SalmaNeural”

أتساءل ماذا يمكن ان يحدث لجسمك عندما تأكل الزنجبيل كل يوم لمدة شهر؟

ar-SA

Arabic (Saudi Arabia)

“ar-SA-ZariyahNeural”

لديك نصف ساعة فقط؟

 

ca-ES

Catalan (Spain)

“ca-ES-AlbaNeural”

L’obra és el retrat d’un moment històric de mobilització popular.

 

da-DK

Danish (Denmark)

“da-DK-ChristelNeural”

Halvfjerds procent af din krop består af vand

 

en-IN

English (India)

“en-IN-NeerjaNeural”

How about coming to the barbecue at the tennis club?

 

fi-FI

Finnish (Finland)

“fi-FI-NooraNeural”

Tavoitteena on lisätä kohtuuhintaisten vuokra-asuntojen määrää kasvukeskuksissa.

 

hi-IN

Hindi (India)

“hi-IN-SwaraNeural”

‘आयरन’ शब्द किस खेल से सम्बन्धित है ?

 

nl-NL

Dutch (Netherland)

“nl-NL-ColetteNeural”

Alle oceanen zijn met elkaar verbonden en vormen samen één grote massa zout water. 

 

pl-PL

Polish (Poland)

“pl-PL-ZofiaNeural”

Wyjazd z Poznania planujemy o godzinie czwartej rano.

 

 

pt-PT

Portuguese (Portugal)

“pt-PT-FernandaNeural”

Amanhã vai estar tanto calor que vou à praia.

 

ru-RU

Russian (Russia)

“ru-RU-DariyaNeural”

В качестве примера он привел искусственный интеллект, беспилотную технику, генетику, медицину и образование.

 

sv-SE

Swedish (Sweden)

“sv-SE-HilleviNeural”

Ett kul och intressant avsnitt även för dig som inte var på plats!

 

 

th-TH

Thai (Thailand)

“th-TH-AcharaNeural”

เขาทำด้วยหัวใจบริสุทธิ์และต้องการให้ความยุติธรรมแก่ประชาชน

 

zh-HK

Chinese (Cantonese, Traditional)

“zh-HK-HiuGaaiNeural”

了解該等基金的三大特點,有助投資者作出更明智的選擇。

 

zh-TW

Chinese (Taiwanese Mandarin)

“zh-TW-HsiaoYuNeural”

如果一個人從一所優秀大學畢業,可能意味他有能力做大事。

 

 

Text-to-speech quality is measured by Mean Opinion Score (MOS), a widely-recognized scoring method for speech quality evaluation. For MOS studies, participants rate speech characteristics such as sound quality, pronunciation, speaking rate, and articulation on a 5-point scale. According to several MOS tests we have done (n>50 for each study), the average MOS score for the 15 new Neural TTS voices is above 4.1, about +0.5 higher than the scores for standard (non-neural) voices.

 

See the full language list for Neural TTS and standard voices.  

 

Voice quality and performance improved with state-of-the-art neural speech synthesis models

Neural TTS initially achieved near-human-parity on sentence reading using a recurrent neural network (RNN) based sequence-to-sequence model. Inspired by the Transformer model—a powerful sequence-to-sequence modeling architecture that advanced the state-of-the-art in neural machine translation (NMT), Microsoft researchers piloted the Transformer and FastSpeech models on Neural TTS and saw significant improvements in performance and efficiency. The Transformer TTS model is based on the auto-regressive Transformer structure, which can produce speech output in the quality close to the actual human voices with 5x less training time. FastSpeech is a new text-to-speech model that improves speech synthesis speed, accuracy, and controllability.

 

New neural voice model creation based on teacher-student transfer learningNew neural voice model creation based on teacher-student transfer learning

 

Multi-lingual and multi-speaker TTS recordings are first used to train a transformer base model.  To scale TTS development for many locales and voices, it is vital to have a highly agile development process. We built a “transformer teacher model” with 3,000+ hours of speech data from hundreds of speakers in 50+ languages/locales —about 50x of a typical single language multi-speaker model.

 

By using around 2 hours of a target speaker’s data, we can now adapt the multi-lingual multi-speaker transformer teacher model to generate a new high quality model for the speaker that sounds very similar to the original recording.  Then we can use a “finetuned teacher” to generate training data with rich content coverage to train a FastSpeech “student” for deployment that achieves the same quality as its finetuned teacher.

 

With this powerful multi-lingual model, we are also able to take the voice samples from one speaker in one language as input and transfer the voice into another target language, without losing quality.

 

With the Transformer and FastSpeech models, key improvements include:

Quality enhancements: The new models achieved significant MOS improvements over the previous robust LSTM-based Neural TTS models in our platform. For example, we did a side-by-side comparison on de-DE Kajta voice; the new model shows +0.4 comparative MOS gain over the baseline. 

 

Higher performance: With the new models, users can get high quality Neural TTS output with faster response time. FastSpeech “students” have 10X inference speedup on mel-spectrogram generation using M60 GPUs compared to our previous production systems. Neural TTS can run 40% faster on a Kubernetes GPU Pod. We can also run Neural TTS on CPU with 0.06 RTF (Real Time Factor), which means 1 second of audio can be generated in 60ms on a Kubernetes CPU Pod.

 

Language-specific improvements

When developing Neural TTS for new languages, there are also language-specific challenges that need to be addressed to ensure high voice quality and performance. 

 

For example, to make synthetic speech sound humanlike, it is critical to get pitch accents right. Japanese (ja-JP) poses challenges for speech synthesis because of its complicated pitch accents. However, most end-to-end TTS systems cannot perform well on pitch accents; we found that about 60% of production system’s problems in Japanese synthesis are related to intonation and accents.

 

Language-specific pitch accent prediction modelLanguage-specific pitch accent prediction model

 

We built a transformer model to predict and account for pitch accent related features. The accent model predicts accent phrase boundaries and accent type information, and these accent features are introduced into the acoustic model. The teacher model and student model will use the accent features in training and synthesis.

 

With the pitch accent features, the voice quality improves significantly. Our MOS test shows that the new ja-JP voice, Nanami, has a +0.3 improvement in MOS score compared to the previous production system. This method is also applicable to other languages with pitch accents.

 

Here are some samples:

Text

Sample of the old model without pitch accent support

Sample of new model with pitch accent support

1日2食に切り替える予定だ。

 

 

被災地には僕らの番組のため今も毎週のように行っています。

 

 

 

Create a custom voice with Neural TTS technology

The latest technical advancements in Neural TTS are also available in the Custom Neural Voice capability, enabling organizations to create a unique brand voice in multiple languages with 5-10X less data. 

 

Learn more about the process for getting started with Custom Neural Voice.

 

Get started 

With these updates, we’re excited to be powering natural and intuitive voice experiences for more customers. Text to Speech has more than 110 standard voices in over 40 languages and locales in addition to our growing list of Neural TTS voices.  .

 

For more information:

Improve performance, security, and reliability for your Azure VMs with Azure Advisor

Improve performance, security, and reliability for your Azure VMs with Azure Advisor

This article is contributed. See the original author and article here.

Running virtual machines in Azure is great. However, there are a lot of things you need to think about to improve performance, security, and reliability. Your cloud environment is also constantly changing, so you will need to check your Azure VMs from time to time. Luckily, there is a service called Azure Advisor which is a personalized cloud consultant that helps you follow best practices to optimize your Azure deployments. It analyzes your resource configuration and usage telemetry and then recommends solutions that can help you improve the cost-effectiveness, performance, reliability, and security of your Azure resources.

 

With Advisor, you can:

  • Get proactive, actionable, and personalized best practices recommendations.
  • Improve the performance, security, and reliability of your resources, as you identify opportunities to reduce your overall Azure spend.
  • Get recommendations with proposed actions inline.

 

How to check your Azure Advisor recommendations for your Azure virtual machines (VMs)

You can access Azure Advisor, for all Azure services through the Azure portal or directly in as an option in the Azure VM navigation.

 

Azure Advisor recommendations for Azure VMsAzure Advisor recommendations for Azure VMs

 

From here you can read more about the recommendation and get more details, as well as take action.

 

You can also create an Azure Advisor recommendation digest, so you can find your recommendations directly in your inbox.

 

Azure Advisor recommendation digestAzure Advisor recommendation digest

 

 

Conclusion

Azure Advisor is a great tool to get recommendations for not just your Azure virtual machines, but also for other Azure services. If you want to learn more check out Microsoft Docs.

Improve performance, security, and reliability for your Azure VMs with Azure Advisor

Microsoft Azure Advisor: How to Improve performance, security, and reliability of your Azure VMs

This article is contributed. See the original author and article here.

Running virtual machines in Azure is great. However, there are a lot of things you need to think about to improve performance, security, and reliability. Your cloud environment is also constantly changing, so you will need to check your Azure VMs from time to time. Luckily, there is a service called Azure Advisor which is a personalized cloud consultant that helps you follow best practices to optimize your Azure deployments. It analyzes your resource configuration and usage telemetry and then recommends solutions that can help you improve the cost-effectiveness, performance, reliability, and security of your Azure resources.

 

With Advisor, you can:

  • Get proactive, actionable, and personalized best practices recommendations.
  • Improve the performance, security, and reliability of your resources, as you identify opportunities to reduce your overall Azure spend.
  • Get recommendations with proposed actions inline.

 

How to check your Azure Advisor recommendations for your Azure virtual machines (VMs)

You can access Azure Advisor, for all Azure services through the Azure portal or directly in as an option in the Azure VM navigation.

 

Azure Advisor recommendations for Azure VMsAzure Advisor recommendations for Azure VMs

 

From here you can read more about the recommendation and get more details, as well as take action.

 

You can also create an Azure Advisor recommendation digest, so you can find your recommendations directly in your inbox.

 

Azure Advisor recommendation digestAzure Advisor recommendation digest

 

 

Conclusion

Azure Advisor is a great tool to get recommendations for not just your Azure virtual machines, but also for other Azure services. If you want to learn more check out Microsoft Docs.

Join the Excel product team at the Excel Virtually Global summit starting Tue, 21 July, 17:00 GMT

This article is contributed. See the original author and article here.

In its 5th consecutive year, the Excel Virtually Global summit will start on Tue, 21 July, 17:00 GMT. This year’s event will benefit Covid-19 research.

 

Bringing together over 50 Microsoft’s Most Valuable Professionals (MVPs), leading experts across communities around the world, and Excel product managers (PMs) at Microsoft, the event will offer over 50 hours of in-depth discussions and presentations on Microsoft Excel, Power BI, and more – most sessions will be recorded for post-event viewing. See the full list of speakers here and event program here.

 

Sessions by Excel PMs include:

Date

Time (GMT)

Session

Speaker

Language

Tue 21 July

21:00

Collaboration

Adam Callens

English

Wed 22 July

2:00

Build Add-ins for Microsoft 365

Raymond Lu

Mandarin

Wed 22 July

20:00

Build Add-ins for Microsoft 365

Keyur Patel

English

Tue 23 July

0:00

Intelligence

Mar Gines Marin

English

Thu 23 July

9:00

What’s New in Microsoft Excel

Guy Hunkin, Danielle Rifinski Fainman, and Jonathan Kahati

English

Tue 23 July

21:00

Office Scripts

Sudhi Ramamurthy

English

Tue 23 July

23:00

Excel and Power Automate

Harysh Menon

English

 

REGISTER today! All profits will go to Covid-19 research.

 

HDInsight Tools for Visual Studio Code: Create, Run and Debug Notebook

HDInsight Tools for Visual Studio Code: Create, Run and Debug Notebook

This article is contributed. See the original author and article here.

We are pleased to announce the Visual Studio Code Notebook support for HDInsight clusters in the HDInsight Spark & Hive Extension. The new feature facilitates you to perform Jupyter like Notebook operations and boosts collaborations with one-click conversion between IPYNB and PY files. 

 

The Visual Studio Code Notebook supports working with Jupyter Notebooks natively, allowing you to create a new IPYNB Notebook, open an existing Notebook, and perform cell level operations for code and markdown cells. Moreover, you can fully enjoy the language service for Python, debug your Notebook, and watch variables. 

 

The HDInsight Spark & Hive extension also delivers seamless integration with HDInsight clusters. You can quickly access to HDInsight standard Clusters, ESP Clusters and HIB clusters through Azure sign in or link to a cluster manually for PySpark interactive query and batch job submission.  

 

Key customer benefits   

 
  • Single Azure sign-in to access HDInsight clusters including ESP and HIB clusters.

  • Create, open, and save a Jupyter Notebook (IPYNB)

  • Work with cells in the Notebook Editor

  • IntelliSense support in the Jupyter Notebook Editor

  • View, inspect, and filter variables through the Variable explorer and Data viewer

  • Debug a Jupyter Notebook

  • Run Notebook against HDInsight clusters for PySpark query.

 

How to get started

 

First, install Visual Studio Code and download Mono 4.2.x (for Linux and Mac). Then, get the latest HDInsight Tools by going to the Visual Studio Code Extension repository or the Visual Studio Code Marketplace and searching Spark & Hive Tools.

Spark & Hive.png

For more information about the HDInsight Spark & Hive Tools for Visual Studio Code, please see the following resources:

If you have questions, feedback, comments, or bug reports, please send a note to hdivstool@microsoft.com.

Announcements Big Data HDInsight Spark Hive Visual Studio Code