ACM

Communications of the ACM

Home/News/Toward Speech Recognition for Uncommon Spoken Languages/Full Text

ACM TechNews

Toward Speech Recognition for Uncommon Spoken Languages

By MIT News
November 5, 2021
Comments

View as: Print Mobile App Share:

Translation of a phrase from Wolof to English. — PARP is a new technique that reduces computational complexity of an advanced machine learning model so it can be applied to perform automated speech recognition for rare or uncommon languages, like Wolof, which is spoken by 5 million people in West Africa

Credit: Jose-Luis Olivares/MIT

Massachusetts Institute of Technology researchers have developed the Prune, Adjust, and Re-Prune (PARP) technique to simplify an advanced speech-learning model to learn uncommon spoken languages more easily.

It entails eliminating unnecessary components of the Wave2vec 2.0 neural network, then making small adjustments so it can recognize a specific language.

Wave2vec 2.0 is pretrained to learn basic speech from raw audio, and requires massive computing power to train on specific languages.

The researchers pruned network connections that were unnecessary for learning language, then trained the subnetwork with sets of labeled Spanish and French speech, which had 97% overlap.

PARP outperformed other common pruning techniques for speech recognition, especially when trained on a very small amount of transcribed speech.

From MIT News
View Full Article

No entries found