ACM

Communications of the ACM

Home/News/Hallucinating to Better Text Translation/Full Text

ACM TechNews

Hallucinating to Better Text Translation

By MIT News
June 9, 2022
Comments

View as: Print Mobile App Share:

In the VALHALLA machine learning model, a trained neural network sees a source sentence in one language, hallucinates an image of what it looks like, and then uses both to translate the sentence into a target language.

Credit: MIT News

Scientists at the Massachusetts Institute of Technology (MIT), the University of California, San Diego, and IBM developed the VALHALLA machine learning method to hallucinate images of written words, using them to translate text into target languages.

The researchers trained a neural network to focus on key words and semantics in a sentence, then used a transformer to create a visual hallucination, and a second transformer to execute multimodal translation using outputs from the first.

Training involves a source sentence paired with a ground-truth image, and the same sentence hallucinated to form a text-image pair.

The team compared VALHALLA to other state-of-the-art multimodal and text-only translation techniques, and quantified its performance across 13 tasks. VALHALLA improved text-only translation, and outperformed other methods as sentences became longer.

From MIT News
View Full Article

No entries found