HEADLINES

AI system reaches human parity in translating news from Chinese to English

Published

March 27, 2018

Xuedong Huang, a technical fellow in charge of Microsoft’s speech, natural language and machine translation efforts. (Photography by Scott Eklund/Red Box Pictures)

A team of Microsoft researchers said Wednesday that they believe they have created the first machine translation system that can translate sentences of news articles from Chinese to English with the same quality and accuracy as a person.

Researchers in the company’s Asia and U.S. labs said that their system achieved human parity on a commonly used test set of news stories, called newstest2017, which was developed by a group of industry and academic partners and released at a research conference called WMT17 last fall. To ensure the results were both accurate and on par with what people would have done, the team hired external bilingual human evaluators, who compared Microsoft’s results to two independently produced human reference translations.

Xuedong Huang, a technical fellow in charge of Microsoft’s speech, natural language and machine translation efforts. (Photography by Scott Eklund/Red Box Pictures)

Xuedong Huang, a technical fellow in charge of Microsoft’s speech, natural language and machine translation efforts, called it a major milestone in one of the most challenging natural language processing tasks.

“Hitting human parity in a machine translation task is a dream that all of us have had,” Huang said. “We just didn’t realize we’d be able to hit it so soon.”

Huang, who also led the group that recently achieved human parity in a conversational speech recognition task, said the translation milestone was especially gratifying because of the possibilities it has for helping people understand each other better.

Advertisement. Scroll to continue reading.

“The pursuit of removing language barriers to help people communicate better is fantastic,” he said. “It’s very, very rewarding.”

Machine translation is a problem researchers have worked on for decades – and, experts say, for much of that time many believed human parity could never be achieved. Still, the researchers cautioned that the milestone does not mean that machine translation is a solved problem.

Arul Menezes, partner research manager of Microsoft’s machine translation team. (Photo by Dan DeLong)

Ming Zhou, assistant managing director of Microsoft Research Asia and head of a natural language processing group that worked on the project, said that the team was thrilled to achieve the human parity milestone on the dataset. But he cautioned that there are still many challenges ahead, such as testing the system on real-time news stories.

Arul Menezes, partner research manager of Microsoft’s machine translation team, said the team set out to prove that its systems could perform about as well as a person when it used a language pair – Chinese and English – for which there is a lot of data, on a test set that includes the more commonplace vocabulary of general interest news stories.

“Given the best-case situation as far as data and availability of resources goes, we wanted to find out if we could actually match the performance of a professional human translator,” said Menezes, who helped lead the project.

Menezes said the research team can apply the technical breakthroughs they made for this achievement to Microsoft’s commercially available translation products in multiple languages. That will pave the way for more accurate and natural-sounding translations across other languages and for texts with more complex or niche vocabulary.

Advertisement. Scroll to continue reading.

DUAL LEARNING, DELIBERATION, JOINT TRAINING AND AGREEMENT REGULARIZATION

Although academic and industry researchers have worked on translation for years, they’ve recently achieved substantial breakthroughs by using a method of training AI systems called deep neural networks. That has allowed them to create more fluent, natural-sounding translations that take into account an even broader context than the previous approach, known as statistical machine translation.

To reach the human parity milestone on this dataset, three research teams in Microsoft’s Beijing and Redmond, Washington, research labs worked together to add a number of other training methods that would make the system more fluent and accurate. In many cases, these new methods mimic how people improve their own work iteratively, by going over it again and again until they get it right.

Tie-Yan Liu, a principal research manager with Microsoft Research Asia in Beijing. (Photo courtesy of Microsoft)

“Much of our research is really inspired by how we humans do things,” said Tie-Yan Liu, a principal research manager with Microsoft Research Asia in Beijing, who leads a machine learning team that worked on this project.

One method they used is dual learning. Think of this as a way of fact-checking the system’s work: Every time they sent a sentence through the system to be translated from Chinese to English, the research team also translated it back from English to Chinese. That’s similar to what people might do to make sure that their automated translations were accurate, and it allowed the system to refine and learn from its own mistakes. Dual learning, which was developed by the Microsoft research team, also can be used to improve results in other AI tasks.

Another method, called deliberation networks, is similar to how people edit and revise their own writing by going through it again and again. The researchers taught the system to repeat the process of translating the same sentence over and over, gradually refining and improving the response.

Advertisement. Scroll to continue reading.

The researchers also developed two new techniques to improve the accuracy of their translations, Zhou said.

One technique, called joint training, was used to iteratively boost the English-to-Chinese and Chinese-to-English translation systems. With this method, the English-to-Chinese translation system translates new English sentences into Chinese in order to obtain new sentence pairs. Those are then used to augment the training dataset that is going in the opposite direction, from Chinese to English. The same procedure is then applied in the other direction. As they converge, the performance of both systems improves.

Another technique is called agreement regularization. With this method, the translation can be generated by having the system read from left to right or from right to left. If these two translation techniques generate the same translation, the result is considered more trustworthy than if they don’t get the same results. The method is used to encourage the systems to generate a consensus translation.

Zhou said he expects these methods and techniques to be useful for improving machine translation in other languages and situations as well. He said they also could be used to make other AI breakthroughs beyond translation.

“This is an area where machine translation research can apply to the whole field of AI research,” he said.

Advertisement. Scroll to continue reading.

NO ‘RIGHT’ ANSWER

The test set the team used to reach the human parity milestone includes about 2,000 sentences from a sample of online newspapers that have been professionally translated.

Microsoft ran multiple evaluation rounds on the test set, randomly selecting hundreds of translations for evaluation each time. To verify that Microsoft’s machine translation was as good as a person’s translation, the company went beyond the specifications of the test set and hired a group of outside bilingual language consultants to compare Microsoft’s results against manually produced human translations.

The method of verifying the results highlights the complexity of teaching systems to translate accurately. With other tasks, such as speech recognition, it’s pretty straightforward to tell if a system is performing as well as a person, because the ideal result will be the exact same for a person and a machine. Researchers call that a pattern recognition task.

With translation, there’s more nuance. Even two fluent human translators might translate the exact same sentence slightly differently, and neither would be wrong. That’s because there’s more than one “right” way to say the same thing.

Advertisement. Scroll to continue reading.

“Machine translation is much more complex than a pure pattern recognition task,” Zhou said. “People can use different words to express the exact same thing, but you cannot necessarily say which one is better.”

The researchers say that complexity is what makes machine translation such a challenging problem, but also such a rewarding one.

Liu said no one knows whether machine translation systems will ever get good enough to translate any text in any language pair with the accuracy and lyricism of a human translator. But, he said, these recent breakthroughs allow the teams to move on to the next big steps toward that goal and other big AI achievements, such as reaching human parity in speech-to-speech translation.

“What we can predict is that definitely we will do better and better,” Liu said.

Advertisement. Scroll to continue reading.

In this article:Artificial Intelligence, machine translation, Microsoft

HEADLINES

Majority of Filipinos believe AI enhances creativity and efficiency for communication, Samsung PH study shows

Creativity and experience is a common AI activity theme among Filipinos with 48% using it for photo editing and 42% for both entertainment and...

Upgrade StaffApril 2, 2025

HEADLINES

AI driving communications revolution but ethical tightrope looms

The future of communications hinges on our ability to responsibly harness artificial intelligence, ensuring it enhances, rather than undermines, the art of strategic communication.

Upgrade StaffApril 2, 2025

HEADLINES

AI revolution is not just about compute — it’s about connectivity, stresses Ciena study

To meet surging AI demands, 43% of new data center facilities are expected to be dedicated to AI workloads. With AI model training and...

Upgrade StaffMarch 31, 2025

HEADLINES

Maya Group Chief Technology Officer Alfred Lo unveils homegrown AI breakthroughs

Maya’s fraud detection approach, Transaction Sequence Embeddings, analyzes and uncovers subtle patterns between transactions —flagging those that resemble fraudulent behavior or deviate from a...

Upgrade StaffMarch 31, 2025

SOFTWARE

Microsoft Copilot updated

With these enhancements, Copilot is now more accessible than ever across Windows 11, macOS, mobile apps, and Telegram. Plus, with improved local interoperability, Copilot...

Upgrade StaffMarch 28, 2025

HEADLINES

Majority of businesses intrigued by potential of AI in achieving sustainability goals, but energy consumption concerns persist

This interest in the potential of AI, cloud computing and other advanced digital technologies to support sustainable development varies across regions, with emerging Asian...

Upgrade StaffMarch 12, 2025

HEADLINES

GenAI capabilities rolled out to 100% of Manulife’s workforce

By democratizing access to AI-enabled solutions, the company is empowering colleagues to harness its potential in their daily workflows, enhancing efficiency and driving innovation....

Upgrade StaffMarch 10, 2025

HEADLINES

HONOR to intro AI that detects face-swaps in MWC 2025

Deepfakes pose significant threats and risks, with nearly half of companies worldwide reporting incidents in 2024, according to industry reports. HONOR’s innovative solution immediately...

Upgrade StaffFebruary 24, 2025

Search UpgradeMag.com

ELECTRONICS

TCL announces QD-Mini LED TV discount

HEADLINES

CIBI Philippines signs MOU with Korean financial services firms for cross-border credit information sharing

HEADLINES

Emirates’ Aircrafted KIDS initiative reaches 700 young students across Asia

HEADLINES

ZTE showcases digital innovations at Philippines Cloud & Datacenter Convention 2025

Phones

HONOR Magic7 Pro arrives in PH

Phones

HONOR and Lazada join forces

White Papers

Nearly 44% of security incidents involved a web browser

HEADLINES

UnionBank and GMG Productions kick off 2025 partnership

HEADLINES

Majority of Filipinos believe AI enhances creativity and efficiency for communication, Samsung PH study shows

HEADLINES

Alibaba Cloud launches Qwen2.5-Omni-7B unified end-to-end multimodal model in Qwen series

ELECTRONICS

Beko launches new line of air conditioners

Phones

Affordable gaming-centric units via nubia Neo 3 series

HEADLINES

GCash Eco Run plants over 76,000 trees

HEADLINES

Damosa Land welcomes CloudStaff to Damosa IT Park, strengthening Davao’s IT-BPO growth

MOTORING

HATASU launches its first 4-wheeler ebike, HATASU Buggy, priced at SRP ₱86,990

HEADLINES

Epson Philippines launches Customer Experience Site

Like Us On Facebook

You May Also Like

HEADLINES

Majority of Filipinos believe AI enhances creativity and efficiency for communication, Samsung PH study shows

HEADLINES

AI driving communications revolution but ethical tightrope looms

HEADLINES

AI revolution is not just about compute — it’s about connectivity, stresses Ciena study

HEADLINES

Maya Group Chief Technology Officer Alfred Lo unveils homegrown AI breakthroughs

SOFTWARE

Microsoft Copilot updated

HEADLINES

Majority of businesses intrigued by potential of AI in achieving sustainability goals, but energy consumption concerns persist

HEADLINES

GenAI capabilities rolled out to 100% of Manulife’s workforce

HEADLINES

HONOR to intro AI that detects face-swaps in MWC 2025