Microsoft Hails 'Historic Achievement' in Speech Recognition Technology - MacRumors

Researchers at Microsoft claim to have created a new speech recognition technology that transcribes conversational speech as well as a human does (via The Verge).

According to Microsoft, the system's word error rate is 5.9 percent, about equal to that of professional transcribers asked to work on the same recordings.
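
The article does not say how the word error rate is calculated. As commonly defined, it is the number of word substitutions, deletions, and insertions needed to turn the system's transcript into the reference transcript, divided by the number of words in the reference. The Python sketch below illustrates that standard definition only; it is not Microsoft's evaluation code, and the function name and sample sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are split on whitespace and compared case-insensitively.
    Real evaluations usually apply more careful text normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # matching i reference words against nothing costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# Hypothetical example: one wrong word out of five reference words.
print(word_error_rate("please call me back tomorrow",
                      "please call be back tomorrow"))  # 0.2
```

On this measure, a 5.9 percent error rate corresponds to roughly six word errors per 100 words of reference transcript.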

Microsoft researchers from the Speech & Dialog research group (Image: Allison Linn)

"We've reached human parity," said chief speech scientist Xuedong Huang in a statement, calling the milestone "an historic achievement."

To reach the milestone, the team used Microsoft’s Computational Network Toolkit, a homegrown deep learning system that the research team has made available on GitHub under an open source license. The system uses neural network technology that groups similar words together, which allows the models to generalize efficiently from word to word.

The neural networks draw on large amounts of data called training sets to teach the transcribing computers to recognize syntactical patterns in the sounds. Microsoft plans to use the technology in Cortana, its personal voice assistant in Windows and Xbox One, as well as in speech-to-text transcription software.

But the technology still has a long way to go before it can claim to master meaning (semantics) and contextual awareness - key characteristics of everyday language use that need to be grasped for Siri-like personal assistants to process requests and act upon them in a helpful way.

"We are moving away from a world where people must understand computers to a world in which computers must understand us," said Harry Shum, who heads the Microsoft AI Research group. However, it will be a long time before computers can understand the real meaning of what's being said, he cautioned. "True artificial intelligence is still on the distant horizon."

Top Rated Comments

keysofanxiety
123 months ago
I'm still going to feel awkward as hell talking to an inanimate object.
You should meet my ex-wife.
Score: 33 Votes
fitshaced
123 months ago
'And we were all like omg, and the machine was like 'I know right?' So then we lolled.'
Score: 21 Votes
123 months ago
The quote I see under Accuracy says --

[...]
DNS is very accurate if you speak clearly and directly, period. Contrary to what its name implies, comma, you cannot just speak naturally, period. You have to dictate specifically to it, period.

New paragraph.

The Microsoft experiment, comma, allegedly, comma, transcribes ordinary recorded speech and dialog without any additional effort on the part of the speaker, period.
Score: 7 Votes
123 months ago
You should meet my ex-wife.
You owe me a new keyboard sir. This comment made me do a spit take all over it.:p
Score: 7 Votes
123 months ago
Researchers at Microsoft claim to have created
....

....and you can stop reading here. As both an Apple and MS customer, I never believe a word MS says on future products until it hits the market. And then it is usually 1/2 as good with 1/3 of the features as the promises.
Score: 6 Votes
123 months ago
Properly exciting times.

I remember when I was but a sprog, wide-eyed in wonder, sitting on my Dad's lap as we watched Next Gen. I don't think anybody back then would have imagined technology to be as advanced as it is now.
I'm amazed at how forward-thinking the Star Trek series are. It's literally like looking into the future.

I'm watching the Voyager and Enterprise series again now on Netflix. Never get bored of them. :)
Seems like something Apple should have led the way on?
Hard to figure out Apple these days. They had very accurate speech recognition and speech synthesis (comparatively) back in the early 80s when the Mac first came out. Remember the Talking Moose?

Score: 5 Votes
