Researchers at Microsoft claim to have created a new speech recognition technology that transcribes conversational speech as well as a human does (via The Verge).

The system's word error rate is reportedly 5.9 percent, which is about equal to professional transcribers asked to work on the same recordings, according to Microsoft.

speech recognition team Microsoft

Microsoft researchers from the Speech & Dialog research group (Image: Allison Linn)

"We've reached human parity," said chief speech scientist Xuedong Huang in a statement, calling the milestone "an historic achievement".

To reach the milestone, the team used Microsoft’s Computational Network Toolkit, a homegrown system for deep learning that the research team has made available on GitHub via an open source license. The system uses neural network technology that groups similar words together, which allows the models to generalize efficiently from word to word.

The neural networks draw on large amounts of data called training sets to teach the transcribing computers to recognize syntactical patterns in the sounds. Microsoft plans to use the technology in Cortana, its personal voice assistant in Windows and Xbox One, as well as in speech-to-text transcription software.

But the technology still has a long way to go before it can claim to master meaning (semantics) and contextual awareness - key characteristics of everyday language use that need to be grasped for Siri-like personal assistants to process requests and act upon them in a helpful way.

"We are moving away from a world where people must understand computers to a world in which computers must understand us," said Harry Shum, who heads the Microsoft AI Research group. However it will be a long time before computers can understand the real meaning of what's being said, he cautioned. "True artificial intelligence is still on the distant horizon."

Top Rated Comments

keysofanxiety Avatar
106 months ago
I'm still going to feel awkward as hell talking to an inanimate object.
You should meet my ex-wife.
Score: 33 Votes (Like | Disagree)
fitshaced Avatar
106 months ago
'And we were all like omg, and the machine was like 'I know right?' So then we lolled.'
Score: 21 Votes (Like | Disagree)
2010mini Avatar
106 months ago
You should meet my ex-wife.
You owe me a new keyboard sir. This comment made me do a spit take all over it.:p
Score: 7 Votes (Like | Disagree)
CreatorCode Avatar
106 months ago
The quote I see under Accuracy says --

[...]
DNS is very accurate if you speak clearly and directly, period. Contrary to what its name implies, comma, you cannot just speak naturally, period. You have to dictate specifically to it, period.

New paragraph.

The Microsoft experiment, comma, allegedly, comma, transcribes ordinary recorded speech and dialog without any additional effort on the part of the speaker, period.
Score: 7 Votes (Like | Disagree)
TXCherokee Avatar
106 months ago
Researchers at Microsoft claim to have created
....

....and you can stop reading here. As both an Apple and MS customer, I never believe a word MS says on future products until it hits the market. And then it is usually 1/2 as good with 1/3 of the features as the promises.
Score: 6 Votes (Like | Disagree)
coolfactor Avatar
106 months ago
Properly exciting times.

I remember when I was but a sprog, wide-eyed in wonder, sitting on my Dad's lap as we watched Next Gen. I don't think anybody back then would have imagined technology to be as advanced as it is now.
I'm amazed at how forward-thinking the Star Trek series are. It's literally like looking into the future.

I'm watching the Voyager and Enterprise series again now on Netflix. Never get bored of them. :)
[doublepost=1476889293][/doublepost]
Seems like something Apple should have led the way on?
Hard to figure out Apple these days. They had very accurate speech recognition and speech synthesis (comparatively) back in the early 80s when the Mac first came out. Remember the Talking Moose?

Score: 5 Votes (Like | Disagree)

Popular Stories

New Things Your iPhone Can Do in iOS 18

18 New Things Your iPhone Can Do in iOS 18.2

Wednesday November 13, 2024 2:09 am PST by
Apple is set to release iOS 18.2 next month, bringing the second round of Apple Intelligence features to iPhone 15 Pro and iPhone 16 models. This update brings several major advancements to Apple's AI integration, including completely new image generation tools and a range of Visual Intelligence-based enhancements. There are a handful of new non-AI related feature controls incoming as well....
M4 MacBook Pros Thumb

M4 MacBook Pro Uses Quantum Dot Display Technology

Thursday November 14, 2024 4:19 pm PST by
The M4 MacBook Pro models feature quantum dot display technology, according to display analyst Ross Young. Apple used a quantum dot film instead of a red KSF phosphor film, a change that provides more vibrant, accurate color results. Young says that Apple has opted for KSF for prior MacBook Pro models because it doesn't use toxic element cadmium (typical for quantum dot) and is more...
AirPods Crackling Feature

Apple Customers Sue Over Unfixed AirPods Pro Crackling Issue

Wednesday November 13, 2024 11:01 am PST by
A trio of Apple customers this month filed a class action lawsuit against Apple, accusing the Cupertino company of violating California consumer protection laws and false advertising for continuing to sell AirPods Pro models that had ongoing issues with crackling or static sounds. A few months after the AirPods Pro came out in October 2019, buyers began to complain about crackling, rattling, ...
google gemini

Google Releases Standalone Gemini AI App for iPhone

Thursday November 14, 2024 2:54 am PST by
Google has launched its dedicated Gemini artificial intelligence app for iPhone users, expanding beyond the previous limited integration within the main Google app. The standalone app offers enhanced functionality, including support for Gemini Live and iOS-specific features like Dynamic Island integration. The new app allows iPhone users to interact with Google's AI through text or voice...
maxresdefault

M4 Max MacBook Pro: Real-World Usage Tests

Wednesday November 13, 2024 11:59 am PST by
Apple last week replaced the M3 Max MacBook Pro with the new M4 Max MacBook Pro, and we picked up one of the new high-end MacBook Pro machines to see how it compares to the prior model with both benchmarks and real-world tests. We tested an M4 Max with a 16-core CPU, 40-core GPU, and 48GB RAM against an M3 Max MacBook Pro with similar specs. The two machines look similar, but the display on...
iphone passcode green

iOS 18 Security Feature Causes iPhone to Reboot After Three Days of Inactivity

Thursday November 14, 2024 2:19 pm PST by
With iOS 18, Apple introduced a feature that causes the iPhone to reboot every three days, security researchers have confirmed (via TechCrunch). In a demo video, security researcher Jiska Classen proved that an iPhone left untouched for 72 hours will automatically restart, and Graykey manufacturer also Magnet Forensics wrote a blog post about the feature. After a reboot, an iPhone is more...