Share this article
Latest news
With KB5043178 to Release Preview Channel, Microsoft advises Windows 11 users to plug in when the battery is low
Copilot in Outlook will generate personalized themes for you to customize the app
Microsoft will raise the price of its 365 Suite to include AI capabilities
Death Stranding Director’s Cut is now Xbox X|S at a huge discount
Outlook will let users create custom account icons so they can tell their accounts apart easier
IBM attempts to win back speech recognition crown: your move, Microsoft!
3 min. read
Published onMarch 13, 2017
published onMarch 13, 2017
Share this article
Read our disclosure page to find out how can you help Windows Report sustain the editorial teamRead more
Last year, Microsoft made headlines when it announced thatits speech recognition technology achieved the lowest error rate in the industry: with a 5.9 percent word error rate, the technology giant claimed that it had reached “human parity,” a big milestone for a company betting big onconversation being the next big platform.
Unfortunately for Microsoft, IBM recentlysharedthat its Watson cognitive computing system has since reached new records in speech recognition with a 5.5 percent word error rate on the same SWITCHBOARD Industry corpus that Microsoft used (viaZDNet). In the announcement, the company hinted that Microsoft was misguided to think that its own speech recognition had already reached human parity:
Reaching human parity – meaning an error rate on par with that of two humans speaking – has long been the ultimate industry goal. Others in the industry are chasing this milestone alongside us, and some have recently claimed reaching 5.9 percent as equivalent to human parity…but we’re not popping the champagne yet. As part of our process in reaching today’s milestone, we determined human parity is actually lower than what anyone has yet achieved — at 5.1 percent.
IBM explained that it determined this 5.1 percent with the help of Appen, a global speech and search technology services company which provided guidance on how to reproduce human-level results. “this discovery of human parity at 5.1 percent proved to us we have a way to go before we can claim technology is on par with humans,” shared IBM.
Overall, it’s still not easy to find a standard measurement for human parity across the industry as the SWITCHBOARD corpus is not the only one corpus of linguistic data to use as a reference. IBM explained that it also tested its Watson technology with CallHome, another corpus composed of casual conversations between family members on topics that aren’t fixed in advance. “On this corpus, we achieved a 10.3 percent word error rate – another industry record – but again, with Appen’s help, measured human performance in the same situation to be 6.8 percent,” explained IBM.
It remains to be seen if Microsoft will fight back, but the Redmond giant still seems to have a more viable shot at making voice the new computing interface thanks to Cortana, than IBM’s somewhat smaller distribution efforts. The ubiquitous digital assistant has now been opened todevelopers,car manufacturers, and other device makers, and we expect it to be a major topic during the upcomingBuild 2017developer conference.
Radu Tyrsina
Radu Tyrsina has been a Windows fan ever since he got his first PC, a Pentium III (a monster at that time).
For most of the kids of his age, the Internet was an amazing way to play and communicate with others, but he was deeply impressed by the flow of information and how easily you can find anything on the web.
Prior to founding Windows Report, this particular curiosity about digital content enabled him to grow a number of sites that helped hundreds of millions reach faster the answer they’re looking for.
User forum
0 messages
Sort by:LatestOldestMost Votes
Comment*
Name*
Email*
Commenting as.Not you?
Save information for future comments
Comment
Δ
Radu Tyrsina