HTML5 may help Web pages talk, listen
- 08 September, 2010 02:34
- Comments
Sometime in the near future, users might not only read Web pages but hold conversations with them as well, at least if a new activity group in the W3C (World Wide Consortium) bears fruit.
The W3C is investigating the possibility of incorporating voice recognition and speech synthesis interfaces within Web pages. A new incubator group will file a report a year from now summarizing the feasibility of adding voice and speech features into HTML, the W3C's standard for rendering Web pages.
AT&T, Google, Microsoft and the Mozilla Foundation, among others, all have engineers participating in this effort.
The human voice and the Web are not strangers: Google includes a voice-based Web search app in its Android smartphone operating system and Microsoft promises robust voice-driven features in its upcoming Windows Phone 7.
The HTML Speech Incubator Group is studying the feasibility of developing a standard Web interface for both speech recognition and synthesis, said group chair Dan Burnett, who is also director of speech technologies and standards at voice response system provider Voxeo.
Such an interface could be used across multiple browsers. Using built-in or plug-in voice recognition and speech synthesis engines, browsers could read pages aloud or permit users to audibly fill out Web forms.
While this work may overlap with another voice-based W3C effort, VoiceXML, the two efforts are somewhat different, Burnett said. VoiceXML wouldn't work very well for the Web, given that it was primarily designed for voice-driven applications, such as telephone-based voice response systems, where it is used widely. Like HTML itself, the voice capabilities of HTML would be stateless, or not require a dedicated session with the user.
Burnett noted that while the report would discuss the feasibility of establishing a set of interfaces, the work of developing the interfaces themselves, should they be warranted, would be taken on by another W3C group, such as the HTML Working Group.
The W3C has been busy with speech technologies on a number of other fronts as well. The organization also recently released version 3.0 of VoiceXML. In this new version, the working group added semantic descriptions of the features, and organized the functionality into modules.
The W3C also plans to shortly release version 1.1 of SSML (the Speech Synthesis Markup Language) -- often used in conjunction with VoiceXML -- that will incorporate Asian languages, and provide developers more flexibility with voice selection and handling of content in unexpected languages.
Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com
Join the CIO Australia group on LinkedIn. The group is open to CIOs, IT Directors, COOs, CTOs and senior IT managers.
- Bookmark this page
- Share this article
- Got more on this story? Email CIO
- Follow CIO on twitter
- IDC Case Study - EMC IT Increasing Efficiency, Reducing Costs, and Optimising IT with Data Deduplication
- How to Choose an SMB - Unified Communications as a Service (UCAAS) Solution
- There is a HP Printer for everyone
- Why Hackers have Turned to Malicious JavaScript Attacks
- Business Intelligence Best Practices for Dashboard Design
-
Face Time - Interview with John Brennan and Robert DiStefano
-
Monday Grok: Will Siri crack the walls of GOOG?
-
Face Time - Interview with John Brennan and Robert DiStefano
-
Face Time - Interview with John Brennan and Robert DiStefano
-
Phones are distractions during catch-ups
-
Removing BPM Silos to Unleash Process Power - 15 Best Practices for Enterprise BPM
You are about to get a lot smarter about Enterprise Business Process Management (BPM ). T his article is the first in a series of our soon-to-be-published book, “The Intelligent Guide to Enterprise BPM .” So consider this first article your all-important primer. -
Yes. We. Can. Flexible Policy 2.0
Social media may have changed the way we do business, but the rules of engagement are still the same. Dynamic business environments call for flexibility. Context is everything when it comes to deciding what information needs to be blocked or controlled, and when. Read this whitepaper. -
IBM agility@scale™: Become as Agile as You Can Be
In this eBook, Scott Ambler, IBM Rational software's Chief Methodologist for Agile and Lean discusses how IT organisations are finding that agile project teams, as compared to traditional project teams, enjoy higher success rates, deliver higher quality projects, have greater levels of stakeholder satisfaction, provide better return on investment (ROI) and deliver systems to market sooner.
-
Computers for Seniors for Dummies, 2nd Edition
-
MYOB Software for Dummies 6E Australian Edition
-
Windows 7 for Dummies® Dvd+book Bundle
-
Excel 2007 All-In-One Desk Reference for Dummies
-
Office 2007 for Dummies
-
Microsoft Office
-
Windows 7 for Dummies®
-
Teach Yourself Visually Windows 7
-
Office 2007 All-In-One Desk Reference for Dummies








Comments
Post new comment