Movatterモバイル変換

[0]ホーム

Jump to content

VoiceXML

Edit links

From Wikipedia, the free encyclopedia

Digital document standard

This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "VoiceXML" – news ·newspapers ·books ·scholar ·JSTOR(February 2017) (Learn how and when to remove this message)

VoiceXML (VXML) is a digital document standard for specifying interactive media and voice dialogs between humans and computers. It is used for developing audio and voice response applications, such as banking systems and automated customer service portals. VoiceXML applications are developed and deployed in a manner analogous to how aweb browser interprets and visually renders theHypertext Markup Language (HTML) it receives from aweb server. VoiceXML documents are interpreted by avoice browser and in common deployment architectures, users interact with voice browsers via thepublic switched telephone network (PSTN).

The VoiceXML document format is based onExtensible Markup Language (XML). It is a standard developed by theWorld Wide Web Consortium (W3C).

Usage

[edit]

VoiceXML applications are commonly used in many industries and segments of commerce. These applications include order inquiry, package tracking, driving directions, emergency notification, wake-up, flight tracking, voice access to email, customer relationship management, prescription refilling, audio news magazines, voice dialing, real-estate information and nationaldirectory assistance applications.^{[citation needed]}

VoiceXML has tags that instruct thevoice browser to providespeech synthesis, automaticspeech recognition, dialog management, and audio playback. The following is an example of a VoiceXML document:

<vxmlversion="2.0"xmlns="http://www.w3.org/2001/vxml"><form><block><prompt>Helloworld!</prompt></block></form></vxml>

When interpreted by a VoiceXML interpreter this will output "Hello world" with synthesized speech.

Typically,HTTP is used as the transport protocol for fetching VoiceXML pages. Some applications may use static VoiceXML pages, while others rely on dynamic VoiceXML page generation using anapplication server likeTomcat,Weblogic,IIS, orWebSphere.

Historically, VoiceXML platform vendors have implemented the standard in different ways, and added proprietary features. But the VoiceXML 2.0 standard, adopted as a W3C Recommendation on 16 March 2004, clarified most areas of difference. The VoiceXML Forum, an industry group promoting the use of the standard, provides aconformance testing process that certifies vendors' implementations as conformant.

History

[edit]

AT&T Corporation,IBM,Lucent, andMotorola formed the VoiceXML Forum in March 1999, in order to develop a standard markup language for specifying voice dialogs. By September 1999 the Forum released VoiceXML 0.9 for member comment, and in March 2000 they published VoiceXML 1.0. Soon afterwards, the Forum turned over the control of the standard to the W3C.^[1] The W3C produced several intermediate versions of VoiceXML 2.0, which reached the final "Recommendation" stage in March 2004.^[2]

VoiceXML 2.1 added a relatively small set of additional features to VoiceXML 2.0, based on feedback from implementations of the 2.0 standard. It is backward compatible with VoiceXML 2.0 and reached W3C Recommendation status in June 2007.^[3]

Future versions of the standard

[edit]

VoiceXML 3.0 was slated to be the next major release of VoiceXML, with new major features. However, with the disbanding of the VoiceXML Forum in May 2022,^[4] the development of the new standard was scrapped.

Implementations

[edit]

As of December 2022, there are few VoiceXML 2.0/2.1 platform implementations being offered.

Hewlett-Packard (OCMP)
OnMobile (Ozone Speech Platform)
Alvaria
Avaya (Avaya Experience Portal)
OpenVXI
Cisco
Genesys (company)
Nuance Communications
Phonologies
Plum Voice
Telesoft Technologies

Related standards

[edit]

The W3C's Speech Interface Framework also defines these other standards closely associated with VoiceXML.

SRGS and SISR

[edit]

TheSpeech Recognition Grammar Specification (SRGS) is used to tell the speech recognizer what sentence patterns it should expect to hear: these patterns are called grammars. Once the speech recognizer determines the most likely sentence it heard, it needs to extract the semantic meaning from that sentence and return it to the VoiceXML interpreter. This semantic interpretation is specified via theSemantic Interpretation for Speech Recognition (SISR) standard. SISR is used inside SRGS to specify the semantic results associated with the grammars, i.e., the set of ECMAScript assignments that create the semantic structure returned by the speech recognizer.

SSML

[edit]

TheSpeech Synthesis Markup Language (SSML) is used to decorate textual prompts with information on how best to render them in synthetic speech, for example which speech synthesizer voice to use or when to speak louder or softer.

PLS

[edit]

ThePronunciation Lexicon Specification (PLS) is used to define how words are pronounced. The generated pronunciation information is meant to be used by both speech recognizers and speech synthesizers in voice browsing applications.

CCXML

[edit]

TheCall Control eXtensible Markup Language (CCXML) is a complementary W3C standard. A CCXML interpreter is used on some VoiceXML platforms to handle the initial call setup between the caller and the voice browser, and to provide telephony services like call transfer and disconnect to the voice browser. CCXML can also be used in non-VoiceXML contexts.

MSML, MSCML, MediaCTRL

[edit]

Inmedia server applications, it is often necessary for several call legs to interact with each other, for example in a multi-party conference. Some deficiencies were identified in VoiceXML for this application and so companies designed specific scripting languages to deal with this environment. TheMedia Server Markup Language (MSML) was Convedia's solution, andMedia Server Control Markup Language (MSCML) was Snowshore's solution. Snowshore is now owned by Dialogic and Convedia is now owned by Radisys. These languages also contain 'hooks' so that external scripts (like VoiceXML) can run on call legs whereIVR functionality is required.

There was an IETF working group calledmediactrl ("media control") that was working on a successor for these scripting systems, which it is hoped will progress to an open and widely adopted standard.^[5] The mediactrl working group concluded in 2013.^[6]

References

[edit]

^"Introduction – VoiceXML".Voicexml.org. Retrieved2017-02-23.
^Schwartz, Ephraim (2004-03-17)."W3C recommends VoiceXML 2.0". InfoWorld. Retrieved2017-02-23.
^"Voice Extensible Markup Language (VoiceXML) 2.1".W3.org. Retrieved2017-02-23.
^"VoiceXML Forum Dissolves After Successful Completion of its Mission".voicexml.org. Retrieved2022-05-31.
^"Media Server Control (mediactrl)". Archived fromthe original on 2009-01-30. Retrieved2009-01-18.
^"Media Server Control (Mediactrl) -".
^"OpenVXI".voip-info.org. 2018-07-31. Retrieved2019-06-03.

External links

[edit]

Listen to this article (9 minutes)

This audio file was created from a revision of this article dated 29 October 2011 (2011-10-29), and does not reflect subsequent edits.

(Audio help ·More spoken articles)

W3C's Voice Browser Working Group, Official VoiceXML Standards
VoiceXML Forum, VoiceXML Trademark Holder
VoiceXML tutorials

World Wide Web Consortium (W3C)

Products and
standards

Recommendations	ActivityPub Activity Streams ARIA Canonical XML CDF CSS Animations Flexbox Grid DOM EXI EmotionML Geolocation API HTML HTML5 IndexedDB ITS JSON-LD Linked Data Notifications MathML Micropub OWL PLS RDF Schema RDFa SISR SKOS SMIL SOAP SRGS SRI SSML SVG Filter Effects SCXML SHACL SPARQL Timed text VoiceXML WebAssembly WoT TD Web storage WSDL Webmention WebSub WebVTT WOFF XHTML +RDFa XML Base Encryption Events Information Set Namespace Schema Signature XForms XInclude XLink XOP XPath 2.0 3.x XPointer XProc XQuery XSL XSL-FO XSLT elements
Notes	IndieAuth XAdES XBL XHTML+SMIL XUP
Working drafts	CCXML CURIE EME InkML MSE RIF SMIL Timesheets sXBL WebGPU WebXR XFDL XFrames XMLHttpRequest
Guidelines	Web Content Accessibility Guidelines
Initiative	Markup Validation Service Web Accessibility Initiative Web Components
Deprecated	C-HTML HDML JSSS PGML VML WebPlatform
Obsoleted	P3P XHTML+MathML+SVG

Organizations

WHATWG Defunct:World Wide Web Foundation
Elected groups	AB Board TAG
Working groups	CSS SVG WebAssembly WebAuthn
Community & business groups	Web Advertising BG WebAssembly CG
Closed groups	Device Description (DDWG) HTML Multimodal Interaction Activity (MMI)