The followingXML specifications (currently in advanced Working draft state) are already addressing various parts of the Core Requirements :
EMMA (Extensible Multi-Modal Annotations): a data exchange format for the interface between input processors and interaction management systems. It will define the means for recognizers to annotate application specific data with information such as confidence scores, time stamps, input mode (e.g. key strokes, speech or pen), alternative recognition hypotheses, and partial recognition results etc.
InkML – anXML language for digital ink traces: an XML data exchange format for ink entered with an electronic pen or stylus as part of a multimodal system.
Multimodal architecture: A loosely coupled architecture for the multimodal interaction framework that focuses on providing a general means for components to communicate with each other, plus basic infrastructure for application control and platform services.
Emotion Markup Language: EmotionML will provide representations of emotions and related states for technological applications.