RFC 9559                     Matroska Format                  March 2024
Lhomme, et al.               Standards Track
Stream:
Internet Engineering Task Force (IETF)
RFC:
9559
Updates:
8794
Category:
Standards Track
Published:
March 2024
ISSN:
2070-1721
Authors:
S. Lhomme
M. Bunkus
D. Rice

RFC 9559

Matroska Media Container Format Specifications

Abstract

This document defines the Matroska audiovisual data container structure, including definitions of its structural elements, terminology, vocabulary, and application.

This document updates RFC 8794 to permit the use of a previously reserved Extensible Binary Meta Language (EBML) Element ID.

Status of This Memo

This is an Internet Standards Track document.

This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Further information on Internet Standards is available in Section 2 of RFC 7841.

Information about the current status of this document, any errata, and how to provide feedback on it may be obtained at https://www.rfc-editor.org/info/rfc9559.

Copyright Notice

Copyright (c) 2024 IETF Trust and the persons identified as the document authors. All rights reserved.

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.

1. Introduction

Matroska is an audiovisual data container format. It was derived from a project called [MCF] but diverges from it significantly because it is based on EBML (Extensible Binary Meta Language) [RFC8794], a binary derivative of XML. EBML provides significant advantages in terms of future format extensibility without breaking file support in parsers reading the previous versions.

First, it is essential to clarify exactly "What an Audio/Video container is", to avoid any misunderstandings:

Matroska is designed with the future in mind. It incorporates features such as:

2. Status of This Document

This document covers Matroska versions 1, 2, 3, and 4. Matroska version 4 is the current version. Matroska 1 to 3 are no longer maintained. No new elements are expected in files with version numbers 1, 2, or 3.

3. Notation and Conventions

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.

This document defines specific terms in order to define the format and application of Matroska:

Matroska:
A multimedia container format based on Extensible Binary Meta Language (EBML).
Matroska Reader:
A data parser that interprets the semantics of a Matroska document and creates a way for programs to use Matroska.
Matroska Player:
A Matroska Reader with the primary purpose of playing audiovisual files, including Matroska documents.
Matroska Writer:
A data writer that creates Matroska documents.

4. Matroska Overview

4.1. Principles

Matroska is a Document Type of EBML. This specification is dependent on the EBML Specification [RFC8794]. For an understanding of Matroska's EBML Schema, see the following sections of [RFC8794] that cover EBML Element Types (Section 7 of [RFC8794]), EBML Schema (Section 11.1 of [RFC8794]), and EBML Structure (Section 3 of [RFC8794]).

4.2. Updates to RFC 8794

Because of an oversight, [RFC8794] reserved EBML ID 0x80, which is used by deployed Matroska implementations. For this reason, this specification updates [RFC8794] to make 0x80 a legal EBML ID. Additionally, this specification makes the following updates:

  • Section 17.1 of [RFC8794] (per Errata 7189 [Err7189])

    OLD:

    One-octet Element IDs MUST be between 0x81 and 0xFE. These items are valuable because they are short, and they need to be used for commonly repeated elements. Element IDs are to be allocated within this range according to the "RFC Required" policy [RFC8126].

    The following one-octet Element IDs are RESERVED: 0xFF and 0x80.

    NEW:

    One-octet Element IDs MUST be between 0x80 and 0xFE. These items are valuable because they are short, and they need to be used for commonly repeated elements. Element IDs are to be allocated within this range according to the "RFC Required" policy [RFC8126].

    The following one-octet Element ID is RESERVED: 0xFF.

  • Section 5 of [RFC8794] (per Errata 7191 [Err7191])

    OLD:

    +=========================+================+=================+
    | Element ID Octet Length | Range of Valid | Number of Valid |
    |                         |  Element IDs   |     Element IDs |
    +=========================+================+=================+
    |            1            |  0x81 - 0xFE   |             126 |
    +-------------------------+----------------+-----------------+

    NEW:

    +=========================+================+=================+
    | Element ID Octet Length | Range of Valid | Number of Valid |
    |                         |  Element IDs   |     Element IDs |
    +=========================+================+=================+
    |            1            |  0x80 - 0xFE   |             127 |
    +-------------------------+----------------+-----------------+
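The updated range can be checked mechanically. The following Python sketch is illustrative only; the function name is_valid_one_octet_id is hypothetical and is not defined by this document or by [RFC8794].

```python
def is_valid_one_octet_id(element_id: int) -> bool:
    """Return True if element_id is a legal one-octet EBML Element ID
    under the NEW text above (0x80 through 0xFE; 0xFF stays reserved)."""
    return 0x80 <= element_id <= 0xFE


# Count the legal one-octet IDs; the NEW table lists 127 of them.
valid_count = sum(is_valid_one_octet_id(i) for i in range(0x100))
```

Under the OLD text the lower bound would have been 0x81, which excludes the 0x80 ID used by deployed implementations.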

4.3. Added EBML Constraints

As an EBML Document Type, Matroska adds the following constraints to the EBML specification:

  • The docType of the EBML Header MUST be "matroska".
  • The EBMLMaxIDLength of the EBML Header MUST be 4.
  • The EBMLMaxSizeLength of the EBML Header MUST be between 1 and 8 inclusive.
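As an illustration, the three constraints above could be checked as follows once the EBML Header values have been parsed. This is a minimal sketch: the function name and its parameters are hypothetical, and parsing of the header itself is assumed to have happened elsewhere.

```python
def check_matroska_header(doc_type: str,
                          max_id_length: int,
                          max_size_length: int) -> list[str]:
    """Return a list of constraint violations (empty if the header conforms)."""
    errors = []
    if doc_type != "matroska":
        errors.append('docType MUST be "matroska"')
    if max_id_length != 4:
        errors.append("EBMLMaxIDLength MUST be 4")
    if not 1 <= max_size_length <= 8:
        errors.append("EBMLMaxSizeLength MUST be between 1 and 8 inclusive")
    return errors
```

Returning a list rather than raising on the first problem lets a validator report every violation in one pass.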

4.4. Design Rules

The Root Element and all Top-Level Elements MUST use 4 octets for their EBML Element ID -- i.e., Segment and direct children of Segment.
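The rule can be illustrated with the Element IDs listed in Section 5.1. The sketch below assumes that an Element ID is held as an integer that includes its leading marker bits, so its octet length equals the number of bytes needed to store it; the helper name is hypothetical.

```python
# Element IDs taken from the Segment, SeekHead, Info, and Cluster definitions.
TOP_LEVEL_IDS = {
    "Segment": 0x18538067,
    "SeekHead": 0x114D9B74,
    "Info": 0x1549A966,
    "Cluster": 0x1F43B675,
}


def id_octet_length(element_id: int) -> int:
    """Number of octets needed to store an Element ID given as an integer."""
    return (element_id.bit_length() + 7) // 8


# True when every ID above uses 4 octets, as the design rule requires.
all_four_octets = all(id_octet_length(v) == 4 for v in TOP_LEVEL_IDS.values())
```

By contrast, lower-level elements may use shorter IDs, for example 0x4DBB (Seek, 2 octets) or 0xE7 (Timestamp, 1 octet).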

Legacy EBML / Matroska parsers did not handle Empty Elements properly. Elements were present in the file, but had a length of 0. They always assumed the value was 0 for integers/dates or 0x0p+0, the textual expression of floats using the [ISO9899] format, no matter the default value of the element which should have been used instead. Therefore, Matroska Writers MUST NOT use EBML Empty Elements, if the element has a default value that is not 0 for integers/dates and 0x0p+0 for floats.

When adding new elements to Matroska, these rules apply:

  • A non-mandatory integer/date Element MUST NOT have a default value other than 0.
  • A non-mandatory float Element MUST NOT have a default value other than 0x0p+0.
  • A non-mandatory string Element MUST NOT have a default value, as empty strings cannot be defined in the XML Schema.
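A Matroska Writer could apply the Empty Element restriction with a check along these lines. This is a sketch under stated assumptions: the function name and the type labels are hypothetical, and only the numeric and date types named in the rule are covered.

```python
from typing import Optional

# Element types to which the Empty Element rule above refers.
NUMERIC_TYPES = {"integer", "uinteger", "date", "float"}


def may_write_empty_element(element_type: str,
                            default: Optional[float]) -> bool:
    """Return True if a writer may store the element with a length of 0.

    An element without a default value is not restricted by this rule.
    """
    if element_type not in NUMERIC_TYPES:
        raise ValueError("rule only covers integer, date, and float elements")
    if default is None:
        return True
    # 0 for integers/dates; 0x0p+0 (that is, 0.0) for floats.
    return default == 0
```

For example, TimestampScale has a default of 1000000, so a writer must not store it as an Empty Element, whereas ReferencePriority has a default of 0.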

4.5. Data Layout

A Matroska file MUST be composed of at least one EBML Document using the Matroska Document Type. Each EBML Document MUST start with an EBML Header and MUST be followed by the EBML Root Element, defined as a Segment in Matroska. Matroska defines several Top-Level Elements that may occur within the Segment.

As an example, a simple Matroska file consisting of a single EBML Document could be represented like this:

  • EBML Header
  • Segment

A more complex Matroska file consisting of an EBML Stream (containing two EBML Documents) could be represented like this:

  • EBML Header
  • Segment
  • EBML Header
  • Segment

The following diagram represents a simple Matroska file, comprised of an EBML Document with an EBML Header, a Segment Element (the Root Element), and all eight Matroska Top-Level Elements. In the following diagrams of this section, horizontal spacing expresses a parent-child relationship between Matroska Elements (e.g., the Info Element is contained within the Segment Element), whereas vertical alignment represents the storage order within the file.

+-------------+
| EBML Header |
+---------------------------+
| Segment     | SeekHead    |
|             |-------------|
|             | Info        |
|             |-------------|
|             | Tracks      |
|             |-------------|
|             | Chapters    |
|             |-------------|
|             | Cluster     |
|             |-------------|
|             | Cues        |
|             |-------------|
|             | Attachments |
|             |-------------|
|             | Tags        |
+---------------------------+
Figure 1: Basic Layout of a Matroska File

The Matroska EBML Schema defines eight Top-Level Elements:

The SeekHead Element (also known as MetaSeek) contains an index of Top-Level Elements locations within the Segment. Use of the SeekHead Element is RECOMMENDED. Without a SeekHead Element, a Matroska parser would have to search the entire file to find all of the other Top-Level Elements. This is due to Matroska's flexible ordering requirements; for instance, it is acceptable for the Chapters Element to be stored after the Cluster Element.

+--------------------------------+
| SeekHead | Seek | SeekID       |
|          |      |--------------|
|          |      | SeekPosition |
+--------------------------------+
Figure 2: Representation of a SeekHead Element

The Info Element contains vital information for identifying the whole Segment. This includes the title for the Segment, a randomly generated unique identifier (UID), and the UID(s) of any linked Segment Elements.

+-------------------------+
| Info | SegmentUUID      |
|      |------------------|
|      | SegmentFilename  |
|      |------------------|
|      | PrevUUID         |
|      |------------------|
|      | PrevFilename     |
|      |------------------|
|      | NextUUID         |
|      |------------------|
|      | NextFilename     |
|      |------------------|
|      | SegmentFamily    |
|      |------------------|
|      | ChapterTranslate |
|      |------------------|
|      | TimestampScale   |
|      |------------------|
|      | Duration         |
|      |------------------|
|      | DateUTC          |
|      |------------------|
|      | Title            |
|      |------------------|
|      | MuxingApp        |
|      |------------------|
|      | WritingApp       |
|-------------------------|
Figure 3: Representation of an Info Element and Its Child Elements

The Tracks Element defines the technical details for each track and can store the name, number, UID, language, and type (audio, video, subtitles, etc.) of each track. For example, the Tracks Element MAY store information about the resolution of a video track or a sample rate of an audio track.

The Tracks Element MUST identify all the data needed by the codec to decode the data of the specified track. However, the data required is contingent on the codec used for the track. For example, a Track Element for uncompressed audio only requires the audio bit rate to be present. A codec such as AC-3 would require that the CodecID Element be present for all tracks, as it is the primary way to identify which codec to use to decode the track.

+------------------------------------+
| Tracks | TrackEntry | TrackNumber  |
|        |            |--------------|
|        |            | TrackUID     |
|        |            |--------------|
|        |            | TrackType    |
|        |            |--------------|
|        |            | Name         |
|        |            |--------------|
|        |            | Language     |
|        |            |--------------|
|        |            | CodecID      |
|        |            |--------------|
|        |            | CodecPrivate |
|        |            |--------------|
|        |            | CodecName    |
|        |            |----------------------------------+
|        |            | Video        | FlagInterlaced    |
|        |            |              |-------------------|
|        |            |              | FieldOrder        |
|        |            |              |-------------------|
|        |            |              | StereoMode        |
|        |            |              |-------------------|
|        |            |              | AlphaMode         |
|        |            |              |-------------------|
|        |            |              | PixelWidth        |
|        |            |              |-------------------|
|        |            |              | PixelHeight       |
|        |            |              |-------------------|
|        |            |              | DisplayWidth      |
|        |            |              |-------------------|
|        |            |              | DisplayHeight     |
|        |            |              |-------------------|
|        |            |              | AspectRatioType   |
|        |            |              |-------------------|
|        |            |              | Color             |
|        |            |----------------------------------|
|        |            | Audio        | SamplingFrequency |
|        |            |              |-------------------|
|        |            |              | Channels          |
|        |            |              |-------------------|
|        |            |              | BitDepth          |
|--------------------------------------------------------|
Figure 4: Representation of the Tracks Element and a Selection of Its Descendant Elements

The Chapters Element lists all of the chapters. Chapters set predefined points to jump to in video or audio.

+-----------------------------------------+
| Chapters | Edition | EditionUID         |
|          | Entry   |--------------------|
|          |         | EditionFlagDefault |
|          |         |--------------------|
|          |         | EditionFlagOrdered |
|          |         |---------------------------------+
|          |         | ChapterAtom | ChapterUID        |
|          |         |             |-------------------|
|          |         |             | ChapterStringUID  |
|          |         |             |-------------------|
|          |         |             | ChapterTimeStart  |
|          |         |             |-------------------|
|          |         |             | ChapterTimeEnd    |
|          |         |             |-------------------|
|          |         |             | ChapterFlagHidden |
|          |         |             |-------------------------------+
|          |         |             | ChapterDisplay | ChapString   |
|          |         |             |                |--------------|
|          |         |             |                | ChapLanguage |
+------------------------------------------------------------------+
Figure 5: Representation of the Chapters Element and a Selection of Its Descendant Elements

Cluster Elements contain the content for each track, e.g., video frames. A Matroska file SHOULD contain at least one Cluster Element. In the rare case it doesn't, there should be a form of Segment linking with other Segments, possibly using Chapters; see Section 17.

The Cluster Element helps to break up SimpleBlock or BlockGroup Elements and helps with seeking and error protection. Every Cluster Element MUST contain a Timestamp Element. This SHOULD be the Timestamp Element used to play the first Block in the Cluster Element, unless a different value is needed to accommodate for more Blocks; see Section 11.2.

Cluster Elements contain one or more block elements, such as BlockGroup or SimpleBlock elements. In some situations, a Cluster Element MAY contain no block element, e.g., in a live recording when no data has been collected.

A BlockGroup Element MAY contain a Block of data and any information relating directly to that Block.

+--------------------------+
| Cluster | Timestamp      |
|         |----------------|
|         | Position       |
|         |----------------|
|         | PrevSize       |
|         |----------------|
|         | SimpleBlock    |
|         |----------------|
|         | BlockGroup     |
+--------------------------+
Figure 6: Representation of a Cluster Element and Its Immediate Child Elements

+----------------------------------+
| Block | Portion of | Data Type   |
|       | a Block    |  - Bit Flag |
|       |--------------------------+
|       | Header     | TrackNumber |
|       |            |-------------|
|       |            | Timestamp   |
|       |            |-------------|
|       |            | Flags       |
|       |            |  - Gap      |
|       |            |  - Lacing   |
|       |            |  - Reserved |
|       |--------------------------|
|       | Optional   | FrameSize   |
|       |--------------------------|
|       | Data       | Frame       |
+----------------------------------+
Figure 7: Representation of the Block Element Structure

Each Cluster MUST contain exactly one Timestamp Element. The Timestamp Element value MUST be stored once per Cluster. The Timestamp Element in the Cluster is relative to the entire Segment. The Timestamp Element SHOULD be the first Element in the Cluster it belongs to, or the second Element if that Cluster contains a CRC-32 element (Section 6.2).

Additionally, the Block contains an offset that, when added to the Cluster's Timestamp Element value, yields the Block's effective timestamp. Therefore, the timestamp in the Block itself is relative to the Timestamp Element in the Cluster. For example, if the Timestamp Element in the Cluster is set to 10 seconds and a Block in that Cluster is supposed to be played 12 seconds into the clip, the timestamp in the Block would be set to 2 seconds.
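The arithmetic in this example can be written out as a short Python sketch. The function names are hypothetical, and both values are assumed to be expressed in the same unit (seconds here, to match the example).

```python
def block_effective_timestamp(cluster_timestamp: int, block_offset: int) -> int:
    """Effective timestamp of a Block: Cluster Timestamp plus the Block's offset."""
    return cluster_timestamp + block_offset


def block_offset_for(cluster_timestamp: int, effective_timestamp: int) -> int:
    """Offset a writer would store in the Block for a desired effective timestamp."""
    return effective_timestamp - cluster_timestamp


# Values from the example above: Cluster at 10 s, Block played at 12 s.
offset = block_offset_for(10, 12)
effective = block_effective_timestamp(10, offset)
```

A reader performs the addition; a writer performs the subtraction when deciding what to store in the Block.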

The ReferenceBlock in the BlockGroup is used instead of the basic "P-frame"/"B-frame" description. Instead of simply saying that this Block depends on the Block directly before or directly after, the Timestamp of the necessary Block is used. Because there can be as many ReferenceBlock Elements as necessary for a Block, it allows for some extremely complex referencing.

The Cues Element is used to seek when playing back a file by providing a temporal index for some of the Tracks. It is similar to the SeekHead Element, but is used for seeking to a specific time when playing back the file. It is possible to seek without this element, but it is much more difficult because a Matroska Reader would have to "hunt and peck" through the file to look for the correct timestamp.

The Cues Element SHOULD contain at least one CuePoint Element. Each CuePoint Element stores the position of the Cluster that contains the BlockGroup or SimpleBlock Element. The timestamp is stored in the CueTime Element and the location is stored in the CueTrackPositions Element.

The Cues Element is flexible. For instance, the Cues Element can be used to index every single timestamp of every Block or they can be indexed selectively.

+-------------------------------------+
| Cues | CuePoint | CueTime           |
|      |          |-------------------|
|      |          | CueTrackPositions |
|      |------------------------------|
|      | CuePoint | CueTime           |
|      |          |-------------------|
|      |          | CueTrackPositions |
+-------------------------------------+
Figure 8: Representation of a Cues Element and Two Levels of Its Descendant Elements
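One way a Matroska Reader might use such an index is to look up the last CuePoint at or before the requested time and start reading from the Cluster it points to. The sketch below is an assumption about reader design, not a procedure defined by this document; the CuePoint tuple and find_cue function are hypothetical, and the list is assumed to be sorted by CueTime.

```python
import bisect
from typing import NamedTuple, Optional


class CuePoint(NamedTuple):
    cue_time: int          # value of the CueTime Element
    cluster_position: int  # Cluster location taken from CueTrackPositions


def find_cue(cues: list[CuePoint], target_time: int) -> Optional[CuePoint]:
    """Return the last CuePoint whose CueTime is <= target_time, if any."""
    times = [cue.cue_time for cue in cues]
    index = bisect.bisect_right(times, target_time) - 1
    return cues[index] if index >= 0 else None
```

When no CuePoint precedes the target (or the Cues Element is absent), a reader would have to fall back to scanning Clusters, as described above.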

The Attachments Element is for attaching files to a Matroska file, such as pictures, fonts, web pages, etc.

+------------------------------------------------+
| Attachments | AttachedFile | FileDescription   |
|             |              |-------------------|
|             |              | FileName          |
|             |              |-------------------|
|             |              | FileMediaType     |
|             |              |-------------------|
|             |              | FileData          |
|             |              |-------------------|
|             |              | FileUID           |
|             |              |-------------------|
|             |              | FileName          |
|             |              |-------------------|
|             |              | FileReferral      |
|             |              |-------------------|
|             |              | FileUsedStartTime |
|             |              |-------------------|
|             |              | FileUsedEndTime   |
+------------------------------------------------+
Figure 9: Representation of an Attachments Element

The Tags Element contains metadata that describes the Segment and potentially its Tracks, Chapters, and Attachments. Each Track or Chapter that those tags apply to has its UID listed in the Tags. The Tags contain all extra information about the file: scriptwriters, singers, actors, directors, titles, edition, price, dates, genre, comments, etc. Tags can contain their values in multiple languages. For example, a movie's "title" Tag might contain both the original English title as well as the German title.

+-------------------------------------------+
| Tags | Tag | Targets   | TargetTypeValue  |
|      |     |           |------------------|
|      |     |           | TargetType       |
|      |     |           |------------------|
|      |     |           | TagTrackUID      |
|      |     |           |------------------|
|      |     |           | TagEditionUID    |
|      |     |           |------------------|
|      |     |           | TagChapterUID    |
|      |     |           |------------------|
|      |     |           | TagAttachmentUID |
|      |     |------------------------------|
|      |     | SimpleTag | TagName          |
|      |     |           |------------------|
|      |     |           | TagLanguage      |
|      |     |           |------------------|
|      |     |           | TagDefault       |
|      |     |           |------------------|
|      |     |           | TagString        |
|      |     |           |------------------|
|      |     |           | TagBinary        |
|      |     |           |------------------|
|      |     |           | SimpleTag        |
+-------------------------------------------+
Figure 10: Representation of a Tags Element and Three Levels of Its Children Elements

5. Matroska Schema

This specification includes an EBML Schema that defines the Elements and structure of Matroska using the EBML Schema elements and attributes defined in Section 11.1 of [RFC8794]. The EBML Schema defines every valid Matroska element in a manner defined by the EBML specification.

Attributes using their default value, such as minOccurs, minver, etc., or attributes with undefined values, such as length, maxver, etc., are omitted.

The definition of each Matroska Element is provided below.

5.1. Segment Element

id / type:
0x18538067 / master
unknownsizeallowed:
True
path:
\Segment
minOccurs / maxOccurs:
1 / 1
definition:
The Root Element that contains all other Top-Level Elements; see Section 4.5.

5.1.1. SeekHead Element

id / type:
0x114D9B74 / master
path:
\Segment\SeekHead
maxOccurs:
2
definition:
Contains seeking information of Top-Level Elements; see Section 4.5.
5.1.1.1. Seek Element
id / type:
0x4DBB / master
path:
\Segment\SeekHead\Seek
minOccurs:
1
definition:
Contains a single seek entry to an EBML Element.
5.1.1.1.1. SeekID Element
id / type:
0x53AB / binary
length:
4
path:
\Segment\SeekHead\Seek\SeekID
minOccurs / maxOccurs:
1 / 1
definition:
The binary EBML ID of a Top-Level Element.
5.1.1.1.2. SeekPosition Element
id / type:
0x53AC / uinteger
path:
\Segment\SeekHead\Seek\SeekPosition
minOccurs / maxOccurs:
1 / 1
definition:
The Segment Position (Section 16) of a Top-Level Element.
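Once the Seek entries have been parsed, a reader might gather them into a lookup table keyed by element name. The following sketch assumes each entry is available as a (SeekID, SeekPosition) pair; the function name and the table of known IDs (limited here to IDs that appear in this section) are hypothetical.

```python
# 4-octet Element IDs taken from the definitions in Section 5.1.
KNOWN_TOP_LEVEL_IDS = {
    bytes.fromhex("114D9B74"): "SeekHead",
    bytes.fromhex("1549A966"): "Info",
    bytes.fromhex("1F43B675"): "Cluster",
}


def build_seek_index(seek_entries):
    """Map element names to the list of Segment Positions found in Seek entries.

    seek_entries is an iterable of (seek_id: bytes, seek_position: int).
    IDs not in the table are kept under their hexadecimal spelling.
    """
    index = {}
    for seek_id, seek_position in seek_entries:
        if len(seek_id) != 4:
            raise ValueError("SeekID has a fixed length of 4 octets")
        name = KNOWN_TOP_LEVEL_IDS.get(seek_id, "0x" + seek_id.hex().upper())
        index.setdefault(name, []).append(seek_position)
    return index
```

Positions are kept as lists because the same Top-Level Element may be referenced more than once.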

5.1.2. Info Element

id / type:
0x1549A966 / master
path:
\Segment\Info
minOccurs / maxOccurs:
1 / 1
recurring:
True
definition:
Contains general information about the Segment.
5.1.2.1. SegmentUUID Element
id / type:
0x73A4 / binary
length:
16
path:
\Segment\Info\SegmentUUID
maxOccurs:
1
definition:
A randomly generated UID that identifies the Segment amongst many others (128 bits). It is equivalent to a Universally Unique Identifier (UUID) v4 [RFC4122] with all bits randomly (or pseudorandomly) chosen. An actual UUID v4 value, where some bits are not random, MAY also be used.
usage notes:
If the Segment is a part of a Linked Segment, then this Element is REQUIRED. The value of the UID MUST contain at least one bit set to 1.
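A writer could produce such a value as in the sketch below, which draws 128 random bits and rejects the all-zero value. The function name is hypothetical; secrets.token_bytes is part of the Python standard library.

```python
import secrets


def new_segment_uuid() -> bytes:
    """Return 16 random octets with at least one bit set to 1."""
    while True:
        uid = secrets.token_bytes(16)
        if any(uid):  # reject the (astronomically unlikely) all-zero value
            return uid
```

The same approach would fit other 128-bit random UIDs in this section, such as SegmentFamily.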
5.1.2.2. SegmentFilename Element
id / type:
0x7384 / utf-8
path:
\Segment\Info\SegmentFilename
maxOccurs:
1
definition:
A filename corresponding to this Segment.
5.1.2.3. PrevUUID Element
id / type:
0x3CB923 / binary
length:
16
path:
\Segment\Info\PrevUUID
maxOccurs:
1
definition:
An ID that identifies the previous Segment of a Linked Segment.
usage notes:
If the Segment is a part of a Linked Segment that uses Hard Linking (Section 17.1), then either the PrevUUID or the NextUUID Element is REQUIRED. If a Segment contains a PrevUUID, but not a NextUUID, then it MAY be considered as the last Segment of the Linked Segment. The PrevUUID MUST NOT be equal to the SegmentUUID.
5.1.2.4. PrevFilename Element
id / type:
0x3C83AB / utf-8
path:
\Segment\Info\PrevFilename
maxOccurs:
1
definition:
A filename corresponding to the file of the previous Linked Segment.
usage notes:
Provision of the previous filename is for display convenience, but PrevUUID SHOULD be considered authoritative for identifying the previous Segment in a Linked Segment.
5.1.2.5. NextUUID Element
id / type:
0x3EB923 / binary
length:
16
path:
\Segment\Info\NextUUID
maxOccurs:
1
definition:
An ID that identifies the next Segment of a Linked Segment.
usage notes:
If the Segment is a part of a Linked Segment that uses Hard Linking (Section 17.1), then either the PrevUUID or the NextUUID Element is REQUIRED. If a Segment contains a NextUUID, but not a PrevUUID, then it MAY be considered as the first Segment of the Linked Segment. The NextUUID MUST NOT be equal to the SegmentUUID.
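The usage notes for SegmentUUID, PrevUUID, and NextUUID can be combined into a single check. The sketch below assumes the Segment is known to be part of a Linked Segment that uses Hard Linking; the function name is hypothetical, and None stands for an absent Element.

```python
from typing import Optional


def check_hard_link_uuids(segment_uuid: Optional[bytes],
                          prev_uuid: Optional[bytes],
                          next_uuid: Optional[bytes]) -> list[str]:
    """Return the usage-note violations for a Hard Linked Segment."""
    errors = []
    if segment_uuid is None:
        errors.append("SegmentUUID is REQUIRED in a Linked Segment")
    if prev_uuid is None and next_uuid is None:
        errors.append("either PrevUUID or NextUUID is REQUIRED")
    if prev_uuid is not None and prev_uuid == segment_uuid:
        errors.append("PrevUUID MUST NOT be equal to the SegmentUUID")
    if next_uuid is not None and next_uuid == segment_uuid:
        errors.append("NextUUID MUST NOT be equal to the SegmentUUID")
    return errors
```

A Segment with only a NextUUID passes this check and may be treated as the first Segment of the Linked Segment.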
5.1.2.6. NextFilename Element
id / type:
0x3E83BB / utf-8
path:
\Segment\Info\NextFilename
maxOccurs:
1
definition:
A filename corresponding to the file of the next Linked Segment.
usage notes:
Provision of the next filename is for display convenience, but NextUUID SHOULD be considered authoritative for identifying the Next Segment.
5.1.2.7. SegmentFamily Element
id / type:
0x4444 / binary
length:
16
path:
\Segment\Info\SegmentFamily
definition:
A UID that all Segments of a Linked Segment MUST share (128 bits). It is equivalent to a UUID v4 [RFC4122] with all bits randomly (or pseudorandomly) chosen. An actual UUID v4 value, where some bits are not random, MAY also be used.
usage notes:
If the Segment Info contains a ChapterTranslate element, this Element is REQUIRED.
5.1.2.8. ChapterTranslate Element
id / type:
0x6924 / master
path:
\Segment\Info\ChapterTranslate
definition:
The mapping between this Segment and a Segment value in the given Chapter Codec.
rationale:
A Chapter Codec may need to address different Segments, but it may not know how to identify such Segments when stored in Matroska. This element and its child elements add a way to map the internal Segments known to the Chapter Codec to the Segment IDs in Matroska. This allows remuxing a file with a Chapter Codec without changing the content of the codec data, changing just the Segment mapping.
5.1.2.8.1. ChapterTranslateID Element
id / type:
0x69A5 / binary
path:
\Segment\Info\ChapterTranslate\ChapterTranslateID
minOccurs / maxOccurs:
1 / 1
definition:
The binary value used to represent this Segment in the chapter codec data. The format depends on the ChapProcessCodecID used; see Section 5.1.7.1.4.15.
5.1.2.8.2. ChapterTranslateCodec Element
id / type:
0x69BF / uinteger
path:
\Segment\Info\ChapterTranslate\ChapterTranslateCodec
minOccurs / maxOccurs:
1 / 1
definition:
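This ChapterTranslate applies to this chapter codec of the given chapter edition(s); see Section 5.1.7.1.4.15.
defined values:
Table 1: ChapterTranslateCodec Values
+=======+=================+===================================================+
| value | label           | definition                                        |
+=======+=================+===================================================+
| 0     | Matroska Script | Chapter commands using the Matroska Script codec. |
+-------+-----------------+---------------------------------------------------+
| 1     | DVD-menu        | Chapter commands using the DVD-like codec.        |
+-------+-----------------+---------------------------------------------------+
5.1.2.8.3. ChapterTranslateEditionUID Element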
id / type:
0x69FC / uinteger
path:
\Segment\Info\ChapterTranslate\ChapterTranslateEditionUID
definition:
Specify a chapter edition UID on which this ChapterTranslate applies.
usage notes:
When no ChapterTranslateEditionUID is specified in the ChapterTranslate, the ChapterTranslate applies to all chapter editions found in the Segment using the given ChapterTranslateCodec.
5.1.2.9. TimestampScale Element
id / type / default:
0x2AD7B1 / uinteger / 1000000
range:
not 0
path:
\Segment\Info\TimestampScale
minOccurs / maxOccurs:
1 / 1
definition:
Base unit for Segment Ticks and Track Ticks in nanoseconds. A TimestampScale value of 1000000 means scaled timestamps in the Segment are expressed in milliseconds; see Section 11 on how to interpret timestamps.
5.1.2.10. Duration Element
id / type:
0x4489 / float
range:
> 0x0p+0
path:
\Segment\Info\Duration
maxOccurs:
1
definition:
Duration of the Segment expressed in Segment Ticks, which are based on TimestampScale; see Section 11.1.
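As a worked illustration of the two definitions above, a value in Segment Ticks can be converted to seconds by multiplying it by TimestampScale (nanoseconds per tick) and dividing by one billion. This is a sketch of that reading only; Section 11 remains the authoritative description, and the function name is hypothetical.

```python
NANOSECONDS_PER_SECOND = 1_000_000_000
DEFAULT_TIMESTAMP_SCALE = 1_000_000  # default listed for TimestampScale


def segment_ticks_to_seconds(ticks: float,
                             timestamp_scale: int = DEFAULT_TIMESTAMP_SCALE) -> float:
    """Convert a value in Segment Ticks (e.g., Duration) to seconds."""
    if timestamp_scale == 0:
        raise ValueError("TimestampScale must not be 0")
    return ticks * timestamp_scale / NANOSECONDS_PER_SECOND


# With the default scale, one tick is one millisecond,
# so a Duration of 90000.0 corresponds to 90 seconds.
duration_seconds = segment_ticks_to_seconds(90000.0)
```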
5.1.2.11. DateUTC Element
id / type:
0x4461 / date
path:
\Segment\Info\DateUTC
maxOccurs:
1
definition:
The date and time that the Segment was created by the muxing application or library.
5.1.2.12. Title Element
id / type:
0x7BA9 / utf-8
path:
\Segment\Info\Title
maxOccurs:
1
definition:
General name of the Segment.
5.1.2.13. MuxingApp Element
id / type:
0x4D80 / utf-8
path:
\Segment\Info\MuxingApp
minOccurs / maxOccurs:
1 / 1
definition:
Muxing application or library (example: "libmatroska-0.4.3").
usage notes:
Include the full name of the application or library followed by the version number.
5.1.2.14. WritingApp Element
id / type:
0x5741 / utf-8
path:
\Segment\Info\WritingApp
minOccurs / maxOccurs:
1 / 1
definition:
Writing application (example: "mkvmerge-0.3.3").
usage notes:
Include the full name of the application followed by the version number.

5.1.3. Cluster Element

id / type:
0x1F43B675 / master
unknownsizeallowed:
True
path:
\Segment\Cluster
definition:
The Top-Level Element containing the (monolithic) Block structure.
5.1.3.1. Timestamp Element
id / type:
0xE7 / uinteger
path:
\Segment\Cluster\Timestamp
minOccurs / maxOccurs:
1 / 1
definition:
Absolute timestamp of the cluster expressed in Segment Ticks, which are based on TimestampScale; see Section 11.1.
usage notes:
This element SHOULD be the first child element of the Cluster it belongs to or the second child element if that Cluster contains a CRC-32 element (Section 6.2).
5.1.3.2. Position Element
id / type:
0xA7 / uinteger
path:
\Segment\Cluster\Position
maxOccurs:
1
maxver:
4
definition:
The Segment Position of the Cluster in the Segment (0 in live streams). It might help to resynchronise the offset on damaged streams.
5.1.3.3. PrevSize Element
id / type:
0xAB / uinteger
path:
\Segment\Cluster\PrevSize
maxOccurs:
1
definition:
Size of the previous Cluster in octets. Can be useful for backward playing.
5.1.3.4. SimpleBlock Element
id / type:
0xA3 / binary
path:
\Segment\Cluster\SimpleBlock
minver:
2
definition:
Similar to Block (see Section 10.1) but without all the extra information; mostly used to reduce overhead when no extra feature is needed. See Section 10.2 on SimpleBlock Structure.
5.1.3.5. BlockGroup Element
id / type:
0xA0 / master
path:
\Segment\Cluster\BlockGroup
definition:
Basic container of information containing a single Block and information specific to that Block.
5.1.3.5.1. Block Element
id / type:
0xA1 / binary
path:
\Segment\Cluster\BlockGroup\Block
minOccurs / maxOccurs:
1 / 1
definition:
Block containing the actual data to be rendered and a timestamp relative to the Cluster Timestamp; see Section 10.1 on Block Structure.
5.1.3.5.2. BlockAdditions Element
id / type:
0x75A1 / master
path:
\Segment\Cluster\BlockGroup\BlockAdditions
maxOccurs:
1
definition:
Contains additional binary data to complete the main one; see the Codec BlockAdditions section of [MatroskaCodec] for more information. An EBML parser that has no knowledge of the Block structure could still see and use/skip this data.
5.1.3.5.2.1. BlockMore Element
id / type:
0xA6 / master
path:
\Segment\Cluster\BlockGroup\BlockAdditions\BlockMore
minOccurs:
1
definition:
Contains the BlockAdditional and some parameters.
5.1.3.5.2.2. BlockAdditional Element
id / type:
0xA5 / binary
path:
\Segment\Cluster\BlockGroup\BlockAdditions\BlockMore\BlockAdditional
minOccurs / maxOccurs:
1 / 1
definition:
Interpreted by the codec as it wishes (using the BlockAddID).
5.1.3.5.2.3.BlockAddID Element
id / type / default:
0xEE / uinteger / 1
range:
not 0
path:
\Segment\Cluster\BlockGroup\BlockAdditions\BlockMore\BlockAddID
minOccurs / maxOccurs:
1 / 1
definition:
An ID that identifies how to interpret the BlockAdditional data; see Section 4.1.5 of [MatroskaCodec] for more information. A value of 1 indicates that the meaning of the BlockAdditional data is defined by the codec. Any other value indicates the meaning of the BlockAdditional data is found in the BlockAddIDType found in the TrackEntry.
usage notes:
Each BlockAddID value MUST be unique between all BlockMore elements found in a BlockAdditions.
usage notes:
To keep MaxBlockAdditionID as low as possible, small values SHOULD be used.
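The constraints on BlockAddID (nonzero, and unique within one BlockAdditions) can be checked with a few lines of code. The sketch below is only an illustration: the function name and the idea of returning a list of problem strings are assumptions, not part of this specification.

```python
from typing import Iterable, List


def find_block_add_id_problems(block_add_ids: Iterable[int]) -> List[str]:
    """Check the BlockAddID values of all BlockMore elements in one BlockAdditions.

    Hypothetical helper: returns a list of human-readable problems,
    which is empty when the values satisfy the rules stated above.
    """
    problems: List[str] = []
    seen = set()
    for add_id in block_add_ids:
        if add_id == 0:
            # The element's range is "not 0".
            problems.append("BlockAddID must not be 0")
        elif add_id in seen:
            # Each value MUST be unique within the BlockAdditions.
            problems.append(f"duplicate BlockAddID {add_id}")
        seen.add(add_id)
    return problems
```

For example, `find_block_add_id_problems([1, 2, 4])` reports nothing, while a repeated value is reported once per repetition.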
5.1.3.5.3.BlockDuration Element
id / type:
0x9B / uinteger
path:
\Segment\Cluster\BlockGroup\BlockDuration
minOccurs / maxOccurs:
see implementation notes / 1
definition:
The duration of the Block expressed in Track Ticks; see Section 11.1. The BlockDuration Element can be useful at the end of a Track to define the duration of the last frame (as there is no subsequent Block available) or when there is a break in a track like there is for subtitle tracks.
notes:
Table 2: BlockDuration Implementation Notes
attribute | note
minOccurs | BlockDuration MUST be set (minOccurs=1) if the associated TrackEntry stores a DefaultDuration value.
default | When not written and with no DefaultDuration, the value is assumed to be the difference between the timestamp of this Block and the timestamp of the next Block in "display" order (not coding order).
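As a rough illustration of the default rule in Table 2, a reader might resolve a Block's duration as sketched below. The function and parameter names are hypothetical; timestamps are assumed to be already expressed in Track Ticks, and raising an error for a missing mandatory BlockDuration is just one possible policy.

```python
from typing import Optional


def effective_block_duration(
    block_duration: Optional[int],
    track_has_default_duration: bool,
    block_timestamp: int,
    next_display_timestamp: Optional[int],
) -> Optional[int]:
    """Resolve the duration of a Block in Track Ticks (illustrative only)."""
    if block_duration is not None:
        # An explicit BlockDuration always wins.
        return block_duration
    if track_has_default_duration:
        # Table 2: BlockDuration is mandatory when DefaultDuration is stored.
        raise ValueError("BlockDuration is required when DefaultDuration is stored")
    if next_display_timestamp is None:
        # Last Block of the Track: nothing to infer the duration from.
        return None
    # Default: distance to the next Block in display order.
    return next_display_timestamp - block_timestamp
```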
5.1.3.5.4.ReferencePriority Element
id / type / default:
0xFA / uinteger / 0
path:
\Segment\Cluster\BlockGroup\ReferencePriority
minOccurs / maxOccurs:
1 / 1
definition:
This frame is referenced and has the specified cache priority. In cache, only a frame of the same or higher priority can replace this frame. A value of 0 means the frame is not referenced.
5.1.3.5.5.ReferenceBlock Element
id / type:
0xFB / integer
path:
\Segment\Cluster\BlockGroup\ReferenceBlock
definition:
A timestamp value, relative to the timestamp of the Block in this BlockGroup, expressed in Track Ticks; see Section 11.1. This is used to reference other frames necessary to decode this frame. The relative value SHOULD correspond to a valid Block that this Block depends on. Historically, Matroska Writers didn't write the actual Block(s) that this Block depends on; however, they did write some Block(s) in the past.

The value "0" MAY also be used to signify that this Block cannot be decoded on its own, but without knowledge of which Block is necessary. In this case, other ReferenceBlock Elements MUST NOT be found in the same BlockGroup.

If the BlockGroup doesn't have a ReferenceBlock element, then the Block it contains can be decoded without using any other Block data.
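
The three cases described for ReferenceBlock (no reference, a reference of 0, and real relative references) could be told apart as in the following sketch. The function name and the choice of return values are assumptions made for illustration; all values are in Track Ticks.

```python
from typing import List, Optional


def resolve_references(
    block_timestamp: int, reference_blocks: List[int]
) -> Optional[List[int]]:
    """Return the absolute timestamps of the Blocks this Block depends on.

    Returns [] when the Block can be decoded on its own, and None when the
    only ReferenceBlock is 0, i.e. a dependency exists but is not identified.
    """
    if not reference_blocks:
        return []
    if 0 in reference_blocks:
        if len(reference_blocks) > 1:
            # A 0 value MUST NOT be combined with other ReferenceBlock Elements.
            raise ValueError("a ReferenceBlock of 0 must be the only one in its BlockGroup")
        return None
    # ReferenceBlock values are relative to this Block's own timestamp.
    return [block_timestamp + relative for relative in reference_blocks]
```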

5.1.3.5.6.CodecState Element
id / type:
0xA4 / binary
path:
\Segment\Cluster\BlockGroup\CodecState
maxOccurs:
1
minver:
2
definition:
The new codec state to use. Data interpretation is private to the codec. This information SHOULD always be referenced by a seek entry.
5.1.3.5.7.DiscardPadding Element
id / type:
0x75A2 / integer
path:
\Segment\Cluster\BlockGroup\DiscardPadding
maxOccurs:
1
minver:
4
definition:
Duration of the silent data added to the Block expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1 (padding at the end of the Block for positive values and at the beginning of the Block for negative values). The duration of DiscardPadding is not calculated in the duration of the TrackEntry and SHOULD be discarded during playback.
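
For an audio track, the sign convention of DiscardPadding can be turned into a number of decoded samples to drop, as in the sketch below. The function name, the use of a sample rate taken from the track, and rounding down to whole samples are all assumptions of this example.

```python
from typing import Tuple

NANOSECONDS_PER_SECOND = 1_000_000_000


def samples_to_discard(discard_padding_ns: int, sample_rate_hz: int) -> Tuple[int, int]:
    """Return (samples_at_start, samples_at_end) to drop from a decoded Block."""
    # Convert the padding duration to whole samples (rounded down; assumption).
    samples = abs(discard_padding_ns) * sample_rate_hz // NANOSECONDS_PER_SECOND
    if discard_padding_ns > 0:
        # Positive values: padding sits at the end of the Block.
        return (0, samples)
    # Negative values: padding sits at the beginning; zero yields (0, 0).
    return (samples, 0)
```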

5.1.4.Tracks Element

id / type:
0x1654AE6B / master
path:
\Segment\Tracks
maxOccurs:
1
recurring:
True
definition:
A Top-Level Element of information with many tracks described.
5.1.4.1.TrackEntry Element
id / type:
0xAE / master
path:
\Segment\Tracks\TrackEntry
minOccurs:
1
definition:
Describes a track with all Elements.
5.1.4.1.1.TrackNumber Element
id / type:
0xD7 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\TrackNumber
minOccurs / maxOccurs:
1 / 1
definition:
The track number as used in the Block Header.
5.1.4.1.2.TrackUID Element
id / type:
0x73C5 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\TrackUID
minOccurs / maxOccurs:
1 / 1
definition:
A UID that identifies the Track.
stream copy:
True (Section 8)
5.1.4.1.3.TrackType Element
id / type:
0x83 / uinteger
path:
\Segment\Tracks\TrackEntry\TrackType
minOccurs / maxOccurs:
1 / 1
definition:
The TrackType defines the type of each frame found in the Track. The value SHOULD be stored on 1 octet.
defined values:
Table 3: TrackType Values
value | label | contents of each frame
1 | video | An image.
2 | audio | Audio samples.
3 | complex | A mix of different other TrackType. The codec needs to define how the Matroska Player should interpret such data.
16 | logo | An image to be rendered over the video track(s).
17 | subtitle | Subtitle or closed caption data to be rendered over the video track(s).
18 | buttons | Interactive button(s) to be rendered over the video track(s).
32 | control | Metadata used to control the player of the Matroska Player.
33 | metadata | Timed metadata that can be passed on to the Matroska Player.
stream copy:
True (Section 8)
5.1.4.1.4.FlagEnabled Element
id / type / default:
0xB9 / uinteger / 1
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagEnabled
minOccurs / maxOccurs:
1 / 1
minver:
2
definition:
Set to 1 if the track is usable. It is possible to turn a track that is not usable into a usable track using chapter codecs or control tracks.
5.1.4.1.5.FlagDefault Element
id / type / default:
0x88 / uinteger / 1
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagDefault
minOccurs / maxOccurs:
1 / 1
definition:
Set if the track (audio, video, or subtitles) is eligible for automatic selection by the player; see Section 19 for more details.
5.1.4.1.6.FlagForced Element
id / type / default:
0x55AA / uinteger / 0
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagForced
minOccurs / maxOccurs:
1 / 1
definition:
Applies only to subtitles. Set if the track is eligible for automatic selection by the player if it matches the user's language preference, even if the user's preferences wouldn't normally enable subtitles with the selected audio track; this can be used for tracks containing only translations of audio in foreign languages or on-screen text. See Section 19 for more details.
5.1.4.1.7.FlagHearingImpaired Element
id / type:
0x55AB / uinteger
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagHearingImpaired
maxOccurs:
1
minver:
4
definition:
Set to 1 if and only if the track is suitable for users with hearing impairments.
5.1.4.1.8.FlagVisualImpaired Element
id / type:
0x55AC / uinteger
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagVisualImpaired
maxOccurs:
1
minver:
4
definition:
Set to 1 if and only if the track is suitable for users with visual impairments.
5.1.4.1.9.FlagTextDescriptions Element
id / type:
0x55AD / uinteger
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagTextDescriptions
maxOccurs:
1
minver:
4
definition:
Set to 1 if and only if the track contains textual descriptions of video content.
5.1.4.1.10.FlagOriginal Element
id / type:
0x55AE / uinteger
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagOriginal
maxOccurs:
1
minver:
4
definition:
Set to 1 if and only if the track is in the content's original language.
5.1.4.1.11.FlagCommentary Element
id / type:
0x55AF / uinteger
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagCommentary
maxOccurs:
1
minver:
4
definition:
Set to 1 if and only if the track contains commentary.
5.1.4.1.12.FlagLacing Element
id / type / default:
0x9C / uinteger / 1
range:
0-1
path:
\Segment\Tracks\TrackEntry\FlagLacing
minOccurs / maxOccurs:
1 / 1
definition:
Set to 1 if the track MAY contain blocks that use lacing. When set to 0, all blocks MUST have their lacing flags set to No lacing; see Section 10.3 on Block Lacing.
5.1.4.1.13.DefaultDuration Element
id / type:
0x23E383 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\DefaultDuration
maxOccurs:
1
definition:
Number of nanoseconds per frame expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1 ("frame" in terms of Matroska -- one Element put into a (Simple)Block).
stream copy:
True (Section 8)
5.1.4.1.14.DefaultDecodedFieldDuration Element
id / type:
0x234E7A / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\DefaultDecodedFieldDuration
maxOccurs:
1
minver:
4
definition:
The period between two successive fields at the output of the decoding process expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1. See Section 9 for more information.
stream copy:
True (Section 8)
5.1.4.1.15.TrackTimestampScale Element
id / type / default:
0x23314F / float / 0x1p+0
range:
> 0x0p+0
path:
\Segment\Tracks\TrackEntry\TrackTimestampScale
minOccurs / maxOccurs:
1 / 1
maxver:
3
definition:
The scale to apply on this track to work at a normal speed in relation with other tracks (mostly used to adjust video speed when the audio length differs).
stream copy:
True (Section 8)
5.1.4.1.16.MaxBlockAdditionID Element
id / type / default:
0x55EE / uinteger / 0
path:
\Segment\Tracks\TrackEntry\MaxBlockAdditionID
minOccurs / maxOccurs:
1 / 1
definition:
The maximum value of BlockAddID (Section 5.1.3.5.2.3). A value of 0 means there are no BlockAdditions (Section 5.1.3.5.2) for this track.
5.1.4.1.17.BlockAdditionMapping Element
id / type:
0x41E4 / master
path:
\Segment\Tracks\TrackEntry\BlockAdditionMapping
minver:
4
definition:
Contains elements that extend the track format by adding content either to each frame, with BlockAddID (Section 5.1.3.5.2.3), or to the track as a whole with BlockAddIDExtraData.
5.1.4.1.17.1.BlockAddIDValue Element
id / type:
0x41F0 / uinteger
range:
>=2
path:
\Segment\Tracks\TrackEntry\BlockAdditionMapping\BlockAddIDValue
maxOccurs:
1
minver:
4
definition:
If the track format extension needs content beside frames, the value refers to the BlockAddID (Section 5.1.3.5.2.3) value being described.
usage notes:
To keep MaxBlockAdditionID as low as possible, small values SHOULD be used.
5.1.4.1.17.2.BlockAddIDName Element
id / type:
0x41A4 / string
path:
\Segment\Tracks\TrackEntry\BlockAdditionMapping\BlockAddIDName
maxOccurs:
1
minver:
4
definition:
A human-friendly name describing the type of BlockAdditional data as defined by the associated Block Additional Mapping.
5.1.4.1.17.3.BlockAddIDType Element
id / type / default:
0x41E7 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\BlockAdditionMapping\BlockAddIDType
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Stores the registered identifier of the Block Additional Mapping to define how the BlockAdditional data should be handled.
usage notes:
If BlockAddIDType is 0, BlockAddIDValue and corresponding BlockAddID values MUST be 1.
5.1.4.1.17.4.BlockAddIDExtraData Element
id / type:
0x41ED / binary
path:
\Segment\Tracks\TrackEntry\BlockAdditionMapping\BlockAddIDExtraData
maxOccurs:
1
minver:
4
definition:
Extra binary data that the BlockAddIDType can use to interpret the BlockAdditional data. The interpretation of the binary data depends on the BlockAddIDType value and the corresponding Block Additional Mapping.
5.1.4.1.18.Name Element
id / type:
0x536E / utf-8
path:
\Segment\Tracks\TrackEntry\Name
maxOccurs:
1
definition:
A human-readable track name.
5.1.4.1.19.Language Element
id / type / default:
0x22B59C / string / eng
path:
\Segment\Tracks\TrackEntry\Language
minOccurs / maxOccurs:
1 / 1
definition:
The language of the track in the Matroska languages form; see Section 12 on language codes. This Element MUST be ignored if the LanguageBCP47 Element is used in the same TrackEntry.
5.1.4.1.20.LanguageBCP47 Element
id / type:
0x22B59D / string
path:
\Segment\Tracks\TrackEntry\LanguageBCP47
maxOccurs:
1
minver:
4
definition:
The language of the track in the [BCP47] form; see Section 12 on language codes. If this Element is used, then any Language Elements used in the same TrackEntry MUST be ignored.
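The precedence between Language and LanguageBCP47 can be summarised in a small helper. This is only a sketch: the function name and the tuple it returns are hypothetical, and `None` stands for an element that is absent from the TrackEntry.

```python
from typing import Optional, Tuple


def effective_language(
    language: Optional[str], language_bcp47: Optional[str]
) -> Tuple[str, str]:
    """Return (form, value), where form is "bcp47" or "matroska"."""
    if language_bcp47 is not None:
        # LanguageBCP47 is present, so the Language Element is ignored.
        return ("bcp47", language_bcp47)
    # Language is mandatory with a default value of "eng".
    return ("matroska", language if language is not None else "eng")
```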
5.1.4.1.21.CodecID Element
id / type:
0x86 / string
path:
\Segment\Tracks\TrackEntry\CodecID
minOccurs / maxOccurs:
1 / 1
definition:
An ID corresponding to the codec; see [MatroskaCodec] for more info.
stream copy:
True (Section 8)
5.1.4.1.22.CodecPrivate Element
id / type:
0x63A2 / binary
path:
\Segment\Tracks\TrackEntry\CodecPrivate
maxOccurs:
1
definition:
Private data only known to the codec.
stream copy:
True (Section 8)
5.1.4.1.23.CodecName Element
id / type:
0x258688 / utf-8
path:
\Segment\Tracks\TrackEntry\CodecName
maxOccurs:
1
definition:
A human-readable string specifying the codec.
5.1.4.1.24.AttachmentLink Element
id / type:
0x7446 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\AttachmentLink
maxOccurs:
1
maxver:
3
definition:
The UID of an attachment that is used by this codec.
usage notes:
The value MUST match the FileUID value of an attachment found in this Segment.
5.1.4.1.25.CodecDelay Element
id / type / default:
0x56AA / uinteger / 0
path:
\Segment\Tracks\TrackEntry\CodecDelay
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
The built-in delay for the codec expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1. It represents the number of codec samples that will be discarded by the decoder during playback. This timestamp value MUST be subtracted from each frame timestamp in order to get the timestamp that will be actually played. The value SHOULD be small so that tracks with the same actual timestamp are muxed in the same Cluster.
stream copy:
True (Section 8)
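The subtraction required for CodecDelay amounts to a one-line computation. The sketch below assumes the frame timestamp has already been converted to nanoseconds; the function name is hypothetical.

```python
def played_timestamp_ns(frame_timestamp_ns: int, codec_delay_ns: int = 0) -> int:
    """Timestamp at which a frame is actually played, in nanoseconds."""
    # CodecDelay defaults to 0 and is subtracted from every frame timestamp.
    return frame_timestamp_ns - codec_delay_ns
```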
5.1.4.1.26.SeekPreRoll Element
id / type / default:
0x56BB / uinteger / 0
path:
\Segment\Tracks\TrackEntry\SeekPreRoll
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
After a discontinuity, SeekPreRoll is the duration of the data that the decoder MUST decode before the decoded data is valid and is expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1.
stream copy:
True (Section 8)
5.1.4.1.27.TrackTranslate Element
id / type:
0x6624 / master
path:
\Segment\Tracks\TrackEntry\TrackTranslate
definition:
The mapping between this TrackEntry and a track value in the given Chapter Codec.
rationale:
The Chapter Codec may need to address content in a specific track, but they may not know of the way to identify tracks in Matroska. This element and its child elements add a way to map the internal tracks known to the Chapter Codec to the track IDs in Matroska. This allows remuxing a file with Chapter Codec without changing the content of the codec data and just the track mapping.
5.1.4.1.27.1.TrackTranslateTrackID Element
id / type:
0x66A5 / binary
path:
\Segment\Tracks\TrackEntry\TrackTranslate\TrackTranslateTrackID
minOccurs / maxOccurs:
1 / 1
definition:
The binary value used to represent this TrackEntry in the chapter codec data. The format depends on the ChapProcessCodecID used; see Section 5.1.7.1.4.15.
5.1.4.1.27.2.TrackTranslateCodec Element
id / type:
0x66BF / uinteger
path:
\Segment\Tracks\TrackEntry\TrackTranslate\TrackTranslateCodec
minOccurs / maxOccurs:
1 / 1
definition:
This TrackTranslate applies to this chapter codec of the given chapter edition(s); see Section 5.1.7.1.4.15.
defined values:
Table 4: TrackTranslateCodec Values
value | label | definition
0 | Matroska Script | Chapter commands using the Matroska Script codec.
1 | DVD-menu | Chapter commands using the DVD-like codec.
5.1.4.1.27.3.TrackTranslateEditionUID Element
id / type:
0x66FC / uinteger
path:
\Segment\Tracks\TrackEntry\TrackTranslate\TrackTranslateEditionUID
definition:
Specifies a chapter-edition UID in which this TrackTranslate applies.
usage notes:
When no TrackTranslateEditionUID is specified in the TrackTranslate, the TrackTranslate applies to all chapter editions found in the Segment using the given TrackTranslateCodec.
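The scoping rule for a TrackTranslate (matching chapter codec, and either no edition UIDs or a matching one) might be expressed as follows. The function and parameter names are hypothetical and only meant to illustrate the rule.

```python
from typing import Sequence


def track_translate_applies(
    translate_codec: int,
    translate_edition_uids: Sequence[int],
    chapter_codec: int,
    edition_uid: int,
) -> bool:
    """Tell whether a TrackTranslate applies to a given chapter edition."""
    if translate_codec != chapter_codec:
        # The mapping is specific to one chapter codec.
        return False
    if not translate_edition_uids:
        # No TrackTranslateEditionUID: applies to every edition using that codec.
        return True
    return edition_uid in translate_edition_uids
```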
5.1.4.1.28.Video Element
id / type:
0xE0 / master
path:
\Segment\Tracks\TrackEntry\Video
maxOccurs:
1
definition:
Video settings.
5.1.4.1.28.1.FlagInterlaced Element
id / type / default:
0x9A / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\FlagInterlaced
minOccurs / maxOccurs:
1 / 1
minver:
2
definition:
Specify whether the video frames in this track are interlaced.
defined values:
Table 5: FlagInterlaced Values
value | label | definition
0 | undetermined | Unknown status. This value SHOULD be avoided.
1 | interlaced | Interlaced frames.
2 | progressive | No interlacing.
stream copy:
True (Section 8)
5.1.4.1.28.2.FieldOrder Element
id / type / default:
0x9D / uinteger / 2
path:
\Segment\Tracks\TrackEntry\Video\FieldOrder
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Specifies the field ordering of video frames in this track.
defined values:
Table 6: FieldOrder Values
value | label | definition
0 | progressive | Interlaced frames. This value SHOULD be avoided; setting FlagInterlaced to 2 is sufficient.
1 | tff | Top field displayed first. Top field stored first.
2 | undetermined | Unknown field order. This value SHOULD be avoided.
6 | bff | Bottom field displayed first. Bottom field stored first.
9 | bff(swapped) | Top field displayed first. Fields are interleaved in storage with the top line of the top field stored first.
14 | tff(swapped) | Bottom field displayed first. Fields are interleaved in storage with the top line of the top field stored first.
usage notes:
If FlagInterlaced is not set to 1, this Element MUST be ignored.
stream copy:
True (Section 8)
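A reader could combine FlagInterlaced and FieldOrder as in the sketch below, which ignores FieldOrder unless FlagInterlaced is 1. The function name, the label dictionary, and the "unrecognized" fallback are assumptions of this example; the labels themselves are those of Table 6.

```python
from typing import Optional

# Labels copied from Table 6.
FIELD_ORDER_LABELS = {
    0: "progressive",
    1: "tff",
    2: "undetermined",
    6: "bff",
    9: "bff(swapped)",
    14: "tff(swapped)",
}


def effective_field_order(flag_interlaced: int = 0, field_order: int = 2) -> Optional[str]:
    """Return the FieldOrder label to honour, or None when it must be ignored."""
    if flag_interlaced != 1:
        # FieldOrder MUST be ignored unless FlagInterlaced is 1.
        return None
    # Values outside Table 6 get a placeholder label (assumption).
    return FIELD_ORDER_LABELS.get(field_order, "unrecognized")
```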
5.1.4.1.28.3.StereoMode Element
id / type / default:
0x53B8 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\StereoMode
minOccurs / maxOccurs:
1 / 1
minver:
3
definition:
Stereo-3D video mode; see Section 18.10 for more details.
restrictions:
Table 7: StereoMode Values
value | label
0 | mono
1 | side by side (left eye first)
2 | top - bottom (right eye is first)
3 | top - bottom (left eye is first)
4 | checkboard (right eye is first)
5 | checkboard (left eye is first)
6 | row interleaved (right eye is first)
7 | row interleaved (left eye is first)
8 | column interleaved (right eye is first)
9 | column interleaved (left eye is first)
10 | anaglyph (cyan/red)
11 | side by side (right eye first)
12 | anaglyph (green/magenta)
13 | both eyes laced in one Block (left eye is first)
14 | both eyes laced in one Block (right eye is first)
stream copy:
True (Section 8)
5.1.4.1.28.4.AlphaMode Element
id / type / default:
0x53C0 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\AlphaMode
minOccurs / maxOccurs:
1 / 1
minver:
3
definition:
Indicates whether the BlockAdditional Element with BlockAddID of "1" contains Alpha data as defined by the Codec Mapping for the CodecID. Undefined values SHOULD NOT be used, as the behavior of known implementations is different (considered either as 0 or 1).
defined values:
Table 8: AlphaMode Values
value | label | definition
0 | none | The BlockAdditional Element with BlockAddID of "1" does not exist or SHOULD NOT be considered as containing such data.
1 | present | The BlockAdditional Element with BlockAddID of "1" contains alpha channel data.
stream copy:
True (Section 8)
5.1.4.1.28.5.OldStereoMode Element
id / type:
0x53B9 / uinteger
path:
\Segment\Tracks\TrackEntry\Video\OldStereoMode
maxOccurs:
1
maxver:
2
definition:
Bogus StereoMode value used in old versions of libmatroska.
restrictions:
Table 9: OldStereoMode Values
value | label
0 | mono
1 | right eye
2 | left eye
3 | both eyes
usage notes:
This Element MUST NOT be used. It was an incorrect value used in libmatroska up to 0.9.0.
5.1.4.1.28.6.PixelWidth Element
id / type:
0xB0 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\Video\PixelWidth
minOccurs / maxOccurs:
1 / 1
definition:
Width of the encoded video frames in pixels.
stream copy:
True (Section 8)
5.1.4.1.28.7.PixelHeight Element
id / type:
0xBA / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\Video\PixelHeight
minOccurs / maxOccurs:
1 / 1
definition:
Height of the encoded video frames in pixels.
stream copy:
True (Section 8)
5.1.4.1.28.8.PixelCropBottom Element
id / type / default:
0x54AA / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\PixelCropBottom
minOccurs / maxOccurs:
1 / 1
definition:
The number of video pixels to remove at the bottom of the image.
stream copy:
True (Section 8)
5.1.4.1.28.9.PixelCropTop Element
id / type / default:
0x54BB / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\PixelCropTop
minOccurs / maxOccurs:
1 / 1
definition:
The number of video pixels to remove at the top of the image.
stream copy:
True (Section 8)
5.1.4.1.28.10.PixelCropLeft Element
id / type / default:
0x54CC / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\PixelCropLeft
minOccurs / maxOccurs:
1 / 1
definition:
The number of video pixels to remove on the left of the image.
stream copy:
True (Section 8)
5.1.4.1.28.11.PixelCropRight Element
id / type / default:
0x54DD / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\PixelCropRight
minOccurs / maxOccurs:
1 / 1
definition:
The number of video pixels to remove on the right of the image.
stream copy:
True (Section 8)
5.1.4.1.28.12.DisplayWidth Element
id / type:
0x54B0 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\Video\DisplayWidth
maxOccurs:
1
definition:
Width of the video frames to display. Applies to the video frame after cropping (PixelCrop* Elements).
notes:
Table 10: DisplayWidth Implementation Notes
attribute | note
default | If the DisplayUnit of the same TrackEntry is 0, then the default value for DisplayWidth is equal to PixelWidth - PixelCropLeft - PixelCropRight; otherwise, there is no default value.
stream copy:
True (Section 8)
5.1.4.1.28.13.DisplayHeight Element
id / type:
0x54BA / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\Video\DisplayHeight
maxOccurs:
1
definition:
Height of the video frames to display. Applies to the video frame after cropping (PixelCrop* Elements).
notes:
Table 11: DisplayHeight Implementation Notes
attribute | note
default | If the DisplayUnit of the same TrackEntry is 0, then the default value for DisplayHeight is equal to PixelHeight - PixelCropTop - PixelCropBottom; otherwise, there is no default value.
stream copy:
True (Section 8)
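The default rules for DisplayWidth and DisplayHeight can be sketched as a single helper. The function name and keyword parameters are hypothetical; `None` stands for an absent element, and a `None` result means that no default exists for that dimension.

```python
from typing import Optional, Tuple


def display_size(
    pixel_width: int,
    pixel_height: int,
    crop_left: int = 0,
    crop_right: int = 0,
    crop_top: int = 0,
    crop_bottom: int = 0,
    display_width: Optional[int] = None,
    display_height: Optional[int] = None,
    display_unit: int = 0,
) -> Tuple[Optional[int], Optional[int]]:
    """Resolve (DisplayWidth, DisplayHeight), applying defaults where defined."""
    width, height = display_width, display_height
    if display_unit == 0:
        # With DisplayUnit 0 (pixels), defaults are the cropped pixel sizes.
        if width is None:
            width = pixel_width - crop_left - crop_right
        if height is None:
            height = pixel_height - crop_top - crop_bottom
    # For any other DisplayUnit there is no default: absent values stay None.
    return (width, height)
```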
5.1.4.1.28.14.DisplayUnit Element
id / type / default:
0x54B2 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\DisplayUnit
minOccurs / maxOccurs:
1 / 1
definition:
How DisplayWidth and DisplayHeight are interpreted.
restrictions:
Table 12: DisplayUnit Values
value | label
0 | pixels
1 | centimeters
2 | inches
3 | display aspect ratio
4 | unknown
5.1.4.1.28.15.UncompressedFourCC Element
id / type:
0x2EB524 / binary
length:
4
path:
\Segment\Tracks\TrackEntry\Video\UncompressedFourCC
minOccurs / maxOccurs:
see implementation notes / 1
definition:
Specifies the uncompressed pixel format used for the Track's data as a FourCC. This value is similar in scope to the biCompression value of AVI's BITMAPINFO [AVIFormat]. There is neither a definitive list of FourCC values nor an official registry. Some common values for YUV pixel formats can be found at [MSYUV8], [MSYUV16], and [FourCC-YUV]. Some common values for uncompressed RGB pixel formats can be found at [MSRGB] and [FourCC-RGB].
notes:
Table 13: UncompressedFourCC Implementation Notes
attribute | note
minOccurs | UncompressedFourCC MUST be set (minOccurs=1) in the TrackEntry when the CodecID Element of the TrackEntry is set to "V_UNCOMPRESSED".
stream copy:
True (Section 8)
5.1.4.1.28.16.Colour Element
id / type:
0x55B0 / master
path:
\Segment\Tracks\TrackEntry\Video\Colour
maxOccurs:
1
minver:
4
definition:
Settings describing the colour format.
stream copy:
True (Section 8)
5.1.4.1.28.17.MatrixCoefficients Element
id / type / default:
0x55B1 / uinteger / 2
path:
\Segment\Tracks\TrackEntry\Video\Colour\MatrixCoefficients
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
The Matrix Coefficients of the video used to derive luma and chroma values from red, green, and blue color primaries. For clarity, the value and meanings for MatrixCoefficients are adopted from Table 4 of [ITU-H.273].
restrictions:
Table 14: MatrixCoefficients Values
value | label
0 | Identity
1 | ITU-R BT.709
2 | unspecified
3 | reserved
4 | US FCC 73.682
5 | ITU-R BT.470BG
6 | SMPTE 170M
7 | SMPTE 240M
8 | YCoCg
9 | BT2020 Non-constant Luminance
10 | BT2020 Constant Luminance
11 | SMPTE ST 2085
12 | Chroma-derived Non-constant Luminance
13 | Chroma-derived Constant Luminance
14 | ITU-R BT.2100-0
stream copy:
True (Section 8)
5.1.4.1.28.18.BitsPerChannel Element
id / type / default:
0x55B2 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\Colour\BitsPerChannel
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Number of decoded bits per channel. A value of 0 indicates that the BitsPerChannel is unspecified.
stream copy:
True (Section 8)
5.1.4.1.28.19.ChromaSubsamplingHorz Element
id / type:
0x55B3 / uinteger
path:
\Segment\Tracks\TrackEntry\Video\Colour\ChromaSubsamplingHorz
maxOccurs:
1
minver:
4
definition:
The amount of pixels to remove in the Cr and Cb channels for every pixel not removed horizontally. For example, the ChromaSubsamplingHorz SHOULD be set to 1 for a video with 4:2:0 chroma subsampling.
stream copy:
True (Section 8)
5.1.4.1.28.20.ChromaSubsamplingVert Element
id / type:
0x55B4 / uinteger
path:
\Segment\Tracks\TrackEntry\Video\Colour\ChromaSubsamplingVert
maxOccurs:
1
minver:
4
definition:
The amount of pixels to remove in the Cr and Cb channels for every pixel not removed vertically. For example, the ChromaSubsamplingVert SHOULD be set to 1 for a video with 4:2:0 chroma subsampling.
stream copy:
True (Section 8)
5.1.4.1.28.21.CbSubsamplingHorz Element
id / type:
0x55B5 / uinteger
path:
\Segment\Tracks\TrackEntry\Video\Colour\CbSubsamplingHorz
maxOccurs:
1
minver:
4
definition:
The amount of pixels to remove in the Cb channel for every pixel not removed horizontally. This is additive with ChromaSubsamplingHorz. For example, the ChromaSubsamplingHorz and CbSubsamplingHorz SHOULD be set to 1 for a video with 4:2:1 chroma subsampling.
stream copy:
True (Section 8)
5.1.4.1.28.22.CbSubsamplingVert Element
id / type:
0x55B6 / uinteger
path:
\Segment\Tracks\TrackEntry\Video\Colour\CbSubsamplingVert
maxOccurs:
1
minver:
4
definition:
The amount of pixels to remove in the Cb channel for every pixel not removed vertically. This is additive with ChromaSubsamplingVert.
stream copy:
True (Section 8)
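One possible reading of the four subsampling elements is that a value of N keeps one chroma sample for every N + 1 luma positions along that axis, with the Cb values added to the common chroma values. The sketch below follows that reading; the function name and the rounding up of sizes that do not divide evenly are assumptions, not requirements of this document.

```python
def subsampled_size(luma_size: int, removed_per_kept: int) -> int:
    """Size of a chroma plane along one axis, given the luma size.

    removed_per_kept is ChromaSubsamplingHorz/Vert for the Cr plane, or
    ChromaSubsampling* + CbSubsampling* for the Cb plane (additive).
    """
    step = 1 + removed_per_kept
    # Ceiling division, so a partial group still gets one sample (assumption).
    return -(-luma_size // step)
```

With ChromaSubsamplingHorz set to 1, a 1920-pixel-wide luma plane gives `subsampled_size(1920, 1)` chroma samples per row; with CbSubsamplingHorz also set to 1, the Cb plane uses `subsampled_size(1920, 1 + 1)`.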
5.1.4.1.28.23.ChromaSitingHorz Element
id / type / default:
0x55B7 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\Colour\ChromaSitingHorz
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
How chroma is subsampled horizontally.
restrictions:
Table 15: ChromaSitingHorz Values
value | label
0 | unspecified
1 | left collocated
2 | half
stream copy:
True (Section 8)
5.1.4.1.28.24.ChromaSitingVert Element
id / type / default:
0x55B8 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\Colour\ChromaSitingVert
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
How chroma is subsampled vertically.
restrictions:
Table 16: ChromaSitingVert Values
value | label
0 | unspecified
1 | top collocated
2 | half
stream copy:
True (Section 8)
5.1.4.1.28.25.Range Element
id / type / default:
0x55B9 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\Colour\Range
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Clipping of the color ranges.
restrictions:
Table 17: Range Values
value | label
0 | unspecified
1 | broadcast range
2 | full range (no clipping)
3 | defined by MatrixCoefficients / TransferCharacteristics
stream copy:
True (Section 8)
5.1.4.1.28.26.TransferCharacteristics Element
id / type / default:
0x55BA / uinteger / 2
path:
\Segment\Tracks\TrackEntry\Video\Colour\TransferCharacteristics
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
The transfer characteristics of the video. For clarity, the value and meanings for TransferCharacteristics are adopted from Table 3 of [ITU-H.273].
restrictions:
Table 18: TransferCharacteristics Values
value | label
0 | reserved
1 | ITU-R BT.709
2 | unspecified
3 | reserved2
4 | Gamma 2.2 curve - BT.470M
5 | Gamma 2.8 curve - BT.470BG
6 | SMPTE 170M
7 | SMPTE 240M
8 | Linear
9 | Log
10 | Log Sqrt
11 | IEC 61966-2-4
12 | ITU-R BT.1361 Extended Colour Gamut
13 | IEC 61966-2-1
14 | ITU-R BT.2020 10 bit
15 | ITU-R BT.2020 12 bit
16 | ITU-R BT.2100 Perceptual Quantization
17 | SMPTE ST 428-1
18 | ARIB STD-B67 (HLG)
stream copy:
True (Section 8)
5.1.4.1.28.27.Primaries Element
id / type / default:
0x55BB / uinteger / 2
path:
\Segment\Tracks\TrackEntry\Video\Colour\Primaries
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
The colour primaries of the video. For clarity, the value and meanings for Primaries are adopted from Table 2 of [ITU-H.273].
restrictions:
Table 19: Primaries Values
value | label
0 | reserved
1 | ITU-R BT.709
2 | unspecified
3 | reserved2
4 | ITU-R BT.470M
5 | ITU-R BT.470BG - BT.601 625
6 | ITU-R BT.601 525 - SMPTE 170M
7 | SMPTE 240M
8 | FILM
9 | ITU-R BT.2020
10 | SMPTE ST 428-1
11 | SMPTE RP 432-2
12 | SMPTE EG 432-2
22 | EBU Tech. 3213-E - JEDEC P22 phosphors
stream copy:
True (Section 8)
5.1.4.1.28.28.MaxCLL Element
id / type:
0x55BC / uinteger
path:
\Segment\Tracks\TrackEntry\Video\Colour\MaxCLL
maxOccurs:
1
minver:
4
definition:
Maximum brightness of a single pixel (Maximum Content Light Level) in candelas per square meter (cd/m^2).
stream copy:
True (Section 8)
5.1.4.1.28.29.MaxFALL Element
id / type:
0x55BD / uinteger
path:
\Segment\Tracks\TrackEntry\Video\Colour\MaxFALL
maxOccurs:
1
minver:
4
definition:
Maximum brightness of a single full frame (Maximum Frame-Average Light Level) in candelas per square meter (cd/m^2).
stream copy:
True (Section 8)
5.1.4.1.28.30.MasteringMetadata Element
id / type:
0x55D0 / master
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata
maxOccurs:
1
minver:
4
definition:
SMPTE 2086 mastering data.
stream copy:
True (Section 8)
5.1.4.1.28.31.PrimaryRChromaticityX Element
id / type:
0x55D1 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\PrimaryRChromaticityX
maxOccurs:
1
minver:
4
definition:
Red X chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.32.PrimaryRChromaticityY Element
id / type:
0x55D2 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\PrimaryRChromaticityY
maxOccurs:
1
minver:
4
definition:
Red Y chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.33.PrimaryGChromaticityX Element
id / type:
0x55D3 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\PrimaryGChromaticityX
maxOccurs:
1
minver:
4
definition:
Green X chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.34.PrimaryGChromaticityY Element
id / type:
0x55D4 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\PrimaryGChromaticityY
maxOccurs:
1
minver:
4
definition:
Green Y chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.35.PrimaryBChromaticityX Element
id / type:
0x55D5 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\PrimaryBChromaticityX
maxOccurs:
1
minver:
4
definition:
Blue X chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.36.PrimaryBChromaticityY Element
id / type:
0x55D6 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\PrimaryBChromaticityY
maxOccurs:
1
minver:
4
definition:
Blue Y chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.37.WhitePointChromaticityX Element
id / type:
0x55D7 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\WhitePointChromaticityX
maxOccurs:
1
minver:
4
definition:
White X chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.38.WhitePointChromaticityY Element
id / type:
0x55D8 / float
range:
0x0p+0-0x1p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\WhitePointChromaticityY
maxOccurs:
1
minver:
4
definition:
White Y chromaticity coordinate as defined by [CIE-1931].
stream copy:
True (Section 8)
5.1.4.1.28.39.LuminanceMax Element
id / type:
0x55D9 / float
range:
>= 0x0p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\LuminanceMax
maxOccurs:
1
minver:
4
definition:
Maximum luminance. Represented in candelas per square meter (cd/m^2).
stream copy:
True (Section 8)
5.1.4.1.28.40.LuminanceMin Element
id / type:
0x55DA / float
range:
>= 0x0p+0
path:
\Segment\Tracks\TrackEntry\Video\Colour\MasteringMetadata\LuminanceMin
maxOccurs:
1
minver:
4
definition:
Minimum luminance. Represented in candelas per square meter (cd/m^2).
stream copy:
True (Section 8)
5.1.4.1.28.41.Projection Element
id / type:
0x7670 / master
path:
\Segment\Tracks\TrackEntry\Video\Projection
maxOccurs:
1
minver:
4
definition:
Describes the video projection details. Used to render spherical or VR videos or to flip videos horizontally or vertically.
stream copy:
True (Section 8)
5.1.4.1.28.42.ProjectionType Element
id / type / default:
0x7671 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\Video\Projection\ProjectionType
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Describes the projection used for this video track.
restrictions:
Table 20: ProjectionType Values
value | label
0 | rectangular
1 | equirectangular
2 | cubemap
3 | mesh
stream copy:
True (Section 8)
5.1.4.1.28.43.ProjectionPrivate Element
id / type:
0x7672 / binary
path:
\Segment\Tracks\TrackEntry\Video\Projection\ProjectionPrivate
maxOccurs:
1
minver:
4
definition:

Private data that only applies to a specific projection.

  • If ProjectionType equals 0 (rectangular), then this element MUST NOT be present.
  • If ProjectionType equals 1 (equirectangular), then this element MUST be present and contain the same binary data that would be stored inside an ISOBMFF Equirectangular Projection Box ("equi").
  • If ProjectionType equals 2 (cubemap), then this element MUST be present and contain the same binary data that would be stored inside an ISOBMFF Cubemap Projection Box ("cbmp").
  • If ProjectionType equals 3 (mesh), then this element MUST be present and contain the same binary data that would be stored inside an ISOBMFF Mesh Projection Box ("mshp").
usage notes:
ISOBMFF box size and FourCC fields are not included in the binary data, but the FullBox version and flag fields are. This is to avoid redundant framing information while preserving versioning and semantics between the two container formats.
stream copy:
True (Section 8)
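The relationship between an ISOBMFF projection box and the ProjectionPrivate payload can be sketched as follows. This is an illustrative sketch only: the helper name `projection_private_from_box` is hypothetical, and it assumes the box uses the compact header (a 32-bit size followed by the FourCC), so boxes with a 64-bit size are not handled.

```python
import struct

# ProjectionType value -> FourCC of the matching ISOBMFF box (from the list above).
EXPECTED_FOURCC = {1: b"equi", 2: b"cbmp", 3: b"mshp"}


def projection_private_from_box(projection_type: int, box: bytes) -> bytes:
    """Return the ProjectionPrivate payload for one complete ISOBMFF box.

    Hypothetical helper: assumes a 32-bit box size (no 64-bit size field).
    """
    if projection_type not in EXPECTED_FOURCC:
        raise ValueError("ProjectionPrivate MUST NOT be present for this type")
    if len(box) < 12:
        raise ValueError("box too short for size, FourCC, and FullBox header")
    size, fourcc = struct.unpack(">I4s", box[:8])
    if fourcc != EXPECTED_FOURCC[projection_type]:
        raise ValueError("FourCC does not match ProjectionType")
    if size != len(box):
        raise ValueError("box size field does not match the data length")
    # Drop size + FourCC (8 octets); keep FullBox version/flags and the body.
    return box[8:]
```

The returned bytes start with the one-octet FullBox version and the three-octet flags, which is what the usage notes require to be preserved.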
5.1.4.1.28.44.ProjectionPoseYaw Element
id / type / default:
0x7673 / float / 0x0p+0
range:
>= -0xB4p+0, <= 0xB4p+0
path:
\Segment\Tracks\TrackEntry\Video\Projection\ProjectionPoseYaw
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Specifies a yaw rotation to the projection.

Value represents a clockwise rotation, in degrees, around the up vector. This rotation must be applied before any ProjectionPosePitch or ProjectionPoseRoll rotations. The value of this element MUST be in the -180 to 180 degree range, both included.

Setting ProjectionPoseYaw to -180 or 180 degrees with the ProjectionPoseRoll and ProjectionPosePitch set to 0 degrees flips the image horizontally.
stream copy:
True (Section 8)

5.1.4.1.28.45.ProjectionPosePitch Element
id / type / default:
0x7674 / float / 0x0p+0
range:
>= -0x5Ap+0, <= 0x5Ap+0
path:
\Segment\Tracks\TrackEntry\Video\Projection\ProjectionPosePitch
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Specifies a pitch rotation to the projection.

Value represents a counter-clockwise rotation, in degrees, around the right vector. This rotation must be applied after the ProjectionPoseYaw rotation and before the ProjectionPoseRoll rotation. The value of this element MUST be in the -90 to 90 degree range, both included.
stream copy:
True (Section 8)

5.1.4.1.28.46.ProjectionPoseRoll Element
id / type / default:
0x7675 / float / 0x0p+0
range:
>= -0xB4p+0, <= 0xB4p+0
path:
\Segment\Tracks\TrackEntry\Video\Projection\ProjectionPoseRoll
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
Specifies a roll rotation to the projection.

Value represents a counter-clockwise rotation, in degrees, around the forward vector. This rotation must be applied after the ProjectionPoseYaw and ProjectionPosePitch rotations. The value of this element MUST be in the -180 to 180 degree range, both included.

Setting ProjectionPoseRoll to -180 or 180 degrees and ProjectionPoseYaw to 180 or -180 degrees with ProjectionPosePitch set to 0 degrees flips the image vertically.

Setting ProjectionPoseRoll to 180 or -180 degrees with ProjectionPoseYaw and ProjectionPosePitch set to 0 degrees flips the image horizontally and vertically.
stream copy:
True (Section 8)
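The pose ranges and the flip combinations described for the three ProjectionPose elements can be checked with a short sketch. The function names `pose_is_valid` and `describe_flip` are hypothetical; the limits are parsed from the hexadecimal float notation used in the range fields (0xB4p+0 is 180 and 0x5Ap+0 is 90).

```python
# Range limits as written in the element definitions (hexadecimal floats).
YAW_LIMIT = float.fromhex("0xB4p+0")    # 180 degrees
PITCH_LIMIT = float.fromhex("0x5Ap+0")  # 90 degrees
ROLL_LIMIT = float.fromhex("0xB4p+0")   # 180 degrees


def pose_is_valid(yaw=0.0, pitch=0.0, roll=0.0):
    """True if every angle lies within its inclusive range."""
    return (abs(yaw) <= YAW_LIMIT
            and abs(pitch) <= PITCH_LIMIT
            and abs(roll) <= ROLL_LIMIT)


def describe_flip(yaw=0.0, pitch=0.0, roll=0.0):
    """Name the pure flip described in the text, or None for any other pose."""
    if pitch != 0.0:
        return None
    yaw_half_turn = abs(yaw) == 180.0
    roll_half_turn = abs(roll) == 180.0
    if yaw_half_turn and roll == 0.0:
        return "horizontal"
    if yaw_half_turn and roll_half_turn:
        return "vertical"
    if roll_half_turn and yaw == 0.0:
        return "horizontal and vertical"
    return None
```

A renderer would apply yaw first, then pitch, then roll; this sketch only classifies the special cases and does not build the rotation itself.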

5.1.4.1.29.Audio Element
id / type:
0xE1 / master
path:
\Segment\Tracks\TrackEntry\Audio
maxOccurs:
1
definition:
Audio settings.
5.1.4.1.29.1.SamplingFrequency Element
id / type / default:
0xB5 / float / 0x1.f4p+12
range:
> 0x0p+0
path:
\Segment\Tracks\TrackEntry\Audio\SamplingFrequency
minOccurs / maxOccurs:
1 / 1
definition:
Sampling frequency in Hz.
stream copy:
True (Section 8)
5.1.4.1.29.2.OutputSamplingFrequency Element
id / type:
0x78B5 / float
range:
> 0x0p+0
path:
\Segment\Tracks\TrackEntry\Audio\OutputSamplingFrequency
maxOccurs:
1
definition:
Real output sampling frequency in Hz (used for SBR techniques).
notes:
Table 21: OutputSamplingFrequency Implementation Notes
attribute | note
default | The default value for OutputSamplingFrequency of the same TrackEntry is equal to the SamplingFrequency.
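The defaulting rule for the two sampling frequency elements can be illustrated as follows. The function name `effective_audio_rates` is hypothetical, and `None` stands for an element that is absent from the Audio element.

```python
# Default SamplingFrequency as written in the element definition.
DEFAULT_SAMPLING_FREQUENCY = float.fromhex("0x1.f4p+12")  # 8000 Hz


def effective_audio_rates(sampling_frequency=None, output_sampling_frequency=None):
    """Return (SamplingFrequency, OutputSamplingFrequency) after defaults."""
    if sampling_frequency is None:
        sampling = DEFAULT_SAMPLING_FREQUENCY
    else:
        sampling = sampling_frequency
    # An absent OutputSamplingFrequency takes the SamplingFrequency value.
    if output_sampling_frequency is None:
        output = sampling
    else:
        output = output_sampling_frequency
    if not (sampling > 0 and output > 0):
        raise ValueError("both frequencies must be greater than 0")
    return sampling, output
```

For an SBR stream, a file might store 22050 Hz as SamplingFrequency and 44100 Hz as OutputSamplingFrequency; those numbers are only an example.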
5.1.4.1.29.3.Channels Element
id / type / default:
0x9F / uinteger / 1
range:
not 0
path:
\Segment\Tracks\TrackEntry\Audio\Channels
minOccurs / maxOccurs:
1 / 1
definition:
Number of channels in the track.
stream copy:
True (Section 8)
5.1.4.1.29.4.BitDepth Element
id / type:
0x6264 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\Audio\BitDepth
maxOccurs:
1
definition:
Bits per sample. Mostly used for PCM.
stream copy:
True (Section 8)
5.1.4.1.30.TrackOperation Element
id / type:
0xE2 / master
path:
\Segment\Tracks\TrackEntry\TrackOperation
maxOccurs:
1
minver:
3
definition:
Operation that needs to be applied on tracks to create this virtual track; see Section 18.8 for more details.
stream copy:
True (Section 8)
5.1.4.1.30.1.TrackCombinePlanes Element
id / type:
0xE3 / master
path:
\Segment\Tracks\TrackEntry\TrackOperation\TrackCombinePlanes
maxOccurs:
1
minver:
3
definition:
Contains the list of all video plane tracks that need to be combined to create this 3D track.
stream copy:
True (Section 8)
5.1.4.1.30.2.TrackPlane Element
id / type:
0xE4 / master
path:
\Segment\Tracks\TrackEntry\TrackOperation\TrackCombinePlanes\TrackPlane
minOccurs:
1
minver:
3
definition:
Contains a video plane track that needs to be combined to create this 3D track.
stream copy:
True (Section 8)
5.1.4.1.30.3.TrackPlaneUID Element
id / type:
0xE5 / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\TrackOperation\TrackCombinePlanes\TrackPlane\TrackPlaneUID
minOccurs / maxOccurs:
1 / 1
minver:
3
definition:
The trackUID number of the track representing the plane.
stream copy:
True (Section 8)
5.1.4.1.30.4.TrackPlaneType Element
id / type:
0xE6 / uinteger
path:
\Segment\Tracks\TrackEntry\TrackOperation\TrackCombinePlanes\TrackPlane\TrackPlaneType
minOccurs / maxOccurs:
1 / 1
minver:
3
definition:
The kind of plane this track corresponds to.
restrictions:
Table 22: TrackPlaneType Values
value | label
0 | left eye
1 | right eye
2 | background
stream copy:
True (Section 8)
5.1.4.1.30.5.TrackJoinBlocks Element
id / type:
0xE9 / master
path:
\Segment\Tracks\TrackEntry\TrackOperation\TrackJoinBlocks
maxOccurs:
1
minver:
3
definition:
Contains the list of all tracks whose Blocks need to be combined to create this virtual track.
stream copy:
True (Section 8)
5.1.4.1.30.6.TrackJoinUID Element
id / type:
0xED / uinteger
range:
not 0
path:
\Segment\Tracks\TrackEntry\TrackOperation\TrackJoinBlocks\TrackJoinUID
minOccurs:
1
minver:
3
definition:
The trackUID number of a track whose blocks are used to create this virtual track.
stream copy:
True (Section 8)
5.1.4.1.31.ContentEncodings Element
id / type:
0x6D80 / master
path:
\Segment\Tracks\TrackEntry\ContentEncodings
maxOccurs:
1
definition:
Settings for several content encoding mechanisms like compression or encryption.
stream copy:
True (Section 8)
5.1.4.1.31.1.ContentEncoding Element
id / type:
0x6240 / master
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding
minOccurs:
1
definition:
Settings for one content encoding like compression or encryption.
stream copy:
True (Section 8)
5.1.4.1.31.2.ContentEncodingOrder Element
id / type / default:
0x5031 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncodingOrder
minOccurs / maxOccurs:
1 / 1
definition:
Specifies the order in which to apply each ContentEncoding of the ContentEncodings. The decoder/demuxer MUST start with the ContentEncoding with the highest ContentEncodingOrder and work its way down to the ContentEncoding with the lowest ContentEncodingOrder. This value MUST be unique for each ContentEncoding found in the ContentEncodings of this TrackEntry.
stream copy:
True (Section 8)
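The decoding order rule for ContentEncodingOrder can be sketched as follows. The representation is an assumption made for illustration: each ContentEncoding is a dictionary with a hypothetical "order" key that holds its ContentEncodingOrder, falling back to the default of 0.

```python
def decoding_order(encodings):
    """Return the ContentEncoding entries in the order a decoder undoes them.

    `encodings` is a list of dicts; the "order" key is a hypothetical
    stand-in for the ContentEncodingOrder element (default 0).
    """
    orders = [enc.get("order", 0) for enc in encodings]
    if len(set(orders)) != len(orders):
        raise ValueError("ContentEncodingOrder MUST be unique in a TrackEntry")
    # Highest ContentEncodingOrder first, lowest last.
    return sorted(encodings, key=lambda enc: enc.get("order", 0), reverse=True)
```

With a compression entry at order 0 and an encryption entry at order 1, the decoder would first decrypt and then decompress.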
5.1.4.1.31.3.ContentEncodingScope Element
id / type / default:
0x5032 / uinteger / 1
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncodingScope
minOccurs / maxOccurs:
1 / 1
definition:
A bit field that describes which Elements have been modified in this way. Values (big-endian) can be OR'ed.
defined values:
Table 23: ContentEncodingScope Values
value | label | definition
1 | Block | All frame contents excluding lacing data.
2 | Private | The track's CodecPrivate data.
4 | Next | The next ContentEncoding (next ContentEncodingOrder; either the data inside ContentCompression and/or ContentEncryption). This value SHOULD NOT be used, as it's not supported by players.
stream copy:
True (Section 8)
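Because ContentEncodingScope is a bit field, a reader tests individual bits instead of comparing the whole value. A minimal sketch, in which the class and function names are hypothetical:

```python
from enum import IntFlag


class ContentEncodingScope(IntFlag):
    """Bit values from Table 23."""
    BLOCK = 1    # all frame contents excluding lacing data
    PRIVATE = 2  # the track's CodecPrivate data
    NEXT = 4     # the next ContentEncoding (SHOULD NOT be used)


def applies_to_frames(scope_value=1):
    """True if the encoding covers frame contents (default scope is 1)."""
    return bool(ContentEncodingScope(scope_value) & ContentEncodingScope.BLOCK)


def applies_to_codec_private(scope_value=1):
    """True if the encoding covers the CodecPrivate data."""
    return bool(ContentEncodingScope(scope_value) & ContentEncodingScope.PRIVATE)
```

A stored value of 3 therefore means that both the frames and the CodecPrivate data were transformed.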
5.1.4.1.31.4.ContentEncodingType Element
id / type / default:
0x5033 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncodingType
minOccurs / maxOccurs:
1 / 1
definition:
A value describing the kind of transformation that is applied.
restrictions:
Table 24: ContentEncodingType Values
value | label
0 | Compression
1 | Encryption
stream copy:
True (Section 8)
5.1.4.1.31.5.ContentCompression Element
id / type:
0x5034 / master
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentCompression
maxOccurs:
1
definition:
Settings describing the compression used. This Element MUST be present if the value of ContentEncodingType is 0 and absent otherwise. Each block MUST be decompressible even if no previous block is available, so that seeking is not prevented.
stream copy:
True (Section 8)
5.1.4.1.31.6.ContentCompAlgo Element
id / type / default:
0x4254 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentCompression\ContentCompAlgo
minOccurs / maxOccurs:
1 / 1
definition:
The compression algorithm used.
defined values:
Table 25: ContentCompAlgo Values
value | label | definition
0 | zlib | zlib compression [RFC1950]
1 | bzlib | bzip2 compression [BZIP2], SHOULD NOT be used; see usage notes.
2 | lzo1x | Lempel-Ziv-Oberhumer compression [LZO], SHOULD NOT be used; see usage notes.
3 | Header Stripping | Octets in ContentCompSettings (Section 5.1.4.1.31.7) have been stripped from each frame.
usage notes:
Compression methods "1" (bzlib) and "2" (lzo1x) lack proper documentation on the format, which limits implementation possibilities. Due to licensing conflicts on commonly available libraries, compression method "2" (lzo1x) does not offer widespread interoperability. A Matroska Writer SHOULD NOT use these compression methods by default. A Matroska Reader MAY support methods "1" and "2" if possible and SHOULD support other methods.
stream copy:
True (Section 8)
5.1.4.1.31.7.ContentCompSettings Element
id / type:
0x4255 / binary
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentCompression\ContentCompSettings
maxOccurs:
1
definition:
Settings that might be needed by the decompressor. For Header Stripping (ContentCompAlgo=3), the bytes that were removed from the beginning of each frame of the track.
stream copy:
True (Section 8)
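Undoing the two compression methods that a Reader is expected to support can be sketched as follows. The function name `restore_frame` is hypothetical; it relies on Python's standard `zlib` module for method 0 and leaves methods 1 and 2 unimplemented.

```python
import zlib


def restore_frame(stored: bytes, comp_algo: int = 0, comp_settings: bytes = b"") -> bytes:
    """Undo ContentCompression for one frame (illustrative sketch)."""
    if comp_algo == 0:
        # zlib stream as defined by RFC 1950; each frame is independent.
        return zlib.decompress(stored)
    if comp_algo == 3:
        # Header Stripping: put back the octets removed from the frame start.
        return comp_settings + stored
    raise NotImplementedError("bzlib (1) and lzo1x (2) are not covered here")
```

Each call handles a single frame on its own, which matches the requirement that a block be decompressible without any previous block.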
5.1.4.1.31.8.ContentEncryption Element
id / type:
0x5035 / master
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption
maxOccurs:
1
definition:
Settings describing the encryption used. This Element MUST be present if the value of ContentEncodingType is 1 (encryption) and MUST be ignored otherwise. A Matroska Player MAY support encryption.
stream copy:
True (Section 8)
5.1.4.1.31.9.ContentEncAlgo Element
id / type / default:
0x47E1 / uinteger / 0
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentEncAlgo
minOccurs / maxOccurs:
1 / 1
definition:
The encryption algorithm used.
defined values:
Table 26: ContentEncAlgo Values
value | label | definition
0 | Not encrypted | The data is not encrypted.
1 | DES | Data Encryption Standard (DES) [FIPS46-3]. This value SHOULD be avoided.
2 | 3DES | Triple Data Encryption Algorithm [SP800-67]. This value SHOULD be avoided.
3 | Twofish | Twofish Encryption Algorithm [Twofish].
4 | Blowfish | Blowfish Encryption Algorithm [Blowfish]. This value SHOULD be avoided.
5 | AES | Advanced Encryption Standard (AES) [FIPS197].
stream copy:
True (Section 8)
5.1.4.1.31.10.ContentEncKeyID Element
id / type:
0x47E2 / binary
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentEncKeyID
maxOccurs:
1
definition:
The ID of the public key that the data was encrypted with for public key algorithms.
stream copy:
True (Section 8)
5.1.4.1.31.11.ContentEncAESSettings Element
id / type:
0x47E7 / master
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentEncAESSettings
maxOccurs:
1
minver:
4
definition:
Settings describing the encryption algorithm used.
notes:
Table 27: ContentEncAESSettings Implementation Notes
attribute | note
maxOccurs | ContentEncAESSettings MUST NOT be set (maxOccurs=0) if ContentEncAlgo is not AES (5).
stream copy:
True (Section 8)
5.1.4.1.31.12.AESSettingsCipherMode Element
id / type:
0x47E8 / uinteger
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentEncAESSettings\AESSettingsCipherMode
minOccurs / maxOccurs:
1 / 1
minver:
4
definition:
The AES cipher mode used in the encryption.
defined values:
Table 28: AESSettingsCipherMode Values
value | label | definition
1 | AES-CTR | Counter [SP800-38A].
2 | AES-CBC | Cipher Block Chaining [SP800-38A].
notes:
Table 29: AESSettingsCipherMode Implementation Notes
attribute | note
maxOccurs | AESSettingsCipherMode MUST NOT be set (maxOccurs=0) if ContentEncAlgo is not AES (5).
stream copy:
True (Section 8)
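The presence rules that tie ContentEncodingType, ContentCompression, ContentEncryption, and ContentEncAESSettings together can be collected into one consistency check. This is a sketch under assumptions: the function name and its boolean parameters are hypothetical and stand for whether each child element was found in one ContentEncoding.

```python
AES = 5  # ContentEncAlgo value for AES (Table 26)


def check_content_encoding(encoding_type=0, has_compression=False,
                           has_encryption=False, enc_algo=0,
                           has_aes_settings=False):
    """Return a list of rule violations for one ContentEncoding."""
    problems = []
    if encoding_type == 0 and not has_compression:
        problems.append("ContentCompression MUST be present for type 0")
    if encoding_type != 0 and has_compression:
        problems.append("ContentCompression MUST be absent unless type is 0")
    if encoding_type == 1 and not has_encryption:
        problems.append("ContentEncryption MUST be present for type 1")
    # A ContentEncryption found with another type is ignored, not an error.
    if has_aes_settings and enc_algo != AES:
        problems.append("ContentEncAESSettings MUST NOT be set unless AES")
    return problems
```

An empty list means that none of the rules checked here was violated; it does not prove that the encoding is otherwise valid.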

5.1.5.Cues Element

id / type:
0x1C53BB6B / master
path:
\Segment\Cues
minOccurs / maxOccurs:
see implementation notes / 1
definition:
A Top-Level Element to speed seeking access. All entries are local to the Segment.
notes:
Table 30: Cues Implementation Notes
attribute | note
minOccurs | This Element SHOULD be set when the Segment is not transmitted as a live stream; see Section 23.2.
5.1.5.1.CuePoint Element
id / type:
0xBB / master
path:
\Segment\Cues\CuePoint
minOccurs:
1
definition:
Contains all information relative to a seek point in the Segment.
5.1.5.1.1.CueTime Element
id / type:
0xB3 / uinteger
path:
\Segment\Cues\CuePoint\CueTime
minOccurs / maxOccurs:
1 / 1
definition:
Absolute timestamp of the seek point expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1.
5.1.5.1.2.CueTrackPositions Element
id / type:
0xB7 / master
path:
\Segment\Cues\CuePoint\CueTrackPositions
minOccurs:
1
definition:
Contains positions for different tracks corresponding to the timestamp.
5.1.5.1.2.1.CueTrack Element
id / type:
0xF7 / uinteger
range:
not 0
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueTrack
minOccurs / maxOccurs:
1 / 1
definition:
The track for which a position is given.
5.1.5.1.2.2.CueClusterPosition Element
id / type:
0xF1 / uinteger
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueClusterPosition
minOccurs / maxOccurs:
1 / 1
definition:
The Segment Position (Section 16) of the Cluster containing the associated Block.
5.1.5.1.2.3.CueRelativePosition Element
id / type:
0xF0 / uinteger
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueRelativePosition
maxOccurs:
1
minver:
4
definition:
The relative position inside the Cluster of the referenced SimpleBlock or BlockGroup with 0 being the first possible position for an Element inside that Cluster.
5.1.5.1.2.4.CueDuration Element
id / type:
0xB2 / uinteger
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueDuration
maxOccurs:
1
minver:
4
definition:
The duration of the block expressed in Segment Ticks, which are based on TimestampScale; see Section 11.1. If this element is missing, the track's DefaultDuration does not apply and no duration information is available in terms of the cues.
5.1.5.1.2.5.CueBlockNumber Element
id / type:
0x5378 / uinteger
range:
not 0
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueBlockNumber
maxOccurs:
1
definition:
Number of the Block in the specified Cluster.
5.1.5.1.2.6.CueCodecState Element
id / type / default:
0xEA / uinteger / 0
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueCodecState
minOccurs / maxOccurs:
1 / 1
minver:
2
definition:
The Segment Position (Section 16) of the Codec State corresponding to this Cue Element. 0 means that the data is taken from the initial Track Entry.
5.1.5.1.2.7.CueReference Element
id / type:
0xDB / master
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueReference
minver:
2
definition:
The Clusters containing the referenced Blocks.
5.1.5.1.2.8.CueRefTime Element
id / type:
0x96 / uinteger
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueReference\CueRefTime
minOccurs / maxOccurs:
1 / 1
minver:
2
definition:
Timestamp of the referenced Block expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1.
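How a player might use the Cues to seek can be sketched with a binary search over CueTime values. The data layout is an assumption for illustration: each cue is a tuple of (CueTime in nanoseconds, CueTrack, CueClusterPosition), and `find_cue` is a hypothetical name.

```python
from bisect import bisect_right


def find_cue(cues, target_ns, track):
    """Return (CueTime, CueClusterPosition) of the last cue at or before target.

    `cues` is a list of (cue_time_ns, cue_track, cluster_position) tuples.
    Returns None when the track has no cue at or before `target_ns`.
    """
    candidates = sorted((time, pos) for time, trk, pos in cues if trk == track)
    times = [time for time, _ in candidates]
    index = bisect_right(times, target_ns)
    if index == 0:
        return None
    return candidates[index - 1]
```

The returned position is a Segment Position, so a reader would still add the offset of the Segment data to obtain a file offset before reading the Cluster.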

5.1.6.Attachments Element

id / type:
0x1941A469 / master
path:
\Segment\Attachments
maxOccurs:
1
definition:
Contains attached files.
5.1.6.1.AttachedFile Element
id / type:
0x61A7 / master
path:
\Segment\Attachments\AttachedFile
minOccurs:
1
definition:
An attached file.
5.1.6.1.1.FileDescription Element
id / type:
0x467E / utf-8
path:
\Segment\Attachments\AttachedFile\FileDescription
maxOccurs:
1
definition:
A human-friendly name for the attached file.
5.1.6.1.2.FileName Element
id / type:
0x466E / utf-8
path:
\Segment\Attachments\AttachedFile\FileName
minOccurs / maxOccurs:
1 / 1
definition:
Filename of the attached file.
5.1.6.1.3.FileMediaType Element
id / type:
0x4660 / string
path:
\Segment\Attachments\AttachedFile\FileMediaType
minOccurs / maxOccurs:
1 / 1
definition:
Media type of the file following the format described in [RFC6838].
stream copy:
True (Section 8)
5.1.6.1.4.FileData Element
id / type:
0x465C / binary
path:
\Segment\Attachments\AttachedFile\FileData
minOccurs / maxOccurs:
1 / 1
definition:
The data of the file.
stream copy:
True (Section 8)
5.1.6.1.5.FileUID Element
id / type:
0x46AE / uinteger
range:
not 0
path:
\Segment\Attachments\AttachedFile\FileUID
minOccurs / maxOccurs:
1 / 1
definition:
UID representing the file, as random as possible.
stream copy:
True (Section 8)

5.1.7.Chapters Element

id / type:
0x1043A770 / master
path:
\Segment\Chapters
maxOccurs:
1
recurring:
True
definition:
A system to define basic menus and partition data. For more detailed information, see Section 20.
5.1.7.1.EditionEntry Element
id / type:
0x45B9 / master
path:
\Segment\Chapters\EditionEntry
minOccurs:
1
definition:
Contains all information about a Segment edition.
5.1.7.1.1.EditionUID Element
id / type:
0x45BC / uinteger
range:
not 0
path:
\Segment\Chapters\EditionEntry\EditionUID
maxOccurs:
1
definition:
A UID to identify the edition. It's useful for tagging an edition.
stream copy:
True (Section 8)
5.1.7.1.2.EditionFlagDefault Element
id / type / default:
0x45DB / uinteger / 0
range:
0-1
path:
\Segment\Chapters\EditionEntry\EditionFlagDefault
minOccurs / maxOccurs:
1 / 1
definition:
Set to 1 if the edition SHOULD be used as the default one.
5.1.7.1.3.EditionFlagOrdered Element
id / type / default:
0x45DD / uinteger / 0
range:
0-1
path:
\Segment\Chapters\EditionEntry\EditionFlagOrdered
minOccurs / maxOccurs:
1 / 1
definition:
Set to 1 if the chapters can be defined multiple times and the order to play them is enforced; see Section 20.1.3.
5.1.7.1.4.ChapterAtom Element
id / type:
0xB6 / master
path:
\Segment\Chapters\EditionEntry\+ChapterAtom
minOccurs:
1
recursive:
True
definition:
Contains the atom information to use as the chapter atom (applies to all tracks).
5.1.7.1.4.1.ChapterUID Element
id / type:
0x73C4 / uinteger
range:
not 0
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterUID
minOccurs / maxOccurs:
1 / 1
definition:
A UID to identify the Chapter.
stream copy:
True (Section 8)
5.1.7.1.4.2.ChapterStringUID Element
id / type:
0x5654 / utf-8
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterStringUID
maxOccurs:
1
minver:
3
definition:
A unique string ID to identify the Chapter. For example, it is used as the storage for [WebVTT] cue identifier values.
5.1.7.1.4.3.ChapterTimeStart Element
id / type:
0x91 / uinteger
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterTimeStart
minOccurs / maxOccurs:
1 / 1
definition:
Timestamp of the start of Chapter expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1.
5.1.7.1.4.4.ChapterTimeEnd Element
id / type:
0x92 / uinteger
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterTimeEnd
minOccurs / maxOccurs:
see implementation notes / 1
definition:
Timestamp of the end of the Chapter (timestamp excluded), expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1. The value MUST be greater than or equal to the ChapterTimeStart of the same ChapterAtom.
usage notes:
With the ChapterTimeEnd timestamp value being excluded, it MUST take into account the duration of the last frame it includes, especially for the ChapterAtom using the last frames of the Segment.
notes:
Table 31: ChapterTimeEnd Implementation Notes
attribute | note
minOccurs | ChapterTimeEnd MUST be set (minOccurs=1) if the Edition is an ordered edition; see Section 20.1.3. If it's a Parent Chapter, see Section 20.2.3.
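The two constraints on ChapterTimeEnd that are stated here can be expressed as a small check. The function name `chapter_duration_ns` is hypothetical, `None` stands for an absent ChapterTimeEnd, and the Parent Chapter case covered by Section 20.2.3 is deliberately left out.

```python
def chapter_duration_ns(start_ns, end_ns=None, ordered_edition=False):
    """Validate ChapterTimeStart/ChapterTimeEnd and return the duration.

    Returns None when ChapterTimeEnd is absent and that is permitted.
    """
    if end_ns is None:
        if ordered_edition:
            raise ValueError("ChapterTimeEnd MUST be set in an ordered edition")
        return None
    if end_ns < start_ns:
        raise ValueError("ChapterTimeEnd MUST be >= ChapterTimeStart")
    # The end timestamp is excluded, so the difference is the full duration.
    return end_ns - start_ns
```

Because the end timestamp is exclusive, a chapter covering the last frame of a Segment ends at that frame's timestamp plus its duration.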
5.1.7.1.4.5.ChapterFlagHidden Element
id / type / default:
0x98 / uinteger / 0
range:
0-1
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterFlagHidden
minOccurs / maxOccurs:
1 / 1
definition:
Set to 1 if a chapter is hidden. Hidden chapters SHOULD NOT be available to the user interface (but still to Control Tracks; see Section 20.2.5 on Chapter flags).
5.1.7.1.4.6.ChapterSegmentUUID Element
id / type:
0x6E67 / binary
length:
16
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterSegmentUUID
minOccurs / maxOccurs:
see implementation notes / 1
definition:
The SegmentUUID of another Segment to play during this chapter.
usage notes:
The value MUST NOT be the SegmentUUID value of the Segment it belongs to.
notes:
Table 32: ChapterSegmentUUID Implementation Notes
attribute | note
minOccurs | ChapterSegmentUUID MUST be set (minOccurs=1) if ChapterSegmentEditionUID is used; see Section 17.2 on Medium-Linking Segments.
5.1.7.1.4.7.ChapterSegmentEditionUID Element
id / type:
0x6EBC / uinteger
range:
not 0
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterSegmentEditionUID
maxOccurs:
1
definition:
The EditionUID to play from the Segment linked in ChapterSegmentUUID. If ChapterSegmentEditionUID is undeclared, then no Edition of the linked Segment is used; see Section 17.2 on Medium-Linking Segments.
5.1.7.1.4.8.ChapterPhysicalEquiv Element
id / type:
0x63C3 / uinteger
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterPhysicalEquiv
maxOccurs:
1
definition:
Specifies the physical equivalent of this ChapterAtom as "DVD" (60) or "SIDE" (50); see Section 20.4 for a complete list of values.
5.1.7.1.4.9.ChapterDisplay Element
id / type:
0x80 / master
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterDisplay
definition:
Contains all possible strings to use for the chapter display.
5.1.7.1.4.10.ChapString Element
id / type:
0x85 / utf-8
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterDisplay\ChapString
minOccurs / maxOccurs:
1 / 1
definition:
Contains the string to use as the chapter atom.
5.1.7.1.4.11.ChapLanguage Element
id / type / default:
0x437C / string / eng
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterDisplay\ChapLanguage
minOccurs:
1
definition:
A language corresponding to the string, in the Matroska languages form; see Section 12 on language codes. This Element MUST be ignored if a ChapLanguageBCP47 Element is used within the same ChapterDisplay Element.
5.1.7.1.4.12.ChapLanguageBCP47 Element
id / type:
0x437D / string
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterDisplay\ChapLanguageBCP47
minver:
4
definition:
A language corresponding to the ChapString, in the [BCP47] form; see Section 12 on language codes. If a ChapLanguageBCP47 Element is used, then any ChapLanguage and ChapCountry Elements used in the same ChapterDisplay MUST be ignored.
5.1.7.1.4.13.ChapCountry Element
id / type:
0x437E / string
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapterDisplay\ChapCountry
definition:
A country corresponding to the string, in the Matroska countries form; see Section 13. This Element MUST be ignored if a ChapLanguageBCP47 Element is used within the same ChapterDisplay Element.
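The precedence between ChapLanguageBCP47 and the older ChapLanguage and ChapCountry elements can be sketched as follows. The function name and the dictionary keys are hypothetical; each argument is the list of values found in one ChapterDisplay.

```python
def effective_chapter_locale(chap_languages=(), chap_languages_bcp47=(),
                             chap_countries=()):
    """Return the language information a reader would honor."""
    if chap_languages_bcp47:
        # BCP 47 tags win; ChapLanguage and ChapCountry MUST be ignored.
        return {"bcp47": list(chap_languages_bcp47),
                "languages": [], "countries": []}
    return {"bcp47": [],
            # ChapLanguage is mandatory with a default value of "eng".
            "languages": list(chap_languages) or ["eng"],
            "countries": list(chap_countries)}
```

A ChapterDisplay that carries both "fre" in ChapLanguage and "fr-CA" in ChapLanguageBCP47 is therefore treated as "fr-CA" only.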
5.1.7.1.4.14.ChapProcess Element
id / type:
0x6944 / master
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapProcess
definition:
Contains all the commands associated to the Atom.
5.1.7.1.4.15.ChapProcessCodecID Element
id / type / default:
0x6955 / uinteger / 0
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapProcess\ChapProcessCodecID
minOccurs / maxOccurs:
1 / 1
definition:
Contains the type of the codec used for processing. A value of 0 means built-in Matroska processing (to be defined) and a value of 1 means the DVD command set is used; see Section 20.3. More codec IDs can be added later.
5.1.7.1.4.16.ChapProcessPrivate Element
id / type:
0x450D / binary
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapProcess\ChapProcessPrivate
maxOccurs:
1
definition:
Optional data attached to the ChapProcessCodecID information. For ChapProcessCodecID=1, it is the "DVD level" equivalent; see Section 20.3.
5.1.7.1.4.17.ChapProcessCommand Element
id / type:
0x6911 / master
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapProcess\ChapProcessCommand
definition:
Contains all the commands associated with the Atom.
5.1.7.1.4.18.ChapProcessTime Element
id / type:
0x6922 / uinteger
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapProcess\ChapProcessCommand\ChapProcessTime
minOccurs / maxOccurs:
1 / 1
definition:
Defines when the process command SHOULD be handled.
restrictions:
Table 33: ChapProcessTime Values
value | label
0 | during the whole chapter
1 | before starting playback
2 | after playback of the chapter
5.1.7.1.4.19.ChapProcessData Element
id / type:
0x6933 / binary
path:
\Segment\Chapters\EditionEntry\+ChapterAtom\ChapProcess\ChapProcessCommand\ChapProcessData
minOccurs / maxOccurs:
1 / 1
definition:
Contains the command information. The data SHOULD be interpreted depending on the ChapProcessCodecID value. For ChapProcessCodecID = 1, the data corresponds to the binary DVD cell pre/post commands; see Section 20.3.

5.1.8.Tags Element

id / type:
0x1254C367 / master
path:
\Segment\Tags
definition:
Element containing metadata describing Tracks, Editions, Chapters, Attachments, or Segments as a whole. A list of valid tags can be found in [MatroskaTags].
5.1.8.1.Tag Element
id / type:
0x7373 / master
path:
\Segment\Tags\Tag
minOccurs:
1
definition:
A single metadata descriptor.
5.1.8.1.1.Targets Element
id / type:
0x63C0 / master
path:
\Segment\Tags\Tag\Targets
minOccurs / maxOccurs:
1 / 1
definition:
Specifies which other elements the metadata represented by the Tag applies to. If this element is empty or omitted, then the Tag describes everything in the Segment.
5.1.8.1.1.1.TargetTypeValue Element
id / type / default:
0x68CA / uinteger / 50
path:
\Segment\Tags\Tag\Targets\TargetTypeValue
minOccurs / maxOccurs:
1 / 1
definition:
A number to indicate the logical level of the target.
defined values:
Table 34: TargetTypeValue Values
value | label | definition
70 | COLLECTION | The highest hierarchical level that tags can describe.
60 | EDITION / ISSUE / VOLUME / OPUS / SEASON / SEQUEL | A list of lower levels grouped together.
50 | ALBUM / OPERA / CONCERT / MOVIE / EPISODE | The most common grouping level of music and video (equal to an episode for TV series).
40 | PART / SESSION | When an album or episode has different logical parts.
30 | TRACK / SONG / CHAPTER | The common parts of an album or movie.
20 | SUBTRACK / MOVEMENT / SCENE | Corresponds to parts of a track for audio, such as a movement or scene in a movie.
10 | SHOT | The lowest hierarchy found in music or movies.
5.1.8.1.1.2.TargetType Element
id / type:
0x63CA / string
path:
\Segment\Tags\Tag\Targets\TargetType
maxOccurs:
1
definition:
An informational string that can be used to display the logical level of the target, such as "ALBUM", "TRACK", "MOVIE", "CHAPTER", etc.
restrictions:
Table 35: TargetType Values
value | label
COLLECTION | TargetTypeValue 70
EDITION | TargetTypeValue 60
ISSUE | TargetTypeValue 60
VOLUME | TargetTypeValue 60
OPUS | TargetTypeValue 60
SEASON | TargetTypeValue 60
SEQUEL | TargetTypeValue 60
ALBUM | TargetTypeValue 50
OPERA | TargetTypeValue 50
CONCERT | TargetTypeValue 50
MOVIE | TargetTypeValue 50
EPISODE | TargetTypeValue 50
PART | TargetTypeValue 40
SESSION | TargetTypeValue 40
TRACK | TargetTypeValue 30
SONG | TargetTypeValue 30
CHAPTER | TargetTypeValue 30
SUBTRACK | TargetTypeValue 20
MOVEMENT | TargetTypeValue 20
SCENE | TargetTypeValue 20
SHOT | TargetTypeValue 10
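The correspondence between TargetType strings and TargetTypeValue numbers can be held in a lookup table. In this sketch the function name `target_type_is_consistent` is hypothetical, and treating a mismatch between the two elements as an inconsistency is an assumption drawn from the table rather than a rule quoted from the text.

```python
# TargetType string -> TargetTypeValue, transcribed from Table 35.
TARGET_TYPE_VALUES = {
    "COLLECTION": 70,
    "EDITION": 60, "ISSUE": 60, "VOLUME": 60,
    "OPUS": 60, "SEASON": 60, "SEQUEL": 60,
    "ALBUM": 50, "OPERA": 50, "CONCERT": 50, "MOVIE": 50, "EPISODE": 50,
    "PART": 40, "SESSION": 40,
    "TRACK": 30, "SONG": 30, "CHAPTER": 30,
    "SUBTRACK": 20, "MOVEMENT": 20, "SCENE": 20,
    "SHOT": 10,
}


def target_type_is_consistent(target_type, target_type_value=50):
    """True if the string is listed for the given TargetTypeValue.

    The default of 50 is the default value of TargetTypeValue.
    """
    return TARGET_TYPE_VALUES.get(target_type) == target_type_value
```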
5.1.8.1.1.3.TagTrackUID Element
id / type / default:
0x63C5 / uinteger / 0
path:
\Segment\Tags\Tag\Targets\TagTrackUID
definition:
A UID to identify the Track(s) that the tags belong to.
usage notes:
If the value is 0 at this level, the tags apply to all tracks in the Segment. If it is set to any other value, it MUST match the TrackUID value of a track found in this Segment.
5.1.8.1.1.4.TagEditionUID Element
id / type / default:
0x63C9 / uinteger / 0
path:
\Segment\Tags\Tag\Targets\TagEditionUID
definition:
A UID to identify the EditionEntry(s) that the tags belong to.
usage notes:
If the value is 0 at this level, the tags apply to all editions in the Segment. If it is set to any other value, it MUST match the EditionUID value of an edition found in this Segment.
5.1.8.1.1.5.TagChapterUID Element
id / type / default:
0x63C4 / uinteger / 0
path:
\Segment\Tags\Tag\Targets\TagChapterUID
definition:
A UID to identify the Chapter(s) the tags belong to.
usage notes:
If the value is 0 at this level, the tags apply to all chapters in the Segment. If it is set to any other value, it MUST match the ChapterUID value of a chapter found in this Segment.
5.1.8.1.1.6.TagAttachmentUID Element
id / type / default:
0x63C6 / uinteger / 0
path:
\Segment\Tags\Tag\Targets\TagAttachmentUID
definition:
A UID to identify the Attachment(s) the tags belong to.
usage notes:
If the value is 0 at this level, the tags apply to all the attachments in the Segment. If it is set to any other value, it MUST match the FileUID value of an attachment found in this Segment.
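The four Tag*UID elements share one rule: 0 selects every item of that kind, and any other value has to match a UID present in the Segment. A minimal sketch of that rule, with the hypothetical name `selected_uids`:

```python
def selected_uids(declared, known):
    """Resolve the UIDs declared at one level of a Targets element.

    `declared` holds the values of one kind of element (for example, all
    TagTrackUID values); `known` holds the UIDs of that kind that exist in
    the Segment (for example, all TrackUID values).
    """
    declared = set(declared)
    known = set(known)
    if 0 in declared:
        return known  # 0 means "all items of this kind"
    dangling = declared - known
    if dangling:
        raise ValueError(f"UIDs not found in this Segment: {sorted(dangling)}")
    return declared
```

The same function applies to tracks, editions, chapters, and attachments; only the source of `known` changes (TrackUID, EditionUID, ChapterUID, or FileUID).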
5.1.8.1.2.SimpleTag Element
id / type:
0x67C8 / master
path:
\Segment\Tags\Tag\+SimpleTag
minOccurs:
1
recursive:
True
definition:
Contains general information about the target.
5.1.8.1.2.1.TagName Element
id / type:
0x45A3 / utf-8
path:
\Segment\Tags\Tag\+SimpleTag\TagName
minOccurs / maxOccurs:
1 / 1
definition:
The name of the Tag that is going to be stored.
5.1.8.1.2.2.TagLanguage Element
id / type / default:
0x447A / string / und
path:
\Segment\Tags\Tag\+SimpleTag\TagLanguage
minOccurs / maxOccurs:
1 / 1
definition:
Specifies the language of the specified tag in the Matroska languages form; see Section 12 on language codes. This Element MUST be ignored if the TagLanguageBCP47 Element is used within the same SimpleTag Element.
5.1.8.1.2.3.TagLanguageBCP47 Element
id / type:
0x447B / string
path:
\Segment\Tags\Tag\+SimpleTag\TagLanguageBCP47
maxOccurs:
1
minver:
4
definition:
The language used in the TagString, in the [BCP47] form; see Section 12. If this Element is used, then any TagLanguage Elements used in the same SimpleTag MUST be ignored.
5.1.8.1.2.4.TagDefault Element
id / type / default:
0x4484 / uinteger / 1
range:
0-1
path:
\Segment\Tags\Tag\+SimpleTag\TagDefault
minOccurs / maxOccurs:
1 / 1
definition:
A boolean value to indicate if this is the default/original language to use for the given tag.
5.1.8.1.2.5.TagString Element
id / type:
0x4487 / utf-8
path:
\Segment\Tags\Tag\+SimpleTag\TagString
maxOccurs:
1
definition:
The value of the Tag.
5.1.8.1.2.6.TagBinary Element
id / type:
0x4485 / binary
path:
\Segment\Tags\Tag\+SimpleTag\TagBinary
maxOccurs:
1
definition:
The values of the Tag if it is binary. Note that this cannot be used in the same SimpleTag as TagString.

6.Matroska Element Ordering

With the exceptions of the EBML Header and the CRC-32 Element, the EBML specification does not require any particular storage order for Elements. However, this specification defines, mandates, and recommends the order of certain Elements to facilitate better playback, seeking, and editing efficiency. This section describes and offers rationale for ordering requirements and recommendations for Matroska.

6.1.Top-Level Elements

The Info Element is the only REQUIRED Top-Level Element in a Matroska file. To be playable, a Matroska file MUST also contain at least one Tracks Element and Cluster Element. The first Info Element and the first Tracks Element MUST either be stored before the first Cluster Element or both SHALL be referenced by a SeekHead Element occurring before the first Cluster Element.

All Top-Level Elements MUST use an EBML Element ID that is 4 octets long.

When using Medium Linking, chapters are used to reference other Segments to play in a given order (Section 17.2). A Segment containing these Linked Chapters does not require a Tracks Element or a Cluster Element.

It is possible to edit a Matroska file after it has been created. For example, chapters, tags, or attachments can be added. When new Top-Level Elements are added to a Matroska file, the SeekHead Element(s) MUST be updated so that the SeekHead Element(s) itemize the identity and position of all Top-Level Elements.

Editing, removing, or adding Elements to a Matroska file often requires that some existing Elements be voided or extended. Transforming the existing Elements into Void Elements as padding can be used as a method to avoid moving large amounts of data around.

6.2.CRC-32

As noted by the EBML specification, if a CRC-32 Element is used, then the CRC-32 Element MUST be the first ordered Element within its Parent Element.

In Matroska, all Top-Level Elements of an EBML Document SHOULD include a CRC-32 Element as their first Child Element. The Segment Element, which is the Root Element, SHOULD NOT have a CRC-32 Element.

6.3.SeekHead

If used, the first SeekHead Element MUST be the first non-CRC-32 Child Element of the Segment Element. If a second SeekHead Element is used, then the first SeekHead Element MUST reference the identity and position of the second SeekHead Element.

Additionally, the second SeekHead Element MUST only reference Cluster Elements and not any other Top-Level Element already contained within the first SeekHead Element.

The second SeekHead Element MAY be stored in any order relative to the other Top-Level Elements. Whether one or two SeekHead Element(s) are used, the SeekHead Element(s) MUST collectively reference the identity and position of all Top-Level Elements except for the first SeekHead Element.

6.4.Cues (Index)

The Cues Element is RECOMMENDED to optimize seeking access in Matroska. It is programmatically simpler to add the Cues Element after all Cluster Elements have been written because this does not require a prediction of how much space to reserve before writing the Cluster Elements. However, storing the Cues Element before the Cluster Elements can provide some seeking advantages. If the Cues Element is present, then it SHOULD either be stored before the first Cluster Element or be referenced by a SeekHead Element.

6.5.Info

The first Info Element SHOULD occur before the first Tracks Element and first Cluster Element except when it is referenced by a SeekHead Element.

6.6.Chapters Element

The Chapters Element SHOULD be placed before the Cluster Element(s). The Chapters Element can be used during playback even if the user does not need to seek. It immediately gives the user information about what section is being read and what other sections are available. In the case of Ordered Chapters, it is RECOMMENDED to evaluate the logical linking even before playing. The Chapters Element SHOULD be placed before the first Tracks Element and after the first Info Element.

6.7.Attachments

The Attachments Element is not intended to be used by default when playing the file, but could contain information relevant to the content, such as cover art or fonts. Cover art is useful even before the file is played and fonts could be needed before playback starts for the initialization of subtitles. The Attachments Element MAY be placed before the first Cluster Element; however, if the Attachments Element is likely to be edited, then it SHOULD be placed after the last Cluster Element.

6.8.Tags

The Tags Element is most subject to changes after the file was originally created. For easier editing, the Tags Element can be placed at the end of the Segment Element and after the Attachments Element. On the other hand, it is inconvenient to have to seek in the Segment for tags, especially for network streams; thus, it's better if the Tags Element is found early in the stream. When editing the Tags Element, the original Tags Element at the beginning can be overwritten with a Void Element and a new Tags Element written at the end of the Segment Element. The file and Segment sizes will only marginally change.

7.Matroska Versioning

Matroska is based on the principle that a reading application does not have to support 100% of the specifications in order to be able to play the file. Therefore, a Matroska file contains version indicators that tell a reading application what to expect.

It is possible and valid to have the version fields indicate that the file contains Matroska Elements from a higher specification version number while signaling that a reading application MUST only support a lower version number properly in order to play it back (possibly with a reduced feature set).

The EBML Header of each Matroska document informs the reading application on what version of Matroska to expect. The Elements within the EBML Header with jurisdiction over this information are DocTypeVersion and DocTypeReadVersion.

DocTypeVersion MUST be equal to or greater than the highest Matroska version number of any Element present in the Matroska file. For example, a file using the SimpleBlock Element (Section 5.1.3.4) MUST have a DocTypeVersion equal to or greater than 2. A file containing CueRelativePosition Elements (Section 5.1.5.1.2.3) MUST have a DocTypeVersion equal to or greater than 4.

The DocTypeReadVersion MUST contain the minimum version number that a reading application needs to support in order to play the file back -- optionally with a reduced feature set. For example, if a file contains only Elements of version 2 or lower except for CueRelativePosition (which is a version 4 Matroska Element), then DocTypeReadVersion SHOULD still be set to 2 and not 4 because evaluating CueRelativePosition is not necessary for standard playback -- it makes seeking more precise if used.

A reading application supporting Matroska version V MUST NOT refuse to read a file with DocTypeReadVersion equal to or lower than V, even if DocTypeVersion is greater than V.

A reading application supporting Matroska version V at minimum and reading a file whose DocTypeReadVersion field is equal to or lower than V MUST skip Matroska / EBML Elements it encounters but does not know about if that unknown element fits into the size constraints set by the current Parent Element.
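The version rules above can be sketched informally as two small checks. This is an illustration under stated assumptions, not a normative algorithm: the function names are hypothetical, and `element_minvers` stands for the "minver" values of the Elements a writer intends to store.

```python
def reader_must_accept(doc_type_read_version: int, supported_version: int) -> bool:
    """True when a reader supporting `supported_version` MUST NOT refuse the file.

    Only DocTypeReadVersion matters here; DocTypeVersion may be higher than
    the version the reader supports.
    """
    return doc_type_read_version <= supported_version


def minimum_doc_type_version(element_minvers) -> int:
    """Lowest DocTypeVersion a writer may signal for the given Elements.

    DocTypeVersion has to be at least the highest version of any Element
    present in the file. An empty input yields 1 (an assumption of this sketch).
    """
    return max(element_minvers, default=1)
```

For instance, a file that uses SimpleBlock (version 2) and CueRelativePosition (version 4) needs a DocTypeVersion of at least 4, yet a version 2 reader still has to accept it when DocTypeReadVersion is 2.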

8.Stream Copy

It is sometimes necessary to create a Matroska file from another Matroska file; e.g., to add subtitles in a language or to edit out a portion of the content. Some values from the original Matroska file need to be kept the same in the destination file. For example, the SamplingFrequency of an audio track wouldn't change between the two files. Some other values may change between the two files, such as the TrackNumber of an audio track when another track has been added.

An Element is marked with the property "stream copy: True" when the values of that Element need to be kept identical between the source and destination files.If that property is not set, elements may or may not keep the same value between the source and destination files.

9.DefaultDecodedFieldDuration

The DefaultDecodedFieldDuration Element can signal to the displaying application how often fields of a video sequence will be available for displaying. It can be used for both interlaced and progressive content.

If the video sequence is signaled as interlaced (Section 5.1.4.1.28.1), then DefaultDecodedFieldDuration equals the period between two successive fields at the output of the decoding process. For video sequences signaled as progressive, DefaultDecodedFieldDuration is half of the period between two successive frames at the output of the decoding process.

These values are valid at the end of the decoding process before post-processing (such as deinterlacing or inverse telecine) is applied.

Examples:
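As an informal sketch (not part of this specification), the two definitions above can be turned into a small calculation. It assumes the value is expressed in nanoseconds and rounded to the nearest integer; the function names are hypothetical.

```python
from fractions import Fraction

NS_PER_SECOND = 1_000_000_000


def field_duration_progressive_ns(frames_per_second: Fraction) -> int:
    """Half of the period between two successive decoded frames."""
    frame_period_ns = Fraction(NS_PER_SECOND) / frames_per_second
    return round(frame_period_ns / 2)


def field_duration_interlaced_ns(fields_per_second: Fraction) -> int:
    """Period between two successive decoded fields."""
    return round(Fraction(NS_PER_SECOND) / fields_per_second)
```

With these assumptions, progressive content at 25 frames per second and interlaced content at 50 fields per second both give 20,000,000 ns.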

10.Cluster Blocks

Frames using references SHOULD be stored in "coding order", i.e., storing the references first and then the frames referencing them. A consequence is that timestamps might not be consecutive. However, a frame with a past timestamp MUST reference a frame already known. Otherwise, the frame is considered bad/void.

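Matroska has two similar ways to store frames in a block:

  • in a Block that is contained inside a BlockGroup
  • in a SimpleBlock that is stored directly in the Cluster

The SimpleBlock is usually preferred unless some extra elements of the BlockGroup need to be used. A Matroska Reader MUST support both types of blocks.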

Each block contains the same parts in the following order:

  • a variable-length header
  • optionally, the lacing information
  • the consecutive frame(s)

The block header starts with the number of the Track it corresponds to. The value MUST correspond to the TrackNumber (Section 5.1.4.1.1) of a TrackEntry of the Segment.

The TrackNumber is coded using the Variable-Size Integer (VINT) mechanism described in Section 4 of [RFC8794]. To save space, the shortest VINT form SHOULD be used. The value can be coded on up to 8 octets. This is the only element with a variable size in the block header.

The timestamp is expressed in Track Ticks; see Section 11.1. The value is stored as a signed value on 16 bits.

10.1.Block Structure

This section describes the binary data contained in the Block Element (Section 5.1.3.5.1). Bit 0 is the most significant bit.

As the TrackNumber size can vary between 1 and 8 octets, there are 8 different sizes for the Block header. The definitions for TrackNumber sizes of 1 and 2 are provided; the other variants can be deduced by extending the size of the TrackNumber by multiples of 8 bits.

 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|               |                               |       |I|LAC|U|
|  Track Number |         Timestamp             | Rsvrd |N|ING|N|
|               |                               |       |V|   |U|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 11: Block Header with 1 Octet TrackNumber

 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|          Track Number         |         Timestamp             |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|       |I|LAC|U|
| Rsvrd |N|ING|N|                     ...
|       |V|   |U|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 12: Block Header with 2 Octets TrackNumber

where:

Track Number:
8, 16, 24, 32, 40, 48, 56, or 64 bits. An EBML VINT-coded track number.
Timestamp:
16 bits. Signed timestamp in Track Ticks.
Rsvrd:
4 bits. Reserved bits MUST be set to 0.
INV:
1 bit. Invisible. The codec SHOULD decode this frame but not display it.
LACING:

2 bits. Uses lacing mode.

00b:
no lacing (Section 10.3.1)
01b:
Xiph lacing (Section 10.3.2)
11b:
EBML lacing (Section 10.3.3)
10b:
fixed-size lacing (Section 10.3.4)
UNU:
1 bit that is unused.

The following data in the Block corresponds to the lacing data and frames usage as described in each respective lacing mode.
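The header layout described in this section can be read with a few lines of code. The following is a minimal sketch, not a reference implementation: the function and dictionary key names are hypothetical, and the 16-bit timestamp is read in big-endian order, as multi-octet integers are in EBML.

```python
import struct

LACING_MODES = {0b00: "none", 0b01: "xiph", 0b11: "ebml", 0b10: "fixed-size"}


def read_vint(data: bytes, pos: int = 0):
    """Decode an EBML VINT starting at `pos`; return (value, octet_length)."""
    first = data[pos]
    if first == 0:
        raise ValueError("VINT longer than 8 octets")
    length = 9 - first.bit_length()            # leading zero bits + marker bit
    if pos + length > len(data):
        raise ValueError("truncated VINT")
    value = first & ((1 << (8 - length)) - 1)  # drop the length marker bit
    for octet in data[pos + 1:pos + length]:
        value = (value << 8) | octet
    return value, length


def parse_block_header(data: bytes) -> dict:
    """Parse TrackNumber, Timestamp, and flags at the start of a Block."""
    track_number, n = read_vint(data)
    if len(data) < n + 3:
        raise ValueError("truncated Block header")
    (timestamp,) = struct.unpack_from(">h", data, n)  # signed 16 bits, Track Ticks
    flags = data[n + 2]
    return {
        "track_number": track_number,
        "timestamp": timestamp,
        "invisible": bool(flags & 0x08),              # bit 4 (bit 0 is the MSB)
        "lacing": LACING_MODES[(flags >> 1) & 0x03],  # bits 5 and 6
        "header_size": n + 3,
    }
```

The lacing data and frames start at offset `header_size` in the Block data.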

10.2.SimpleBlock Structure

This section describes the binary data contained in the SimpleBlock Element (Section 5.1.3.4). Bit 0 is the most significant bit.

The SimpleBlock structure is inspired by the Block structure; see Section 10.1. The main differences are the added Keyframe flag and Discardable flag. Otherwise, everything is the same.

As the TrackNumber size can vary between 1 and 8 octets, there are 8 different sizes for the SimpleBlock header. The definitions for TrackNumber sizes of 1 and 2 are provided; the other variants can be deduced by extending the size of the TrackNumber by multiples of 8 bits.

 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|               |                               |K|     |I|LAC|D|
|  Track Number |         Timestamp             |E|Rsvrd|N|ING|I|
|               |                               |Y|     |V|   |S|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 13: SimpleBlock Header with 1 Octet TrackNumber

 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|          Track Number         |         Timestamp             |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|K|     |I|LAC|D|
|E|Rsvrd|N|ING|I|                     ...
|Y|     |V|   |S|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 14: SimpleBlock Header with 2 Octets TrackNumber

where:

Track Number:
8, 16, 24, 32, 40, 48, 56, or 64 bits. An EBML VINT-coded track number.
Timestamp:
16 bits. Signed timestamp in Track Ticks.
KEY:
1 bit. Keyframe. Set when the Block contains only keyframes.
Rsvrd:
3 bits. Reserved bits MUST be set to 0.
INV:
1 bit. Invisible. The codec SHOULD decode this frame but not display it.
LACING:

2 bits. Uses lacing mode.

00b:
no lacing (Section 10.3.1)
01b:
Xiph lacing (Section 10.3.2)
11b:
EBML lacing (Section 10.3.3)
10b:
fixed-size lacing (Section 10.3.4)
DIS:
1 bit. Discardable. The frames of the Block can be discarded during playing if needed.

The following data in the SimpleBlock corresponds to the lacing data and frames usage as described in each respective lacing mode.
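As an informal illustration, the flags octet of a SimpleBlock header (the octet that follows the 16-bit timestamp) can be decoded as follows. The function and key names are hypothetical.

```python
LACING_NAMES = {0b00: "none", 0b01: "xiph", 0b11: "ebml", 0b10: "fixed-size"}


def decode_simpleblock_flags(flags: int) -> dict:
    """Split the SimpleBlock flags octet into its fields (bit 0 is the MSB)."""
    return {
        "keyframe": bool(flags & 0x80),               # bit 0
        "reserved": (flags >> 4) & 0x07,              # bits 1-3, expected to be 0
        "invisible": bool(flags & 0x08),              # bit 4
        "lacing": LACING_NAMES[(flags >> 1) & 0x03],  # bits 5 and 6
        "discardable": bool(flags & 0x01),            # bit 7
    }
```

A reader following this sketch would treat a non-zero "reserved" value as a violation of the MUST above.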

10.3.Block Lacing

Lacing is a mechanism to save space when storing data. It is typically used for small blocks of data (referred to as frames in Matroska). It packs multiple frames into a single Block or SimpleBlock.

Lacing MUST NOT be used to store a single frame in a Block or SimpleBlock.

There are three types of lacing:

  • Xiph, which is inspired by what is found in the Ogg container[RFC3533].
  • EBML, which is the same with sizes coded differently.
  • Fixed-size, where the size is not coded.

When lacing is not used, i.e., to store a single frame, lacing bits 5 and 6 of the Block or SimpleBlock MUST be set to 0.

For example, a user wants to store 3 frames of the same track. The first frame is 800 octets long,the second is 500 octets long, and the third is 1000 octets long. Since these frames are small,they can be stored in a lace to save space.

It is possible to not use lacing at all and just store a single frame without any extra data. When the FlagLacing (Section 5.1.4.1.12) is set to "0", all blocks of that track MUST NOT use lacing.

10.3.1.No Lacing

When no lacing is used, the number of frames in the lace is omitted and only one frame can be stored in the Block. Bits 5 and 6 of the Block Header flags are set to 0b00.

The Block for an 800-octet frame is as follows:

Table 36: No Lacing
Block Octets | Value   | Description
-------------+---------+------------------
4-803        | <frame> | Single frame data

When a Block contains a single frame, it MUST use this No lacing mode.

10.3.2.Xiph Lacing

The Xiph lacing uses the same coding of size as found in the Ogg container [RFC3533]. Bits 5 and 6 of the Block Header flags are set to 0b01.

The Block data with laced frames is stored as follows:

  • Lacing Head on 1 octet: Number of frames in the lace minus 1.
  • Lacing size of each frame except the last one.
  • Binary data of each frame consecutively.

The lacing size is split into 255 values, stored as unsigned octets -- for example, 500 is coded 255;245 or [0xFF 0xF5].A frame with a size multiple of 255 is coded with a 0 at the end of the size -- for example, 765 is coded 255;255;255;0 or [0xFF 0xFF 0xFF 0x00].

The size of the last frame is deduced from the size remaining in the Block after the other frames.

Because large sizes result in large coding of the sizes, it is RECOMMENDED to use Xiph lacing only with small frames.

In our example, the 800, 500, and 1000-octet frames are stored with Xiph lacing in a Block as follows:

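Table 37: Xiph Lacing Example
Block Octets | Value               | Description
-------------+---------------------+--------------------------------------------
4            | 0x02                | Number of frames minus 1
5-8          | 0xFF 0xFF 0xFF 0x23 | Size of the first frame (255; 255; 255; 35)
9-10         | 0xFF 0xF5           | Size of the second frame (255; 245)
11-810       | <frame1>            | First frame data
811-1310     | <frame2>            | Second frame data
1311-2310    | <frame3>            | Third frame data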

The Block is 2311 octets and the last frame starts at 1311, so we can deduce that the size of the last frame is 2311 - 1311 = 1000.
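The Xiph size coding and the deduction of the last frame size can be sketched as follows. This is an informal illustration with hypothetical function names; `payload` stands for the Block data that follows the Block header (starting at the Lacing Head).

```python
def xiph_encode_size(size: int) -> bytes:
    """Code one frame size as a run of 255s followed by a final octet below 255."""
    return bytes([255] * (size // 255) + [size % 255])


def xiph_lace_sizes(payload: bytes) -> list:
    """Return the size of every frame in a Xiph-laced Block payload."""
    frame_count = payload[0] + 1          # Lacing Head: number of frames minus 1
    pos = 1
    sizes = []
    for _ in range(frame_count - 1):      # the last size is not stored
        size = 0
        while True:
            octet = payload[pos]
            pos += 1
            size += octet
            if octet != 255:              # an octet below 255 ends the size
                break
        sizes.append(size)
    # The last frame takes whatever remains after the other frames.
    sizes.append(len(payload) - pos - sum(sizes))
    return sizes
```

Applied to the example above, the sizes 800 and 500 are coded on 4 and 2 octets, and the 1000-octet size of the third frame is recovered from the remaining data.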

10.3.3.EBML Lacing

The EBML lacing encodes the frame size with an EBML-like encoding [RFC8794]. Bits 5 and 6 of the Block Header flags are set to 0b11.

The Block data with laced frames is stored as follows:

  • Lacing Head on 1 Octet: Number of frames in the lace minus 1.
  • Lacing size of each frame except the last one.
  • Binary data of each frame consecutively.

The first frame size is encoded as an EBML VINT value. The remaining frame sizes are encoded as signed values using the difference between the frame size and the previous frame size. These signed values are encoded as VINT with a mapping from signed to unsigned numbers. Decoding the unsigned number stored in the VINT to a signed number is done by subtracting 2^((7*n)-1) - 1, where n is the octet size of the VINT.

Table 38: EBML Lacing Signed VINT Bits Usage
Bit Representation of Signed VINT                 | Possible Value Range
--------------------------------------------------+-----------------------------------
1xxx xxxx                                         | 2^7 values from -(2^6-1) to 2^6
01xx xxxx xxxx xxxx                               | 2^14 values from -(2^13-1) to 2^13
001x xxxx xxxx xxxx xxxx xxxx                     | 2^21 values from -(2^20-1) to 2^20
0001 xxxx xxxx xxxx xxxx xxxx xxxx xxxx           | 2^28 values from -(2^27-1) to 2^27
0000 1xxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx | 2^35 values from -(2^34-1) to 2^34

In our example, the 800, 500 and 1000-octet frames are stored with EBML lacing in a Block as follows:

Table 39: EBML Lacing Example
Block Octets | Value     | Description
-------------+-----------+-------------------------------------------------------------------------
4            | 0x02      | Number of frames minus 1
5-6          | 0x43 0x20 | Size of the first frame (800 = 0x320 + 0x4000)
7-8          | 0x5E 0xD3 | Size of the second frame (500 - 800 = -300 = - 0x12C + 0x1FFF + 0x4000)
9-808        | <frame1>  | First frame data
809-1308     | <frame2>  | Second frame data
1309-2308    | <frame3>  | Third frame data

The Block is 2309 octets and the last frame starts at 1309, so we can deduce that the size of the last frame is 2309 - 1309 = 1000.
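The signed-to-unsigned mapping used by EBML lacing can be sketched as follows. This is an informal illustration with hypothetical function names. As an assumption of this sketch, the encoder picks the shortest length and never emits the all-ones data pattern; `payload` stands for the Block data that follows the Block header.

```python
def _vint_bytes(unsigned: int, n: int) -> bytes:
    """Prefix an n-octet VINT with its length marker bit."""
    return ((1 << (7 * n)) | unsigned).to_bytes(n, "big")


def encode_vint(value: int) -> bytes:
    """Encode an unsigned value as the shortest VINT."""
    for n in range(1, 9):
        if 0 <= value < (1 << (7 * n)) - 1:
            return _vint_bytes(value, n)
    raise ValueError("value out of range")


def encode_signed_vint(delta: int) -> bytes:
    """Encode a size difference by adding the bias 2^((7*n)-1) - 1."""
    for n in range(1, 9):
        unsigned = delta + (1 << (7 * n - 1)) - 1
        if 0 <= unsigned < (1 << (7 * n)) - 1:
            return _vint_bytes(unsigned, n)
    raise ValueError("difference out of range")


def decode_vint(data: bytes, pos: int):
    """Return (unsigned value, octet length) of the VINT at `pos`."""
    n = 9 - data[pos].bit_length()
    if n > 8:
        raise ValueError("invalid VINT")
    raw = int.from_bytes(data[pos:pos + n], "big")
    return raw & ((1 << (7 * n)) - 1), n


def ebml_lace_sizes(payload: bytes) -> list:
    """Return the size of every frame in an EBML-laced Block payload."""
    frame_count = payload[0] + 1
    size, n = decode_vint(payload, 1)
    sizes, pos = [size], 1 + n
    for _ in range(frame_count - 2):
        unsigned, n = decode_vint(payload, pos)
        size += unsigned - ((1 << (7 * n - 1)) - 1)   # remove the bias
        sizes.append(size)
        pos += n
    sizes.append(len(payload) - pos - sum(sizes))     # last size is deduced
    return sizes
```

With the example frames, the first size 800 is coded as 0x43 0x20 and the difference -300 as 0x5E 0xD3, matching Table 39.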

10.3.4.Fixed-size Lacing

Fixed-size lacing doesn't store the frame size; rather, it only stores the number of frames in the lace. Each frame MUST have the same size. The frame size of each frame is deduced from the total size of the Block. Bits 5 and 6 of the Block Header flags are set to 0b10.

The Block data with laced frames is stored as follows:

  • Lacing Head on 1 Octet: Number of frames in the lace minus 1.
  • Binary data of each frame consecutively.

For example, for three frames that are 800 octets each:

Table 40: Fixed-Size Lacing Example
Block Octets | Value    | Description
-------------+----------+-------------------------
4            | 0x02     | Number of frames minus 1
5-804        | <frame1> | First frame data
805-1604     | <frame2> | Second frame data
1605-2404    | <frame3> | Third frame data

This gives a Block of 2405 octets. When reading the Block, we find that there are three frames (Octet 4).The data start at Octet 5, so the size of each frame is (2405 - 5) / 3 = 800.
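The same deduction can be sketched in code. This is an informal illustration; the function name is hypothetical, and `payload` stands for the Block data that follows the Block header (starting at the Lacing Head).

```python
def split_fixed_size_lace(payload: bytes) -> list:
    """Split a fixed-size laced Block payload into its frames."""
    frame_count = payload[0] + 1          # Lacing Head: number of frames minus 1
    body = payload[1:]
    if len(body) % frame_count != 0:
        raise ValueError("frame data is not a multiple of the frame count")
    frame_size = len(body) // frame_count
    return [body[i * frame_size:(i + 1) * frame_size] for i in range(frame_count)]
```

For the example above, the payload is 1 + 2400 octets long and yields three frames of 800 octets each.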

10.3.5.Laced Frames Timestamp

A Block only contains a single timestamp value. But when lacing is used, it contains more than one frame. Each frame originally has its own timestamp, or Presentation Timestamp (PTS). The timestamp of the Block applies to the first frame in the lace.

In the lace, each frame after the first one has an underdetermined timestamp. However, each of these frames MUST be contiguous, i.e., the decoded data MUST NOT contain any gap between them. If there is a gap in the stream, the frames around the gap MUST NOT be in the same Block.

Lacing is only useful for small contiguous data to save space. This is usually the case for audio tracks and not the case for video (which uses a lot of data) or subtitle tracks (which have long gaps). For audio, there is usually a fixed output sampling frequency for the whole track, so the decoder should be able to recover the timestamp of each sample knowing each output sample is contiguous with a fixed frequency. For subtitles, this is usually not the case; therefore, lacing SHOULD NOT be used.
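For audio, the recovery of per-frame timestamps can be sketched as follows. This is an informal illustration that assumes every frame in the lace decodes to the same number of samples, which depends on the codec; the function and parameter names are hypothetical.

```python
from fractions import Fraction


def laced_frame_timestamps_ns(first_frame_ns: int, frame_count: int,
                              samples_per_frame: int,
                              sampling_frequency: int) -> list:
    """Derive a timestamp for each laced frame from the Block timestamp.

    Frames are contiguous, so frame i starts i * samples_per_frame samples
    after the first frame of the lace.
    """
    return [
        first_frame_ns
        + round(Fraction(i * samples_per_frame * 1_000_000_000, sampling_frequency))
        for i in range(frame_count)
    ]
```

For three 1024-sample frames at 48000 Hz starting at 0, this gives 0, 21,333,333, and 42,666,667 nanoseconds.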

10.4.Random Access Points

Random Access Points (RAPs) are positions to which the parser can seek and start playback without decoding what came before. In Matroska, BlockGroups and SimpleBlocks can be RAPs. To seek to these elements, it is still necessary to seek to the Cluster containing them, read the Cluster Timestamp, and start playback from the BlockGroup or SimpleBlock that is a RAP.

Because a Matroska File is usually composed of multiple tracks playing at the same time -- video, audio and subtitles -- to seek properly to a RAP, each selected track must be taken into account. Usually, all audio and subtitle BlockGroups or SimpleBlocks are RAPs. They are independent of each other and can be played randomly.

On the other hand, video tracks often use references to previous and future frames for better coding efficiency. Frames with such references MUST either contain one or more ReferenceBlock Elements in their BlockGroup or MUST be marked as non-keyframe in a SimpleBlock; see Section 10.2.

  • BlockGroup with a frame that references another frame, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <BlockGroup>
        <!-- References a Block 40 Track Ticks before this one -->
        <ReferenceBlock>-40</ReferenceBlock>
        <Block/>
      </BlockGroup>
      ...
    </Cluster>

  • SimpleBlock with a frame that references another frame, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <SimpleBlock/> (octet 3 bit 0 not set)
      ...
    </Cluster>

Frames that are RAPs (i.e., frames that don't depend on other frames) MUST set the keyframe flag if they are in a SimpleBlock or their parent BlockGroup MUST NOT contain a ReferenceBlock.

  • BlockGroup with a frame that references no other frame, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <BlockGroup>
        <!-- No ReferenceBlock allowed in this BlockGroup -->
        <Block/>
      </BlockGroup>
      ...
    </Cluster>

  • SimpleBlock with a frame that references no other frame, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <SimpleBlock/> (octet 3 bit 0 set)
      ...
    </Cluster>

There may be cases where the use of BlockGroup is necessary, as the frame may need a BlockDuration, BlockAdditions, CodecState, or DiscardPadding element. For those cases, a SimpleBlock MUST NOT be used; rather, the reference information SHOULD be recovered for non-RAP frames.

  • SimpleBlock with a frame that references another frame, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <SimpleBlock/> (octet 3 bit 0 not set)
      ...
    </Cluster>

  • Same frame that references another frame put inside a BlockGroup to add BlockDuration, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <BlockGroup>
        <!-- ReferenceBlock value recovered based on the codec -->
        <ReferenceBlock>-40</ReferenceBlock>
        <BlockDuration>20</BlockDuration>
        <Block/>
      </BlockGroup>
      ...
    </Cluster>

When a frame in a BlockGroup is not a RAP, the BlockGroup MUST contain at least a ReferenceBlock. The ReferenceBlocks MUST be used in one of the following ways:

  • each reference frame listed as a ReferenceBlock;
  • some referenced frame listed as a ReferenceBlock, even if the timestamp value is accurate; or
  • one ReferenceBlock with the timestamp value "0" corresponding to a self or unknown reference.

The lack of ReferenceBlock would mean such a frame is a RAP and seeking on that frame that actually depends on other frames may create a bogus output or even crash.

  • Same frame that references another frame put inside a BlockGroup but the reference could not be recovered, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <BlockGroup>
        <!-- ReferenceBlock value not recovered from the codec -->
        <ReferenceBlock>0</ReferenceBlock>
        <BlockDuration>20</BlockDuration>
        <Block/>
      </BlockGroup>
      ...
    </Cluster>

  • BlockGroup with a frame that references two other frames, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <BlockGroup>
        <!-- References a Block 80 Track Ticks before this one -->
        <ReferenceBlock>-80</ReferenceBlock>
        <!-- References a Block 40 Track Ticks after this one -->
        <ReferenceBlock>40</ReferenceBlock>
        <Block/>
      </BlockGroup>
      ...
    </Cluster>

Intra-only video frames, such as the ones found in AV1 or VP9, can be decoded without any other frame, but they don't reset the codec state. Thus, seeking to these frames is not possible, as the next frames may need frames that are not known from this seeking point. Such intra-only frames MUST NOT be considered as keyframes, so the keyframe flag MUST NOT be set in the SimpleBlock or a ReferenceBlock MUST be used to signify the frame is not a RAP. The timestamp value of the ReferenceBlock MUST be "0", meaning it's referencing itself.

  • Intra-only frame that is not a RAP, with the EBML tree shown as XML:

    <Cluster>
      <Timestamp>123456</Timestamp>
      <BlockGroup>
        <!-- References itself to mark it should not be used as RAP -->
        <ReferenceBlock>0</ReferenceBlock>
        <Block/>
      </BlockGroup>
      ...
    </Cluster>

Because a video SimpleBlock has less information on references than a video BlockGroup, it is possible to remux a video track using BlockGroup into a SimpleBlock as long as it doesn't use any other BlockGroup features than ReferenceBlock.
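The remuxing condition above can be sketched informally as follows. The function names are hypothetical, and `child_names` stands for the names of the Child Elements found in one BlockGroup.

```python
def can_remux_to_simpleblock(child_names) -> bool:
    """True when a BlockGroup uses no feature other than ReferenceBlock."""
    return set(child_names) <= {"Block", "ReferenceBlock"}


def simpleblock_keyframe_flag(child_names) -> bool:
    """Keyframe flag for the resulting SimpleBlock.

    A BlockGroup without any ReferenceBlock holds a RAP, so the flag is set;
    any ReferenceBlock (including one with the value 0) clears it.
    """
    return "ReferenceBlock" not in child_names
```

A BlockGroup carrying a BlockDuration, for example, does not satisfy the first check and would stay a BlockGroup.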

11.Timestamps

Historically, timestamps in Matroska were mistakenly called timecodes. The Timestamp Element was called Timecode, the TimestampScale Element was called TimecodeScale, the TrackTimestampScale Element was called TrackTimecodeScale, and the ReferenceTimestamp Element was called ReferenceTimeCode.

11.1.Timestamp Ticks

All timestamp values in Matroska are expressed in multiples of a tick.They are usually stored as integers.There are three types of ticks possible: Matroska Ticks, Segment Ticks, and Track Ticks.

11.1.1.Matroska Ticks

For such elements, the timestamp value is stored directly in nanoseconds.

The elements storing values in Matroska Ticks/nanoseconds are:

11.1.2.Segment Ticks

Elements in Segment Ticks involve the use of the TimestampScale Element of the Segment to get the timestamp in nanoseconds of the element with the following formula:

timestamp in nanoseconds = element value * TimestampScale

This allows for storage of smaller integer values in the elements.

When using the default value of "1,000,000" for TimestampScale, one Segment Tick represents one millisecond.

The elements storing values in Segment Ticks are:

11.1.3.Track Ticks

Elements in Track Ticks involve the use of the TimestampScale Element of the Segment and the TrackTimestampScale Element of the Track to get the timestamp in nanoseconds of the element with the following formula:

timestamp in nanoseconds = element value * TrackTimestampScale * TimestampScale

This allows for storage of smaller integer values in the elements. The resulting floating-point values of the timestamps are still expressed in nanoseconds.

When using the default values of "1,000,000" for TimestampScale and "1.0" for TrackTimestampScale, one Track Tick represents one millisecond.

The elements storing values in Track Ticks are:

When the TrackTimestampScale is interpreted as "1.0", Track Ticks are equivalent to Segment Ticks and give an integer value in nanoseconds. This is the most common case as TrackTimestampScale is usually omitted.

A value of TrackTimestampScale other than 1.0 MAY be used to scale the timestamps more in tune with each Track sampling frequency. For historical reasons, a lot of Matroska readers don't take the TrackTimestampScale value into account; thus, using a value other than 1.0 might not work in many places.
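The two tick conversions in this section can be sketched informally as follows. The function names are hypothetical; the default arguments are the default values quoted above.

```python
def segment_ticks_to_ns(value: int, timestamp_scale: int = 1_000_000) -> int:
    """Segment Ticks to nanoseconds: element value * TimestampScale."""
    return value * timestamp_scale


def track_ticks_to_ns(value: int, track_timestamp_scale: float = 1.0,
                      timestamp_scale: int = 1_000_000) -> float:
    """Track Ticks to nanoseconds.

    element value * TrackTimestampScale * TimestampScale; the result is a
    floating-point number because TrackTimestampScale is one.
    """
    return value * track_timestamp_scale * timestamp_scale
```

With the defaults, one tick of either kind is 1,000,000 nanoseconds, i.e., one millisecond.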

11.2.Block Timestamps

A Block Element and SimpleBlock Element timestamp is the time when the decoded data of the first frame in the Block/SimpleBlock MUST be presented if the track of that Block/SimpleBlock is selected for playback. This is also known as the PTS.

The Block Element and SimpleBlock Element store their timestamps as signed integers, relative to the Cluster\Timestamp value of the Cluster they are stored in. To get the timestamp of a Block or SimpleBlock in nanoseconds, the following formula has to be used:

( Cluster\Timestamp + ( block timestamp * TrackTimestampScale ) ) * TimestampScale

The Block Element and SimpleBlock Element store their timestamps as 16-bit signed integers, allowing a range from "-32768" to "+32767" Track Ticks. Although these values can be negative, when added to the Cluster\Timestamp, the resulting frame timestamp SHOULD NOT be negative.

When a CodecDelay Element is set, its value MUST be subtracted from each Block timestamp of that track. To get the timestamp in nanoseconds of the first frame in a Block or SimpleBlock, the formula becomes:

( ( Cluster\Timestamp + ( block timestamp * TrackTimestampScale ) ) *  TimestampScale ) - CodecDelay

The resulting frame timestamp SHOULD NOT be negative.

During playback, when a frame has a negative timestamp, the content MUST be decoded by the decoder, but not played to the user.
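The formulas of this section can be combined into one informal sketch. The function and parameter names are hypothetical; the sketch assumes that CodecDelay is already expressed in nanoseconds and rounds the product to the nearest nanosecond.

```python
def block_timestamp_ns(cluster_timestamp: int, block_timestamp: int,
                       timestamp_scale: int = 1_000_000,
                       track_timestamp_scale: float = 1.0,
                       codec_delay_ns: int = 0) -> int:
    """Timestamp in nanoseconds of the first frame of a Block or SimpleBlock."""
    if not -32768 <= block_timestamp <= 32767:
        raise ValueError("block timestamp does not fit in 16 signed bits")
    ticks = cluster_timestamp + block_timestamp * track_timestamp_scale
    return round(ticks * timestamp_scale) - codec_delay_ns


def is_presented(timestamp_ns: int) -> bool:
    """Frames with a negative timestamp are decoded but not played."""
    return timestamp_ns >= 0
```

For example, a block timestamp of -40 in a Cluster whose Timestamp is 123456 gives 123,416,000,000 ns with the default scales.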

11.3.TimestampScale Rounding

The default Track Tick duration is one millisecond.

The TrackTimestampScale is a floating-point value that is usually 1.0. When it's not 1.0, the multiplied Block Timestamp is a floating-point value in nanoseconds. The Matroska Reader SHOULD use the nearest rounding value in nanoseconds to get the proper nanosecond timestamp of a Block. This allows some clever TrackTimestampScale values to have a more refined timestamp precision per frame.

12.Language Codes

Matroska versions 1 through 3 use language codes that can be either the three-letter bibliographic ISO 639-2 form [ISO639-2] (like "fre" for French), or such a language code followed by a dash and a country code for specialities in languages (like "fre-ca" for Canadian French). The ISO 639-2 Language Elements are "Language Element", "TagLanguage Element", and "ChapLanguage Element".

Starting in Matroska version 4, either [ISO639-2] or [BCP47] MAY be used, although BCP 47 is RECOMMENDED. The BCP 47 Language Elements are "LanguageBCP47 Element", "TagLanguageBCP47 Element", and "ChapLanguageBCP47 Element". If a BCP 47 Language Element and an ISO 639-2 Language Element are used within the same Parent Element, then the ISO 639-2 Language Element MUST be ignored; precedence is given to the BCP 47 Language Element.
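The precedence rule can be sketched informally as follows. The function name and the returned labels are hypothetical; each argument holds the value of the corresponding Language Element found in one Parent Element, or None when that Element is absent.

```python
from typing import Optional, Tuple


def effective_language(iso639_2: Optional[str],
                       bcp47: Optional[str]) -> Tuple[str, Optional[str]]:
    """Pick the language value a reader uses for one Parent Element."""
    if bcp47 is not None:
        # The BCP 47 Language Element takes precedence; the ISO 639-2 one is ignored.
        return ("bcp47", bcp47)
    return ("iso639-2", iso639_2)
```

When both are absent, the caller is expected to fall back to the default value of the ISO 639-2 Language Element concerned.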

13.Country Codes

Country codes are the [BCP47] two-letter region subtags without the UK exception.

14.Encryption

This Matroska specification provides no interoperable solution for securing the data container with any assurances of confidentiality, integrity, authenticity, or to provide authorization. The ContentEncryption Element (Section 5.1.4.1.31.8) and associated sub-fields (Section 5.1.4.1.31.9 to Section 5.1.4.1.31.12) are defined only for the benefit of implementers to construct their own proprietary solution or as the basis for further standardization activities. How to use these fields to secure a Matroska data container is out of scope, as are any related issues, such as key management and distribution.

A Matroska Reader that encounters containers that use the fields defined in this section MUST rely on out-of-scope guidance to decode the associated content.

Because encryption occurs within the Block Element, it is possible to manipulate encrypted streams without decrypting them. The streams could potentially be copied, deleted, cut, appended, or any number of other possible editing techniques without decryption. The data can be used without having to expose it or go through the decrypting process.

Encryption can also be layered within Matroska. This means that two completely different types of encryption can be used, requiring two separate keys to be able to decrypt a stream.

Encryption information is stored in the ContentEncodings Element under the ContentEncryption Element.

For encryption systems sharing public/private keys, the creation of the keys and the exchange of keys are not covered by this document. They have to be handled by the system using Matroska.

The algorithms described in Table 26 support different modes of operations and key sizes. The specification of these parameters is required for a complete solution, but is out of scope of this document and left to the proprietary implementations using them or subsequent profiles of this document.

The ContentEncodingScope Element gives an idea of which part of the track is encrypted, but each ContentEncAlgo Element and its sub-elements (such as AESSettingsCipherMode) define exactly how the encrypted track should be interpreted.

An example of an extension that builds upon these security-related fields in this specification is [WebM-Enc]. It uses AES-CTR, ContentEncAlgo = 5 (Section 5.1.4.1.31.9), and AESSettingsCipherMode = 1 (Section 5.1.4.1.31.12).

A Matroska Writer MUST NOT use insecure cryptographic algorithms to create new archives or streams, but a Matroska Reader MAY support these algorithms to read previously made archives or streams.

15.Image Presentation

15.1.Cropping

The PixelCrop Elements (PixelCropTop, PixelCropBottom, PixelCropRight, and PixelCropLeft) indicate when, and by how much, encoded video frames SHOULD be cropped for display. These Elements allow edges of the frame that are not intended for display to be stored, but hidden. Examples include the sprockets of a full-frame film scan or the VANC area of a digitized analog videotape. PixelCropTop and PixelCropBottom store an integer of how many rows of pixels SHOULD be cropped from the top and bottom of the image, respectively. PixelCropLeft and PixelCropRight store an integer of how many columns of pixels SHOULD be cropped from the left and right of the image, respectively.

For example, a pillar-boxed video that stores a 1440x1080 visual image within the center of a padded 1920x1080 encoded image may set both PixelCropLeft and PixelCropRight to "240" so that a Matroska Player can crop off 240 columns of pixels from the left and right of the encoded image to present the image with the pillar-boxes hidden.

Cropping has to be performed before resizing, and the display dimensions given by DisplayWidth, DisplayHeight, and DisplayUnit apply to the image that is already cropped.
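The pillar-box arithmetic above can be checked with a small Python sketch. The function and parameter names are hypothetical, and rejecting crops that leave no pixels is a choice of this example rather than a rule stated in the text.

```python
def cropped_size(pixel_width, pixel_height,
                 crop_top=0, crop_bottom=0, crop_left=0, crop_right=0):
    """Return the (width, height) left after applying the PixelCrop values."""
    width = pixel_width - crop_left - crop_right
    height = pixel_height - crop_top - crop_bottom
    if width <= 0 or height <= 0:
        # Assumption of this sketch: treat a crop that removes everything as an error.
        raise ValueError("crop values remove the whole image")
    return width, height


# 1920x1080 encoded image with 240 columns cropped on each side.
print(cropped_size(1920, 1080, crop_left=240, crop_right=240))  # (1440, 1080)
```

Any scaling to DisplayWidth and DisplayHeight would then start from the returned size, since cropping comes first.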

15.2.Rotation

The ProjectionPoseRoll Element (Section 5.1.4.1.28.46) can be used to indicate that the image from the associated video track SHOULD be rotated for presentation. For instance, the following example of the Projection Element (Section 5.1.4.1.28.41) and the ProjectionPoseRoll Element represents a video track where the image SHOULD be presented with a 90-degree counter-clockwise rotation, with the EBML tree shown as XML:

<Projection>
  <ProjectionPoseRoll>90</ProjectionPoseRoll>
</Projection>

Figure 15: Rotation Example

16.Segment Position

The Segment Position of an Element refers to the position of the first octet of the Element ID of that Element, measured in octets, from the beginning of the Element Data section of the containing Segment Element. In other words, the Segment Position of an Element is the distance in octets from the beginning of its containing Segment Element minus the size of the Element ID and Element Data Size of that Segment Element. The Segment Position of the first Child Element of the Segment Element is 0. An Element that is not stored within a Segment Element, such as the Elements of the EBML Header, does not have a Segment Position.

16.1.Segment Position Exception

Elements that are defined to store a Segment Position MAY define reserved values to indicate a special meaning.

16.2.Example of Segment Position

This table presents an example of a Segment Position by showing a hexadecimal representation of a very small Matroska file with labels to show the offsets in octets. The file contains a Segment Element with an Element ID of "0x18538067" and a MuxingApp Element with an Element ID of "0x4D80".

   0                             1                             2
   0  1  2  3  4  5  6  7  8  9  0  1  2  3  4  5  6  7  8  9  0
   +--+--+--+--+--+--+--+--+--+--+--+--+--+--+--+--+--+--+--+--+
 0 |1A|45|DF|A3|8B|42|82|88|6D|61|74|72|6F|73|6B|61|
   ^ EBML Header
 0 |                                               |18|53|80|67|
                                                   ^ Segment ID
20 |93|
   ^ Segment Data Size
20 |  |15|49|A9|66|8E|4D|80|84|69|65|74|66|57|41|84|69|65|74|66|
      ^ Start of Segment data
20 |                 |4D|80|84|69|65|74|66|57|41|84|69|65|74|66|
                     ^ MuxingApp start

In the above example, the Element ID of the Segment Element is stored at offset 16, the Element Data Size of the Segment Element is stored at offset 20, and the Element Data of the Segment Element is stored at offset 21.

The MuxingApp Element is stored at offset 26. Since the Segment Position of an Element is calculated by subtracting the position of the Element Data of the containing Segment Element from the position of that Element, the Segment Position of the MuxingApp Element in the above example is "26 - 21" or "5".
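The subtraction above can be reproduced on the example octets with a short Python sketch. It is only an illustration: the helper names are hypothetical, and locating Elements by searching for their ID octets is a shortcut that works for this tiny file but is not how a real EBML parser walks the tree.

```python
EXAMPLE_FILE = bytes.fromhex(
    "1A45DFA3 8B 4282886D6174726F736B61"  # EBML Header, offsets 0-15
    "18538067"                            # Segment Element ID, offsets 16-19
    "93"                                  # Segment Element Data Size, offset 20
    "1549A966 8E"                         # first Child Element of the Segment
    "4D80 84 69657466"                    # MuxingApp Element, starts at offset 26
    "5741 84 69657466"
)

SEGMENT_ID = bytes.fromhex("18538067")


def vint_length(first_octet: int) -> int:
    """Length in octets of an EBML variable-size integer, read from its first octet."""
    if first_octet == 0:
        raise ValueError("lengths above 8 octets are not handled in this sketch")
    return 8 - first_octet.bit_length() + 1


def segment_position(data: bytes, element_id: bytes) -> int:
    """Offset of element_id relative to the start of the Segment's Element Data."""
    segment_start = data.index(SEGMENT_ID)                    # 16 in the example
    size_start = segment_start + len(SEGMENT_ID)              # 20
    data_start = size_start + vint_length(data[size_start])   # 21
    element_start = data.index(element_id, data_start)        # 26 for MuxingApp
    return element_start - data_start


print(segment_position(EXAMPLE_FILE, bytes.fromhex("4D80")))      # 5
print(segment_position(EXAMPLE_FILE, bytes.fromhex("1549A966")))  # 0
```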

17.Linked Segments

Matroska provides several methods to link two or more Segment Elements together to create a Linked Segment. A Linked Segment is a set of multiple Segments linked together into a single presentation by using Hard Linking or Medium Linking.

All Segments within a Linked Segment MUST have a SegmentUUID.

All Segments within a Linked Segment SHOULD be stored within the same directory or be quickly accessible based on their SegmentUUID in order to have a seamless transition between Segments.

All Segments within a Linked Segment MAY set a SegmentFamily with a common value to make it easier for a Matroska Player to know which Segments are meant to be played together.

The SegmentFilename, PrevFilename, and NextFilename elements MAY also give hints on the original filenames that were used when the Segment links were created in case some SegmentUUIDs are damaged.

17.1.Hard Linking

Hard Linking, also called splitting, is the process of creating a Linked Segment by linking multiple Segment Elements using the NextUUID and PrevUUID Elements.

All Segments within a Hard-Linked Segment MUST use the same Tracks list and TimestampScale.

Within a Linked Segment, the timestamps of Block and SimpleBlock MUST consecutively follow the timestamps of Block and SimpleBlock from the previous Segment in linking order.

With Hard Linking, the chapters of any Segment within the Linked Segment MUST only reference the current Segment. The NextUUID and PrevUUID reference the respective SegmentUUID values of the next and previous Segments.

The first Segment of a Linked Segment MUST NOT have a PrevUUID Element. The last Segment of a Linked Segment MUST NOT have a NextUUID Element.

For each node of the chain of Segments of a Linked Segment, at least one Segment MUST reference the other Segment within the chain.

In a chain of Segments of a Linked Segment, the NextUUID always takes precedence over the PrevUUID. If SegmentA has a NextUUID to SegmentB and SegmentB has a PrevUUID to SegmentC, the link to use is NextUUID between SegmentA and SegmentB; SegmentC is not part of the Linked Segment.

If SegmentB has a PrevUUID to SegmentA, but SegmentA has no NextUUID, then the Matroska Player MAY consider these two Segments linked as SegmentA followed by SegmentB.

As an example, three Segments can be Hard Linked as a Linked Segment through cross-referencing each other with SegmentUUID, PrevUUID, and NextUUID as shown in Table 41:

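Table 41: Usual Hard-Linking UIDs

+============+==================+==================+==================+
| file name  | SegmentUUID      | PrevUUID         | NextUUID         |
+============+==================+==================+==================+
| start.mkv  | 71000c23cd310998 | Invalid          | a77b3598941cb803 |
|            | 53fbc94dd984a5dd |                  | eac0fcdafe44fac9 |
+------------+------------------+------------------+------------------+
| middle.mkv | a77b3598941cb803 | 71000c23cd310998 | 6c92285fa6d3e827 |
|            | eac0fcdafe44fac9 | 53fbc94dd984a5dd | b198d120ea3ac674 |
+------------+------------------+------------------+------------------+
| end.mkv    | 6c92285fa6d3e827 | a77b3598941cb803 | Invalid          |
|            | b198d120ea3ac674 | eac0fcdafe44fac9 |                  |
+------------+------------------+------------------+------------------+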

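An example where only the NextUUID Element is used:

Table 42: Hard Linking without PrevUUID

+============+==================+==================+==================+
| file name  | SegmentUUID      | PrevUUID         | NextUUID         |
+============+==================+==================+==================+
| start.mkv  | 71000c23cd310998 | Invalid          | a77b3598941cb803 |
|            | 53fbc94dd984a5dd |                  | eac0fcdafe44fac9 |
+------------+------------------+------------------+------------------+
| middle.mkv | a77b3598941cb803 | n/a              | 6c92285fa6d3e827 |
|            | eac0fcdafe44fac9 |                  | b198d120ea3ac674 |
+------------+------------------+------------------+------------------+
| end.mkv    | 6c92285fa6d3e827 | n/a              | Invalid          |
|            | b198d120ea3ac674 |                  |                  |
+------------+------------------+------------------+------------------+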

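An example where only the PrevUUID Element is used:

Table 43: Hard Linking without NextUUID

+============+==================+==================+==================+
| file name  | SegmentUUID      | PrevUUID         | NextUUID         |
+============+==================+==================+==================+
| start.mkv  | 71000c23cd310998 | Invalid          | n/a              |
|            | 53fbc94dd984a5dd |                  |                  |
+------------+------------------+------------------+------------------+
| middle.mkv | a77b3598941cb803 | 71000c23cd310998 | n/a              |
|            | eac0fcdafe44fac9 | 53fbc94dd984a5dd |                  |
+------------+------------------+------------------+------------------+
| end.mkv    | 6c92285fa6d3e827 | a77b3598941cb803 | Invalid          |
|            | b198d120ea3ac674 | eac0fcdafe44fac9 |                  |
+------------+------------------+------------------+------------------+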

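An example where only middle.mkv uses the PrevUUID and NextUUID Elements:

Table 44: Hard Linking with Mixed UID Links

+============+==================+==================+==================+
| file name  | SegmentUUID      | PrevUUID         | NextUUID         |
+============+==================+==================+==================+
| start.mkv  | 71000c23cd310998 | Invalid          | n/a              |
|            | 53fbc94dd984a5dd |                  |                  |
+------------+------------------+------------------+------------------+
| middle.mkv | a77b3598941cb803 | 71000c23cd310998 | 6c92285fa6d3e827 |
|            | eac0fcdafe44fac9 | 53fbc94dd984a5dd | b198d120ea3ac674 |
+------------+------------------+------------------+------------------+
| end.mkv    | 6c92285fa6d3e827 | n/a              | Invalid          |
|            | b198d120ea3ac674 |                  |                  |
+------------+------------------+------------------+------------------+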
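One possible way for a player to turn these cross-references into a playback order is sketched below in Python. The Segment record and the hard_link_order function are hypothetical names, and the two-pass strategy (NextUUID links first, PrevUUID links only where no NextUUID link already applies) is one reading of the precedence rule, not a mandated algorithm.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Segment:
    name: str
    uuid: str
    prev_uuid: Optional[str] = None
    next_uuid: Optional[str] = None


def hard_link_order(segments: List[Segment], start_uuid: str) -> List[str]:
    """Return file names in playback order for the chain containing start_uuid."""
    by_uuid = {s.uuid: s for s in segments}
    successor: Dict[str, str] = {}
    predecessor: Dict[str, str] = {}
    # Pass 1: NextUUID links take precedence.
    for s in segments:
        nxt = s.next_uuid
        if nxt in by_uuid and nxt not in predecessor:
            successor[s.uuid] = nxt
            predecessor[nxt] = s.uuid
    # Pass 2: PrevUUID links only fill the gaps left by pass 1.
    for s in segments:
        prv = s.prev_uuid
        if prv in by_uuid and s.uuid not in predecessor and prv not in successor:
            successor[prv] = s.uuid
            predecessor[s.uuid] = prv
    # Rewind to the head of the chain (guarding against loops), then walk forward.
    head, seen = start_uuid, {start_uuid}
    while head in predecessor and predecessor[head] not in seen:
        head = predecessor[head]
        seen.add(head)
    order: List[str] = []
    current: Optional[str] = head
    while current is not None and current not in order:
        order.append(current)
        current = successor.get(current)
    return [by_uuid[u].name for u in order]


START = "71000c23cd31099853fbc94dd984a5dd"
MIDDLE = "a77b3598941cb803eac0fcdafe44fac9"
END = "6c92285fa6d3e827b198d120ea3ac674"

# Table 44: only middle.mkv carries PrevUUID and NextUUID.
mixed = [
    Segment("start.mkv", START),
    Segment("middle.mkv", MIDDLE, prev_uuid=START, next_uuid=END),
    Segment("end.mkv", END),
]
print(hard_link_order(mixed, MIDDLE))  # ['start.mkv', 'middle.mkv', 'end.mkv']
```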

17.2.Medium Linking

Medium Linking creates relationships between Segments using Ordered Chapters (Section 20.1.3) and the ChapterSegmentUUID Element. A Chapter Edition with Ordered Chapters MAY contain Chapter elements that reference timestamp ranges from other Segments. The Segment referenced by the Ordered Chapter via the ChapterSegmentUUID Element SHOULD be played as part of a Linked Segment.

The timestamps of Segment content referenced by Ordered Chapters MUST be adjusted according to the cumulative duration of the previous Ordered Chapters.
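The cumulative adjustment can be pictured with the following Python sketch. The function names and the representation of each Ordered Chapter as a (ChapterTimeStart, ChapterTimeEnd) pair in nanoseconds are assumptions of this example.

```python
from typing import List, Tuple

Chapter = Tuple[int, int]  # (ChapterTimeStart, ChapterTimeEnd) in nanoseconds


def virtual_starts(chapters: List[Chapter]) -> List[int]:
    """Start of each Ordered Chapter on the presentation timeline."""
    starts, elapsed = [], 0
    for start, end in chapters:
        starts.append(elapsed)
        elapsed += end - start  # duration of this chapter
    return starts


def to_presentation_time(segment_ts: int, index: int, chapters: List[Chapter]) -> int:
    """Map a timestamp inside chapter `index` onto the presentation timeline."""
    start, _ = chapters[index]
    return virtual_starts(chapters)[index] + (segment_ts - start)


# A 5-second chapter followed by a chapter covering 20 s to 30 s of its Segment.
chapters = [(0, 5_000_000_000), (20_000_000_000, 30_000_000_000)]
print(virtual_starts(chapters))                               # [0, 5000000000]
print(to_presentation_time(22_000_000_000, 1, chapters))      # 7000000000
```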

As an example, a file named intro.mkv could have a SegmentUUID of "0xb16a58609fc7e60653a60c984fc11ead". Another file called program.mkv could use a Chapter Edition that contains two Ordered Chapters. The first chapter references the Segment of intro.mkv with the use of a ChapterSegmentUUID, ChapterSegmentEditionUID, ChapterTimeStart, and an optional ChapterTimeEnd element. The second chapter references content within the Segment of program.mkv. A Matroska Player SHOULD recognize the Linked Segment created by the use of ChapterSegmentUUID in an enabled Edition and present the referenced content of the two Segments as a single presentation.

The ChapterSegmentUUID represents the Segment that holds the content to play in place of the Linked Chapter. The ChapterSegmentUUID MUST NOT be the SegmentUUID of its own Segment.

There are two ways to use a chapter link:

  • Linked-Duration linking

  • Linked-Edition linking

17.2.1.Linked Duration

A Matroska Player MUST play the content of the linked Segment from the ChapterTimeStart until the ChapterTimeEnd timestamp in place of the Linked Chapter.

ChapterTimeStart and ChapterTimeEnd represent timestamps in the Linked Segment matching the value of ChapterSegmentUUID. Their values MUST be in the range of the linked Segment duration.

The ChapterTimeEnd value MUST be set when using Linked-Duration chapter linking. ChapterSegmentEditionUID MUST NOT be set.

17.2.2.Linked Edition

A Matroska Player MUST play the whole Linked Edition of the linked Segment in place of the Linked Chapter.

ChapterSegmentEditionUID represents a valid Edition from the Linked Segment matching the value of ChapterSegmentUUID.

When using Linked-Edition chapter linking, ChapterTimeEnd is OPTIONAL.
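A reader could tell the two kinds of chapter link apart with a check along the following lines. This Python sketch uses a hypothetical function whose arguments stand for the values, if present, of the corresponding Elements of one Chapter; raising exceptions for the rule violations is a choice of this example.

```python
from typing import Optional


def chapter_link_kind(chapter_segment_uuid: Optional[str],
                      own_segment_uuid: str,
                      chapter_time_end: Optional[int],
                      chapter_segment_edition_uid: Optional[int]) -> str:
    """Classify a Chapter as not linked, Linked-Edition, or Linked-Duration."""
    if chapter_segment_uuid is None:
        return "not linked"
    if chapter_segment_uuid == own_segment_uuid:
        raise ValueError("ChapterSegmentUUID must not be the SegmentUUID of its own Segment")
    if chapter_segment_edition_uid is not None:
        return "linked-edition"  # ChapterTimeEnd is optional in this case
    if chapter_time_end is None:
        raise ValueError("Linked-Duration linking requires ChapterTimeEnd")
    return "linked-duration"


print(chapter_link_kind("b16a", "0f0f", 5_000_000_000, None))  # linked-duration
print(chapter_link_kind("b16a", "0f0f", None, 42))             # linked-edition
```

The short UUID strings in the usage lines are placeholders, not real SegmentUUID values.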

18.Track Flags

18.1.Default Flag

The "Default Track flag" is a hint for a Matroska Player indicating that a given track SHOULD be eligible to be automatically selected as the default track for a given language. If no tracks in a given language have the Default Track flag set, then all tracks in that language are eligible for automatic selection. This can be used to indicate that a track provides "regular service" that is suitable for users with default settings as opposed to specialized services, such as commentary, hearing-impaired captions, or descriptive audio.

The Matroska Player MAY override the Default Track flag for any reason, including user preferences to prefer tracks providing accessibility services.
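The eligibility rule for a single language can be written as a small Python sketch. Representing tracks as dictionaries with "lang" and "default" keys is an assumption of this example, as is the function name.

```python
def eligible_for_default(tracks, language):
    """Tracks in `language` that may be auto-selected under the Default Track flag."""
    same_lang = [t for t in tracks if t["lang"] == language]
    flagged = [t for t in same_lang if t["default"]]
    # With no flagged track in this language, every track in it stays eligible.
    return flagged if flagged else same_lang


tracks = [
    {"no": 2, "lang": "eng", "default": True},
    {"no": 3, "lang": "eng", "default": False},
    {"no": 4, "lang": "fra", "default": False},
    {"no": 5, "lang": "fra", "default": False},
]
print([t["no"] for t in eligible_for_default(tracks, "eng")])  # [2]
print([t["no"] for t in eligible_for_default(tracks, "fra")])  # [4, 5]
```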

18.2.Forced Flag

The "Forced flag" tells the Matroska Player that it SHOULD display this subtitle track, even if user preferences usually would not call for any subtitles to be displayed alongside the audio track that is currently selected. This can be used to indicate that a track contains translations of onscreen text or dialogue spoken in a different language than the track's primary language.

18.3.Hearing-Impaired Flag

The "Hearing-impaired flag" tells the Matroska Player that it SHOULD prefer this track when selecting a default track for a hearing-impaired user and that it MAY prefer to select a different track when selecting a default track for a user that is not hearing-impaired.

18.4.Visually Impaired Flag

The "Visually Impaired flag" tells the Matroska Player that it SHOULD prefer this track when selecting a default track for a visually impaired user and that it MAY prefer to select a different track when selecting a default track for a user that is not visually impaired.

18.5.Descriptions Flag

The "Descriptions flag" tells the Matroska Player that this track is suitable to play via a text-to-speech system for a visually impaired user and that it SHOULD NOT automatically select this track when selecting a default track for a user that is not visually impaired.

18.6.Original Flag

The "Original flag" tells the Matroska Player that this track is in the original language and that it SHOULD prefer the original language if it's configured to prefer original-language tracks of this track's type.

18.7.Commentary Flag

The "Commentary flag" tells the Matroska Player that this track contains commentary on the content.

18.8.Track Operation

TrackOperation allows for the combination of multiple tracks to make a virtual one. It uses two separate systems to combine tracks: one to create a 3D "composition" (left / right / background planes) and one to simply join two tracks together to make a single track.

A track created with TrackOperation is a proper track with a UID and all its flags. However, the codec ID is meaningless because each "sub" track needs to be decoded by its own decoder before the "operation" is applied. The Cues Elements corresponding to such a virtual track SHOULD be the union of the Cues Elements for each of the tracks it's composed of (when the Cues are defined per track).

In the case of TrackJoinBlocks, the Block Elements (from BlockGroup and SimpleBlock) of all the tracks SHOULD be used as if they were defined for this new virtual Track. When two Block Elements have overlapping start or end timestamps, it's up to the underlying system to either drop some of these frames or render them the way they overlap. This situation SHOULD be avoided when creating such tracks, because the end result may differ between platforms.

18.9.Overlay Track

An overlay track SHOULD be rendered in the same channel as the track it is linked to. When content is found in such a track, it SHOULD be played on the rendering channel instead of the original track.

18.10.Multi-planar and 3D Videos

There are two different ways to compress 3D videos: store each eye in a separate track or store both eyes combined in a single track (which is more efficient compression-wise). Matroska supports both ways.

For the single track variant, there is the StereoMode Element, which defines how planes are assembled in the track (mono or left-right combined). Odd values of StereoMode mean the left plane comes first for more convenient reading. The pixel count of the track (PixelWidth/PixelHeight) is the raw amount of pixels, e.g., 3840x1080 for full HD side by side, and the DisplayWidth/DisplayHeight in pixels is the amount of pixels for one plane (1920x1080 for that full HD stream). Old stereo 3D movies were displayed using anaglyph (cyan and red colors separated). For compatibility with such movies, there is a value of the StereoMode that corresponds to anaglyph.

There is also a "packed" mode (values 13 and 14) that consists of packing two frames together in a Block that uses lacing. The first frame is the left eye and the other frame is the right eye (or vice versa). The frames SHOULD be decoded in that order and are possibly dependent on each other (P and B frames).

For separate tracks, Matroska needs to define exactly which track does what. TrackOperation with TrackCombinePlanes does that. For more details, see Section 18.8 to view how TrackOperation works.

The 3D support is still in its infancy and may evolve to support more features.

The StereoMode used to be part of Matroska v2, but it didn't meet the requirement for multiple tracks. There was also a bug in libmatroska prior to 0.9.0 that would save/read it as 0x53B9 instead of 0x53B8; see OldStereoMode (Section 5.1.4.1.28.5). Matroska Readers MAY support these legacy files by checking for Matroska v2 or 0x53B9. The older values of StereoMode were 0: mono, 1: right eye, 2: left eye, and 3: both eyes; these are the only values that can be found in OldStereoMode. They are not compatible with the StereoMode values found in Matroska v3 and above.

19.Default Track Selection

This section provides some example sets of Tracks and hypothetical user settings, along with indications of which Tracks a similarly configured Matroska Player SHOULD automatically select for playback by default in such a situation. A player MAY provide additional settings with more detailed controls for more nuanced scenarios. These examples are provided as guidelines to illustrate the intended usages of the various supported Track flags and their expected behaviors.

Track names are shown in English for illustrative purposes; actual files may have titles in the language of each track or provide titles in multiple languages.

19.1.Audio Selection

Example track set:

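Table 45: Audio Tracks for Default Selection

+=====+=======+======+========+==========+=========+===================+=======================+
| No. | Type  | Lang | Layout | Original | Default | Other Flags       | Name                  |
+=====+=======+======+========+==========+=========+===================+=======================+
| 1   | Video | und  | N/A    | N/A      | N/A     | None              |                       |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+
| 2   | Audio | eng  | 5.1    | 1        | 1       | None              |                       |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+
| 3   | Audio | eng  | 2.0    | 1        | 1       | None              |                       |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+
| 4   | Audio | eng  | 2.0    | 1        | 0       | Visually Impaired | Descriptive audio     |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+
| 5   | Audio | esp  | 5.1    | 0        | 1       | None              |                       |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+
| 6   | Audio | esp  | 2.0    | 0        | 0       | Visually Impaired | Descriptive audio     |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+
| 7   | Audio | eng  | 2.0    | 1        | 0       | Commentary        | Director's Commentary |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+
| 8   | Audio | eng  | 2.0    | 1        | 0       | None              | Karaoke               |
+-----+-------+------+--------+----------+---------+-------------------+-----------------------+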

The table above shows a file with 7 audio tracks, 5 of which are in English and 2 of which are in Spanish.

The English tracks all have the Original flag indicating that English is the original content language.

Generally, the player will first consider the track languages. If the player has an option to prefer original-language audio and the user has enabled it, then it should prefer one of the tracks that have the Original flag. If configured to specifically prefer audio tracks in English or Spanish, the player should select one of the tracks in the corresponding language. The player may also wish to prefer a track with the Original flag if no tracks matching any of the user's explicitly preferred languages are available.

Two of the tracks have the Visually Impaired flag. If the player has been configured to prefer such tracks, it should select one; otherwise, it should avoid them if possible.

If selecting an English track, when other settings have left multiple possible options, it may be useful to exclude the tracks that lack the Default flag. Here, one provides descriptive service for the visually impaired (which has its own flag and may be automatically selected by user configuration, but is unsuitable for users with default-configured players), one is a commentary track (which has its own flag and the player may or may not have specialized handling for), and the last option contains karaoke versions of the music that plays during the film (which is an unusual specialized audio service that Matroska has no built-in support for indicating, so it's indicated in the track name instead). By not setting the Default flag on these specialized tracks, the file's author hints that they should not be automatically selected by a default-configured player.

Having narrowed its choices down, the example player now may have to select between tracks 2 and 3. The only difference between these tracks is their channel layouts: track 2 is 5.1 surround, while track 3 is stereo. If the player is aware that the output device is a pair of headphones or stereo speakers, it may wish to prefer the stereo mix automatically. On the other hand, if it knows that the device is a surround system, it may wish to prefer the surround mix.

If the player finishes analyzing all of the available audio tracks and finds that multiple seem equally and maximally preferable, it SHOULD default to the first of the group.
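The narrowing process described in this section could be sketched as follows in Python, using the audio rows of Table 45. The tuple layout, the helper names, and the exact order of the filters are assumptions of this example; a real player is free to weigh these hints differently.

```python
# Audio rows of Table 45: (number, lang, layout, original, default, other flag)
TRACKS = [
    (2, "eng", "5.1", 1, 1, None),
    (3, "eng", "2.0", 1, 1, None),
    (4, "eng", "2.0", 1, 0, "visually-impaired"),
    (5, "esp", "5.1", 0, 1, None),
    (6, "esp", "2.0", 0, 0, "visually-impaired"),
    (7, "eng", "2.0", 1, 0, "commentary"),
    (8, "eng", "2.0", 1, 0, None),
]


def narrow(candidates, predicate):
    """Keep the candidates matching predicate, unless that would leave none."""
    kept = [t for t in candidates if predicate(t)]
    return kept or candidates


def pick_audio(tracks, language, want_visually_impaired=False, output_layout=None):
    """Return the number of the audio track this sketch would select by default."""
    matching = [t for t in tracks if t[1] == language]
    # Without a track in the preferred language, fall back to Original-flag tracks.
    pool = matching or narrow(tracks, lambda t: t[3] == 1)
    pool = narrow(pool, lambda t: (t[5] == "visually-impaired") == want_visually_impaired)
    pool = narrow(pool, lambda t: t[4] == 1)  # Default flag
    if output_layout is not None:
        pool = narrow(pool, lambda t: t[2] == output_layout)
    return pool[0][0]  # first of the remaining, equally preferable group


print(pick_audio(TRACKS, "eng"))                               # 2
print(pick_audio(TRACKS, "eng", output_layout="2.0"))          # 3
print(pick_audio(TRACKS, "eng", want_visually_impaired=True))  # 4
```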

19.2.Subtitle Selection

Example track set:

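Table 46: Subtitle Tracks for Default Selection

+=====+===========+======+==========+=========+========+==================+==================+
| No. | Type      | Lang | Original | Default | Forced | Other flags      | Name             |
+=====+===========+======+==========+=========+========+==================+==================+
| 1   | Video     | und  | N/A      | N/A     | N/A    | None             |                  |
+-----+-----------+------+----------+---------+--------+------------------+------------------+
| 2   | Audio     | fra  | 1        | 1       | N/A    | None             |                  |
+-----+-----------+------+----------+---------+--------+------------------+------------------+
| 3   | Audio     | por  | 0        | 1       | N/A    | None             |                  |
+-----+-----------+------+----------+---------+--------+------------------+------------------+
| 4   | Subtitles | fra  | 1        | 1       | 0      | None             |                  |
+-----+-----------+------+----------+---------+--------+------------------+------------------+
| 5   | Subtitles | fra  | 1        | 0       | 0      | Hearing-impaired | Captions for the |
|     |           |      |          |         |        |                  | hearing-impaired |
+-----+-----------+------+----------+---------+--------+------------------+------------------+
| 6   | Subtitles | por  | 0        | 1       | 0      | None             |                  |
+-----+-----------+------+----------+---------+--------+------------------+------------------+
| 7   | Subtitles | por  | 0        | 0       | 1      | None             | Signs            |
+-----+-----------+------+----------+---------+--------+------------------+------------------+
| 8   | Subtitles | por  | 0        | 0       | 0      | Hearing-impaired | SDH              |
+-----+-----------+------+----------+---------+--------+------------------+------------------+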

The table above shows 2 audio tracks and 5 subtitle tracks. As we can see, French is the original language.

We'll start by discussing the case where the user prefers French (or original-language) audio (or has explicitly selected the French audio track), and also prefers French subtitles.

In this case, if the player isn't configured to display captions when the audio matches their preferred subtitle languages, the player doesn't need to select a subtitle track at all.

If the user has indicated that they want captions to be displayed, the selection simply comes down to whether hearing-impaired subtitles are preferred.

The situation for a user who prefers Portuguese subtitles starts out somewhat analogous. If they select the original French audio (either by explicit audio language preference, preference for original-language tracks, or by explicitly selecting that track), then the selection once again comes down to the hearing-impaired preference.

However, the case where the Portuguese audio track is selected has an important catch: a Forced track in Portuguese is present. This may contain translations of onscreen text from the video track or of portions of the audio that are not translated (music, for instance). This means that even if the user's preferences wouldn't normally call for captions here, the Forced track should be selected nonetheless rather than selecting no track at all. On the other hand, if the user's preferences do call for captions, the non-Forced tracks should be preferred, as the Forced track will not contain captioning for the dialogue.
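The subtitle cases walked through above can be condensed into a Python sketch over the subtitle rows of Table 46. The tuple layout, the function name, and its boolean settings are assumptions of this example and cover only the scenarios discussed in this section.

```python
# Subtitle rows of Table 46: (number, lang, default, forced, hearing_impaired)
SUBTITLES = [
    (4, "fra", 1, 0, False),
    (5, "fra", 0, 0, True),
    (6, "por", 1, 0, False),
    (7, "por", 0, 1, False),
    (8, "por", 0, 0, True),
]


def pick_subtitle(audio_lang, subtitle_lang, want_captions=False, hearing_impaired=False):
    """Return the selected subtitle track number, or None for no subtitles."""
    same = [t for t in SUBTITLES if t[1] == subtitle_lang]
    if audio_lang == subtitle_lang and not want_captions:
        # Audio already matches: only a Forced track, if any, needs to be shown.
        forced = [t for t in same if t[3] == 1]
        return forced[0][0] if forced else None
    # Otherwise prefer non-Forced tracks matching the hearing-impaired preference.
    full = [t for t in same if t[3] == 0 and t[4] == hearing_impaired]
    return full[0][0] if full else None


print(pick_subtitle("fra", "fra"))                      # None
print(pick_subtitle("por", "por"))                      # 7
print(pick_subtitle("por", "por", want_captions=True))  # 6
```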

20.Chapters

The Matroska Chapters system can have multiple Editions, and each Edition can consist of Simple Chapters where a chapter start time is used as a marker in the timeline only. An Edition can be more complex with Ordered Chapters where a chapter end timestamp is additionally used or much more complex with Linked Chapters. The Matroska Chapters system can also have a menu structure borrowed from the DVD-menu system [DVD-Video] or have its own built-in Matroska menu structure.

20.1.EditionEntry

The EditionEntry is also called an Edition. An Edition contains a set of Edition flags and MUST contain at least one ChapterAtom Element. Chapters are always inside an Edition (or a Chapter itself is part of an Edition). Multiple Editions are allowed. Some of these Editions MAY be ordered while others are not.

20.1.1.EditionFlagDefault

Only one Edition SHOULD have an EditionFlagDefault flag set to true.

20.1.2.Default Edition

The Default Edition is the Edition that a Matroska Player SHOULD use for playback by default.

The first Edition with the EditionFlagDefault flag set to true is the Default Edition.

When all EditionFlagDefault flags are set to false, then the first Edition is the Default Edition.

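Table 47: Default Edition, All Default

+===========+=============+=================+
| Edition   | FlagDefault | Default Edition |
+===========+=============+=================+
| Edition 1 | true        | X               |
+-----------+-------------+-----------------+
| Edition 2 | true        |                 |
+-----------+-------------+-----------------+
| Edition 3 | true        |                 |
+-----------+-------------+-----------------+

Table 48: Default Edition, No Default

+===========+=============+=================+
| Edition   | FlagDefault | Default Edition |
+===========+=============+=================+
| Edition 1 | false       | X               |
+-----------+-------------+-----------------+
| Edition 2 | false       |                 |
+-----------+-------------+-----------------+
| Edition 3 | false       |                 |
+-----------+-------------+-----------------+

Table 49: Default Edition, With Default

+===========+=============+=================+
| Edition   | FlagDefault | Default Edition |
+===========+=============+=================+
| Edition 1 | false       |                 |
+-----------+-------------+-----------------+
| Edition 2 | true        | X               |
+-----------+-------------+-----------------+
| Edition 3 | false       |                 |
+-----------+-------------+-----------------+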
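The rule illustrated by these three tables fits in a few lines of Python. The function name and the use of a list of booleans for the EditionFlagDefault values are assumptions of this sketch.

```python
def default_edition_index(flag_defaults):
    """Index of the Default Edition given each Edition's EditionFlagDefault value."""
    if not flag_defaults:
        raise ValueError("no Edition to choose from")
    for index, flag in enumerate(flag_defaults):
        if flag:
            return index  # first Edition with the flag set to true
    return 0  # all flags false: the first Edition is the Default Edition


print(default_edition_index([True, True, True]))     # 0 (Table 47)
print(default_edition_index([False, False, False]))  # 0 (Table 48)
print(default_edition_index([False, True, False]))   # 1 (Table 49)
```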

20.1.3.EditionFlagOrdered

The EditionFlagOrdered Flag is a significant feature, as it enables an Edition of Ordered Chapters that define and arrange a virtual timeline rather than simply labeling points within the timeline. For example, with Editions of Ordered Chapters, a single Matroska file can present multiple edits of a film without duplicating content. Alternatively, if a videotape is digitized in full, one Ordered Edition could present the full content (including colorbars, countdown, slate, a feature presentation, and black frames) while another Edition of Ordered Chapters can use Chapters that only mark the intended presentation with the colorbars and other ancillary visual information excluded. If an Edition of Ordered Chapters is enabled, then the Matroska Player MUST play those Chapters in their stored order from the timestamp marked in the ChapterTimeStart Element to the timestamp marked in the ChapterTimeEnd Element.

If the EditionFlagOrdered Flag evaluates to "0", Simple Chapters are used and only the ChapterTimeStart of a Chapter is used as a chapter mark to jump to the predefined point in the timeline. With Simple Chapters, a Matroska Player MUST ignore certain Chapter Elements. In that case, these elements are informational only.

The following list shows the different Chapter elements only found in Ordered Chapters.

  • ChapterAtom/ChapterSegmentUUID
  • ChapterAtom/ChapterSegmentEditionUID
  • ChapterAtom/ChapterTrack
  • ChapterAtom/ChapProcess
  • Info/ChapterTranslate
  • TrackEntry/TrackTranslate

Furthermore, there are other EBML Elements that could be used if the EditionFlagOrdered evaluates to "1".

20.1.3.1.Ordered-Edition and Matroska Segment Linking
Hard Linking:
Ordered Chapters supersede Hard Linking.
Medium Linking:
Ordered Chapters are used in a normal way and can be combined with the ChapterSegmentUUID element, which establishes a link to another Segment.

See Section 17 on the Linked Segments for more information about Hard Linking and Medium Linking.

20.2.ChapterAtom

The ChapterAtom is also called a Chapter.

20.2.1.ChapterTimeStart

ChapterTimeStart is the timestamp of the start of the Chapter with nanosecond accuracy and is not scaled by TimestampScale. For Simple Chapters, this is the position of the chapter markers in the timeline.

20.2.2.ChapterTimeEnd

ChapterTimeEnd is the timestamp of the end of the Chapter with nanosecond accuracy and is not scaled by TimestampScale. The timestamp defined by the ChapterTimeEnd is not part of the Chapter. A Matroska Player calculates the duration of this Chapter by using the difference between the ChapterTimeEnd and ChapterTimeStart. The end timestamp MUST be greater than or equal to the start timestamp.

When the ChapterTimeEnd timestamp is equal to the ChapterTimeStart timestamp, the timestamp is included in the Chapter. It can be useful to put markers in a file or add chapter commands with ordered chapter commands without having to play anything; see Section 5.1.7.1.4.14.

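Table 50: ChapterTimeEnd Usage Possibilities

+===========+=================+===============+=======================+
| Chapter   | Start timestamp | End timestamp | Duration              |
+===========+=================+===============+=======================+
| Chapter 1 | 0               | 1000000000    | 1000000000            |
+-----------+-----------------+---------------+-----------------------+
| Chapter 2 | 1000000000      | 5000000000    | 4000000000            |
+-----------+-----------------+---------------+-----------------------+
| Chapter 3 | 6000000000      | 6000000000    | 0                     |
+-----------+-----------------+---------------+-----------------------+
| Chapter 4 | 9000000000      | 8000000000    | Invalid (-1000000000) |
+-----------+-----------------+---------------+-----------------------+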
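The durations in Table 50 follow from a simple subtraction, shown here as a Python sketch. The function name is hypothetical, and signalling the invalid case with an exception is a choice of this example.

```python
def chapter_duration(chapter_time_start: int, chapter_time_end: int) -> int:
    """Duration in nanoseconds; the end timestamp must not precede the start."""
    if chapter_time_end < chapter_time_start:
        raise ValueError("ChapterTimeEnd is smaller than ChapterTimeStart")
    return chapter_time_end - chapter_time_start


print(chapter_duration(0, 1_000_000_000))              # 1000000000
print(chapter_duration(1_000_000_000, 5_000_000_000))  # 4000000000
print(chapter_duration(6_000_000_000, 6_000_000_000))  # 0
```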

20.2.3.Nested Chapters

A ChapterAtom element can contain other ChapterAtom elements. That element is a Parent Chapter and the ChapterAtom elements it contains are Nested Chapters.

Nested Chapters can be useful to tag small parts of a Segment that already has tags or add Chapter Codec commands on smaller parts of a Segment that already has Chapter Codec commands.

The ChapterTimeStart of a Nested Chapter MUST be greater than or equal to the ChapterTimeStart of its Parent Chapter.

If the Parent Chapter of a Nested Chapter has a ChapterTimeEnd, the ChapterTimeStart of that Nested Chapter MUST be smaller than or equal to the ChapterTimeEnd of the Parent Chapter.
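These two constraints on a Nested Chapter's start time can be checked as in the following Python sketch. The function name is hypothetical; a missing ChapterTimeEnd on the Parent Chapter is represented by None.

```python
from typing import Optional


def nested_start_is_valid(parent_start: int,
                          parent_end: Optional[int],
                          nested_start: int) -> bool:
    """Check a Nested Chapter's ChapterTimeStart against its Parent Chapter."""
    if nested_start < parent_start:
        return False
    if parent_end is not None and nested_start > parent_end:
        return False
    return True


print(nested_start_is_valid(0, 10_000_000_000, 4_000_000_000))   # True
print(nested_start_is_valid(0, 10_000_000_000, 12_000_000_000))  # False
print(nested_start_is_valid(0, None, 12_000_000_000))            # True
```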

20.2.4.Nested Chapters in Ordered Chapters

The ChapterTimeEnd of the lowest level of Nested Chapters MUST be set for Ordered Chapters.

When used with Ordered Chapters, the ChapterTimeEnd value of a Parent Chapter is not used for playback, as the proper playback sections are described in its Nested Chapters. The ChapterTimeEnd SHOULD NOT be set in Parent Chapters and MUST be ignored for playback.

20.2.5.ChapterFlagHidden

The ChapterFlagHidden flag of each Chapter works independently of its Parent Chapters. A Nested Chapter with a ChapterFlagHidden flag that evaluates to "0" remains visible in the user interface even if the ChapterFlagHidden flag of its Parent Chapter is set to "1".

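Table 51: ChapterFlagHidden Nested Visibility

+==========================+===================+=========+
| Chapter + Nested Chapter | ChapterFlagHidden | visible |
+==========================+===================+=========+
| Chapter 1                | 0                 | yes     |
+--------------------------+-------------------+---------+
| Nested Chapter 1.1       | 0                 | yes     |
+--------------------------+-------------------+---------+
| Nested Chapter 1.2       | 1                 | no      |
+--------------------------+-------------------+---------+
| Chapter 2                | 1                 | no      |
+--------------------------+-------------------+---------+
| Nested Chapter 2.1       | 0                 | yes     |
+--------------------------+-------------------+---------+
| Nested Chapter 2.2       | 1                 | no      |
+--------------------------+-------------------+---------+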

20.3.Menu Features

The menu features are handled like a chapter codec. That means each codec has a type, some private data, and some data in the chapters.

The type of the menu system is defined by the ChapProcessCodecID parameter. For now, only two values are supported: 0 for Matroska Script and 1 for the menu borrowed from the DVD [DVD-Video]. The private data (stored in ChapProcessPrivate) depends on the type of menu system, as does the data in the chapters (stored in ChapProcessData).

The menu system, as well as Chapter Codecs in general, can perform actions on the Matroska Player, such as jumping to another Chapter or Edition, selecting different tracks, and possibly more. The scope of all the possibilities of Chapter Codecs is not covered in this document, as it depends on the Chapter Codec features and its integration in a Matroska Player.

20.4.Physical Types

Each level can have different meanings for audio and video. The ORIGINAL_MEDIA_TYPE tag [MatroskaTags] can be used to specify a string for ChapterPhysicalEquiv = 60. Here is the list of possible levels for both audio and video:

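Table 52: ChapterPhysicalEquiv Meaning per Track Type

+=======+==================+===============+=================================+
| Value | Audio            | Video         | Comment                         |
+=======+==================+===============+=================================+
| 70    | SET / PACKAGE    | SET / PACKAGE | The collection of different     |
|       |                  |               | media.                          |
+-------+------------------+---------------+---------------------------------+
| 60    | CD / 12" / 10" / | DVD / VHS /   | The physical medium, such as a  |
|       | 7" / TAPE /      | LASERDISC     | CD or a DVD.                    |
|       | MINIDISC / DAT   |               |                                 |
+-------+------------------+---------------+---------------------------------+
| 50    | SIDE             | SIDE          | When the original medium        |
|       |                  |               | (LP/DVD) has different sides.   |
+-------+------------------+---------------+---------------------------------+
| 40    | -                | LAYER         | Another physical level on DVDs. |
+-------+------------------+---------------+---------------------------------+
| 30    | SESSION          | SESSION       | As found on CDs and DVDs.       |
+-------+------------------+---------------+---------------------------------+
| 20    | TRACK            | -             | As found on CDs.                |
+-------+------------------+---------------+---------------------------------+
| 10    | INDEX            | -             | The first logical level of the  |
|       |                  |               | side/medium.                    |
+-------+------------------+---------------+---------------------------------+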

20.5.Chapter Examples

20.5.1.Example 1: Basic Chaptering

In this example, a movie is split into different chapters. It could also just be an audio file (album) in which each track corresponds to a chapter.

  • 00000 ms - 05000 ms: Intro
  • 05000 ms - 25000 ms: Before the crime
  • 25000 ms - 27500 ms: The crime
  • 27500 ms - 38000 ms: The killer is arrested
  • 38000 ms - 43000 ms: Credits

This would translate in the following Matroska form, with the EBML tree shown as XML:

<Chapters>  <EditionEntry>    <EditionUID>16603393396715046047</EditionUID>    <ChapterAtom>      <ChapterUID>1193046</ChapterUID>      <ChapterTimeStart>0</ChapterTimeStart>      <ChapterTimeEnd>5000000000</ChapterTimeEnd>      <ChapterDisplay>        <ChapString>Intro</ChapString>      </ChapterDisplay>    </ChapterAtom>    <ChapterAtom>      <ChapterUID>2311527</ChapterUID>      <ChapterTimeStart>5000000000</ChapterTimeStart>      <ChapterTimeEnd>25000000000</ChapterTimeEnd>      <ChapterDisplay>        <ChapString>Before the crime</ChapString>      </ChapterDisplay>      <ChapterDisplay>        <ChapString>Avant le crime</ChapString>        <ChapLanguage>fra</ChapLanguage>      </ChapterDisplay>    </ChapterAtom>    <ChapterAtom>      <ChapterUID>3430008</ChapterUID>      <ChapterTimeStart>25000000000</ChapterTimeStart>      <ChapterTimeEnd>27500000000</ChapterTimeEnd>      <ChapterDisplay>        <ChapString>The crime</ChapString>      </ChapterDisplay>      <ChapterDisplay>        <ChapString>Le crime</ChapString>        <ChapLanguage>fra</ChapLanguage>      </ChapterDisplay>    </ChapterAtom>    <ChapterAtom>      <ChapterUID>4548489</ChapterUID>      <ChapterTimeStart>27500000000</ChapterTimeStart>      <ChapterTimeEnd>38000000000</ChapterTimeEnd>      <ChapterDisplay>        <ChapString>After the crime</ChapString>      </ChapterDisplay>      <ChapterDisplay>        <ChapString>Apres le crime</ChapString>        <ChapLanguage>fra</ChapLanguage>      </ChapterDisplay>    </ChapterAtom>    <ChapterAtom>      <ChapterUID>5666960</ChapterUID>      <ChapterTimeStart>38000000000</ChapterTimeStart>      <ChapterTimeEnd>43000000000</ChapterTimeEnd>      <ChapterDisplay>        <ChapString>Credits</ChapString>      </ChapterDisplay>      <ChapterDisplay>        <ChapString>Generique</ChapString>        <ChapLanguage>fra</ChapLanguage>      </ChapterDisplay>    </ChapterAtom>  </EditionEntry></Chapters>
Figure 16:Basic Chapters Example
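
The ChapterTimeStart and ChapterTimeEnd values in Figure 16 are expressed in nanoseconds (5000000000 is five seconds). As a minimal sketch, a writer could derive such values from a "MM:SS" position as follows. The function name `chapter_time_ns` and the "MM:SS[.fraction]" input format are assumptions made for this illustration only; they are not defined by this document.

```python
from decimal import Decimal

NANOSECONDS_PER_SECOND = 1_000_000_000


def chapter_time_ns(position: str) -> int:
    """Convert a 'MM:SS' or 'MM:SS.fff' position into nanoseconds.

    Hypothetical helper: the textual input format is an assumption.
    Decimal is used so fractional seconds such as '27.5' stay exact.
    """
    minutes, seconds = position.split(":")
    total_seconds = int(minutes) * 60 + Decimal(seconds)
    return int(total_seconds * NANOSECONDS_PER_SECOND)
```

With this helper, `chapter_time_ns("00:05")` yields 5000000000 and `chapter_time_ns("00:27.5")` yields 27500000000, which are values used in Figure 16.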

20.5.2.Example 2: Nested Chapters

In this example, an (existing) album is split into different chapters, and one of them is itself split into nested chapters.

20.5.2.1.The Micronauts "Bleep To Bleep"
  • 00:00 - 12:28: Baby wants to Bleep/Rock

    • 00:00 - 04:38: Baby wants to bleep (pt.1)
    • 04:38 - 07:12: Baby wants to rock
    • 07:12 - 10:33: Baby wants to bleep (pt.2)
    • 10:33 - 12:28: Baby wants to bleep (pt.3)
  • 12:30 - 19:38: Bleeper_O+2
  • 19:40 - 22:20: Baby wants to bleep (pt.4)
  • 22:22 - 25:18: Bleep to bleep
  • 25:20 - 33:35: Baby wants to bleep (k)
  • 33:37 - 44:28: Bleeper

This would translate into the following Matroska form, with the EBML tree shown as XML:

<Chapters>
  <EditionEntry>
    <EditionUID>1281690858003401414</EditionUID>
    <ChapterAtom>
      <ChapterUID>1</ChapterUID>
      <ChapterTimeStart>0</ChapterTimeStart>
      <ChapterTimeEnd>748000000</ChapterTimeEnd>
      <ChapterDisplay>
        <ChapString>Baby wants to Bleep/Rock</ChapString>
      </ChapterDisplay>
      <ChapterAtom>
        <ChapterUID>2</ChapterUID>
        <ChapterTimeStart>0</ChapterTimeStart>
        <ChapterTimeEnd>278000000</ChapterTimeEnd>
        <ChapterDisplay>
          <ChapString>Baby wants to bleep (pt.1)</ChapString>
        </ChapterDisplay>
      </ChapterAtom>
      <ChapterAtom>
        <ChapterUID>3</ChapterUID>
        <ChapterTimeStart>278000000</ChapterTimeStart>
        <ChapterTimeEnd>432000000</ChapterTimeEnd>
        <ChapterDisplay>
          <ChapString>Baby wants to rock</ChapString>
        </ChapterDisplay>
      </ChapterAtom>
      <ChapterAtom>
        <ChapterUID>4</ChapterUID>
        <ChapterTimeStart>432000000</ChapterTimeStart>
        <ChapterTimeEnd>633000000</ChapterTimeEnd>
        <ChapterDisplay>
          <ChapString>Baby wants to bleep (pt.2)</ChapString>
        </ChapterDisplay>
      </ChapterAtom>
      <ChapterAtom>
        <ChapterUID>5</ChapterUID>
        <ChapterTimeStart>633000000</ChapterTimeStart>
        <ChapterTimeEnd>748000000</ChapterTimeEnd>
        <ChapterDisplay>
          <ChapString>Baby wants to bleep (pt.3)</ChapString>
        </ChapterDisplay>
      </ChapterAtom>
    </ChapterAtom>
    <ChapterAtom>
      <ChapterUID>6</ChapterUID>
      <ChapterTimeStart>750000000</ChapterTimeStart>
      <ChapterTimeEnd>1178500000</ChapterTimeEnd>
      <ChapterDisplay>
        <ChapString>Bleeper_O+2</ChapString>
      </ChapterDisplay>
    </ChapterAtom>
    <ChapterAtom>
      <ChapterUID>7</ChapterUID>
      <ChapterTimeStart>1180500000</ChapterTimeStart>
      <ChapterTimeEnd>1340000000</ChapterTimeEnd>
      <ChapterDisplay>
        <ChapString>Baby wants to bleep (pt.4)</ChapString>
      </ChapterDisplay>
    </ChapterAtom>
    <ChapterAtom>
      <ChapterUID>8</ChapterUID>
      <ChapterTimeStart>1342000000</ChapterTimeStart>
      <ChapterTimeEnd>1518000000</ChapterTimeEnd>
      <ChapterDisplay>
        <ChapString>Bleep to bleep</ChapString>
      </ChapterDisplay>
    </ChapterAtom>
    <ChapterAtom>
      <ChapterUID>9</ChapterUID>
      <ChapterTimeStart>1520000000</ChapterTimeStart>
      <ChapterTimeEnd>2015000000</ChapterTimeEnd>
      <ChapterDisplay>
        <ChapString>Baby wants to bleep (k)</ChapString>
      </ChapterDisplay>
    </ChapterAtom>
    <ChapterAtom>
      <ChapterUID>10</ChapterUID>
      <ChapterTimeStart>2017000000</ChapterTimeStart>
      <ChapterTimeEnd>2668000000</ChapterTimeEnd>
      <ChapterDisplay>
        <ChapString>Bleeper</ChapString>
      </ChapterDisplay>
    </ChapterAtom>
  </EditionEntry>
</Chapters>
Figure 17:Nested Chapters Example

21.Attachments

Matroska supports storage of related files and data in the Attachments Element (a Top-Level Element). Attachment Elements can be used to store related cover art, font files, transcripts, reports, error recovery files, picture or text-based annotations, copies of specifications, or other ancillary files related to the Segment.

Matroska Readers MUST NOT execute files stored as Attachment Elements.

21.1.Cover Art

This section defines a set of guidelines for the storage of cover art in Matroska files. A Matroska Reader MAY use embedded cover art to display a representational still-image depiction of the multimedia contents of the Matroska file.

Only [JPEG] and PNG [RFC2083] image formats SHOULD be used for cover art pictures.

There can be two different covers for a movie/album: a portrait style (e.g., a DVD case) and a landscape style (e.g., a wide banner ad).

There can be two versions of the same cover: the normal cover and the small cover. The dimension of the normal cover SHOULD be 600 pixels on the smallest side (e.g., 960x600 for landscape, 600x800 for portrait, or 600x600 for square). The dimension of the small cover SHOULD be 120 pixels on the smallest side (e.g., 192x120 or 120x160).

Versions of cover art can be differentiated by the filename that is stored in the FileName Element. The default filename of the normal cover in square or portrait mode is cover.(jpg|png). When stored, the normal cover SHOULD be the first Attachment in storage order. The small cover SHOULD be prefixed with "small_", such as small_cover.(jpg|png). The landscape variant SHOULD be suffixed with "_land", such as cover_land.(jpg|png). The filenames are case-sensitive.

The following table provides examples of file names for cover art in Attachments.

Table 53:Cover Art Filenames
FileName | Image Orientation | Pixel Length of Smallest Side
cover.jpg | Portrait or square | 600
small_cover.png | Portrait or square | 120
cover_land.png | Landscape | 600
small_cover_land.jpg | Landscape | 120
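
The naming rules above can be sketched as a small helper. This is only an illustration: the function name `cover_filename` and its parameters are hypothetical and not part of this specification.

```python
def cover_filename(extension: str, small: bool = False, landscape: bool = False) -> str:
    """Build a cover art FileName from the prefix/suffix rules in this section.

    Hypothetical helper. `extension` is "jpg" or "png", matching the two
    image formats that SHOULD be used for cover art.
    """
    if extension not in ("jpg", "png"):
        raise ValueError("cover art should be JPEG (jpg) or PNG (png)")
    name = "cover"
    if small:
        name = "small_" + name   # small cover: "small_" prefix
    if landscape:
        name = name + "_land"    # landscape variant: "_land" suffix
    return f"{name}.{extension}"
```

For instance, `cover_filename("jpg", small=True, landscape=True)` produces `small_cover_land.jpg`, the last row of Table 53. Because filenames are case-sensitive, a reader comparing names would use an exact string match.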

21.2.Font Files

Font files MAY be added to a Matroska file as Attachments so that the font file may be used to display an associated subtitle track. This allows the presentation of a Matroska file to be consistent in various environments where the needed fonts might not be available on the local system.

Depending on the font format in question, each font file can contain multiple font variants. Each font variant has a name that will be referred to as Font Name from now on. This Font Name can be different from the Attachment's FileName, even when disregarding the extension. In order to select a font for display, a Matroska player SHOULD consider both the Font Name and the base name of the Attachment's FileName, preferring the former when there are multiple matches.

Subtitle codecs, such as SubStation Alpha (SSA/ASS), usually refer to a font by its Font Name instead of its filename. If none of the Attachments are a match for the Font Name, the Matroska player SHOULD attempt to find a system font whose Font Name matches the one used in the subtitle track.

Since loading fonts temporarily can take a while, a Matroska player usually loads or installs all the fonts found in attachments so they are ready to be used during playback. Failure to use the font attachment might result in incorrect rendering of the subtitles.

If a selected subtitle track has some AttachmentLink elements, the player MAY restrict its font rendering to use only these fonts.

A Matroska player SHOULD handle the official font media types from [RFC8081] when the system can handle the type:

font/sfnt:
Generic SFNT Font Type
font/ttf:
TrueType Font (TTF) Font Type
font/otf:
OpenType Layout (OTF) Font Type
font/collection:
Collection Font Type
font/woff:
WOFF 1.0
font/woff2:
WOFF 2.0

Fonts in Matroska existed long before [RFC8081]. A few unofficial media types for fonts were used in existing files. Therefore, it is RECOMMENDED for a Matroska player to support the following legacy media types for font attachments:

application/x-truetype-font:
TTFs, equivalent to font/ttf and sometimes font/otf.
application/x-font-ttf:
TTFs, equivalent to font/ttf.
application/vnd.ms-opentype:
OTF fonts, equivalent to font/otf.
application/font-sfnt:
Generic SFNT Font Type, equivalent to font/sfnt.
application/font-woff:
WOFF 1.0, equivalent to font/woff.

There may also be some font attachments with the application/octet-stream media type. In that case, the Matroska player MAY try to guess the font type by checking the file extension of the AttachedFile\FileName string. Common file extensions for fonts are:

  • .ttf for TTFs, equivalent to font/ttf;

  • .otf for OTF fonts, equivalent to font/otf; and

  • .ttc for Collection fonts, equivalent to font/collection.

The file extension check MUST be case-insensitive.
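
The media type handling described above can be sketched as a lookup. This is an illustrative sketch, not a normative algorithm: the function name `resolve_font_media_type` and the choice to return `None` for unrecognized attachments are assumptions. Note that application/x-truetype-font is mapped to font/ttf here even though it is sometimes used for OTF data.

```python
import os

# Official font media types from RFC 8081 listed in this section.
OFFICIAL_FONT_TYPES = {
    "font/sfnt", "font/ttf", "font/otf",
    "font/collection", "font/woff", "font/woff2",
}

# Legacy media types and their official equivalents, as listed above.
LEGACY_FONT_TYPES = {
    "application/x-truetype-font": "font/ttf",  # sometimes font/otf
    "application/x-font-ttf": "font/ttf",
    "application/vnd.ms-opentype": "font/otf",
    "application/font-sfnt": "font/sfnt",
    "application/font-woff": "font/woff",
}

# Common extensions used to guess the type of application/octet-stream data.
FONT_EXTENSIONS = {
    ".ttf": "font/ttf",
    ".otf": "font/otf",
    ".ttc": "font/collection",
}


def resolve_font_media_type(file_media_type, file_name):
    """Return the official font media type for an attachment, or None.

    Hypothetical helper: `file_media_type` stands for the FileMediaType
    string and `file_name` for the FileName string of an AttachedFile.
    """
    if file_media_type in OFFICIAL_FONT_TYPES:
        return file_media_type
    if file_media_type in LEGACY_FONT_TYPES:
        return LEGACY_FONT_TYPES[file_media_type]
    if file_media_type == "application/octet-stream":
        # The extension check is case-insensitive.
        extension = os.path.splitext(file_name)[1].lower()
        return FONT_EXTENSIONS.get(extension)
    return None
```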

Matroska Writers SHOULD use a valid font media type from [RFC8081] in the AttachedFile\FileMediaType of the font attachment. They MAY use the media types found in older files when compatibility with older players is necessary.

22.Cues

The Cues Element provides an index of certain Cluster Elements to allow for optimized seeking to absolute timestamps within the Segment. The Cues Element contains one or many CuePoint Elements, and each MUST reference an absolute timestamp (via the CueTime Element), a Track (via the CueTrack Element), and a Segment Position (via the CueClusterPosition Element). Additional non-mandated Elements are part of the CuePoint Element, such as CueDuration, CueRelativePosition, CueCodecState, and others that provide any Matroska Reader with additional information to use in the optimization of seeking performance.

22.1.Recommendations

The following recommendations are provided to optimize Matroska performance.

  • Unless Matroska is used as a live stream, it SHOULD contain a Cues Element.

  • For each video track, each keyframe SHOULD be referenced by a CuePoint Element.

  • It is RECOMMENDED to not reference non-keyframes of video tracks in Cues unless the CuePoint references a Cluster Element that contains a CodecState Element but no keyframes.

  • For each subtitle track present, each subtitle frame SHOULD be referenced by a CuePoint Element with a CueDuration Element.

  • References to audio tracks MAY be skipped in CuePoint Elements if a video track is present. When included, the CuePoint Elements SHOULD reference audio keyframes once every 500 milliseconds at most.

  • If the referenced frame is not stored within the first SimpleBlock or first BlockGroup within its Cluster Element, then the CueRelativePosition Element SHOULD be written to reference where in the Cluster the reference frame is stored.

  • If a CuePoint Element references a Cluster Element that includes a CodecState Element, then that CuePoint Element MUST use a CueCodecState Element.

  • CuePoint Elements SHOULD be numerically sorted in storage order by the value of the CueTime Element.
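
Two of the recommendations above lend themselves to a short sketch: limiting audio references to one per 500 milliseconds and sorting CuePoints by CueTime. The `CuePoint` class, its field names, and both function names are hypothetical, and timestamps are assumed to be plain integers in milliseconds for simplicity.

```python
from dataclasses import dataclass


@dataclass
class CuePoint:
    # Hypothetical in-memory model of a CuePoint Element.
    cue_time: int              # CueTime (milliseconds assumed here)
    cue_track: int             # CueTrack
    cue_cluster_position: int  # CueClusterPosition


def thin_audio_cue_times(keyframe_times_ms, min_spacing_ms=500):
    """Keep audio keyframe times that are at least `min_spacing_ms` apart."""
    selected = []
    for time_ms in sorted(keyframe_times_ms):
        if not selected or time_ms - selected[-1] >= min_spacing_ms:
            selected.append(time_ms)
    return selected


def sort_cue_points(cue_points):
    """Return CuePoints in the storage order recommended above (by CueTime)."""
    return sorted(cue_points, key=lambda cue: cue.cue_time)
```

A muxer following this sketch would call `thin_audio_cue_times` on the audio track's keyframe times before creating CuePoints, then write the result of `sort_cue_points`.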

23.Matroska Streaming

In Matroska, there are two kinds of streaming: file access and livestreaming.

23.1.File Access

File access can simply be reading a file located on your computer, but it also includes accessing a file from an HTTP (web) server or Common Internet File System (CIFS) (Windows share) server. These protocols are usually safe from reading errors, and seeking in the stream is possible. However, when a file is stored far away or on a slow server, seeking can be an expensive operation and should be avoided. The guidelines in Section 25, when followed, help reduce the number of seeking operations for regular playback and also let playback start quickly without first reading a lot of data (such as a Cues Element, Attachment Element, or SeekHead Element).

Matroska, having a small overhead, is well suited for storing music/videos on file servers without a big impact on the bandwidth used. Matroska does not require the index to be loaded before playing, which allows playback to start very quickly. The index can be loaded only when seeking is requested the first time.

23.2.Livestreaming

Livestreaming is the equivalent of television broadcasting on the Internet. There are two families of servers for livestreaming: RTP / Real-Time Streaming Protocol (RTSP) and HTTP. Matroska is not meant to be used over RTP. RTP already has timing and channel mechanisms that would be wasted if doubled in Matroska. Additionally, having the same information at the RTP and Matroska level would be a source of confusion if they do not match. Livestreaming of Matroska over file-like protocols like HTTP, QUIC, etc., is possible.

A live Matroska stream is different from a file because it usually has no known end (only ending when the client disconnects). For this, all bits of the "size" portion of the Segment Element MUST be set to 1. Another option is to concatenate Segment Elements with known sizes one after the other. This solution allows a change of codec/resolution between each Segment. For example, this allows for a switch between 4:3 and 16:9 in a television program.
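
As a rough sketch of the unknown-size encoding, the following assumes the variable-size integer (VINT) layout of [RFC8794]: the position of the first set bit in the first octet gives the width, and the remaining data bits carry the value. The function names are hypothetical.

```python
def unknown_size_field(length: int) -> bytes:
    """Encode an Element Data Size of `length` octets with all data bits set to 1."""
    if not 1 <= length <= 8:
        raise ValueError("length must be between 1 and 8 octets")
    # `length - 1` leading zero bits, one marker bit, then 7 * length one bits.
    return ((1 << (7 * length + 1)) - 1).to_bytes(length, "big")


def is_unknown_size(size_field: bytes) -> bool:
    """Return True if `size_field` is a complete VINT whose data bits are all 1."""
    if not size_field or size_field[0] == 0:
        return False
    length = 9 - size_field[0].bit_length()  # width implied by the marker bit
    if len(size_field) != length:
        return False
    data_mask = (1 << (7 * length)) - 1
    return int.from_bytes(size_field, "big") & data_mask == data_mask
```

For a one-octet size field this gives the single byte 0xFF; for an eight-octet field it gives 0x01 followed by seven 0xFF bytes.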

When Segment Elements are continuous, certain Elements like SeekHead, Cues, Chapters, and Attachments MUST NOT be used.

It is possible for a Matroska Player to detect that a stream is not seekable. If the stream has neither a SeekHead list nor a Cues list at the beginning of the stream, it SHOULD be considered non-seekable. Even though it is possible to seek forward in the stream, it is NOT RECOMMENDED.

In the context of live radio or web TV, it is possible to "tag" the content while it is playing. The Tags Element can be placed between Clusters each time it is necessary. In that case, the new Tags Element MUST reset the previously encountered Tags Elements and use the new values instead.

24.Tags

24.1.Tags Precedence

Tags allow tagging all kinds of Matroska parts with very detailed metadata in multiple languages.

Some Matroska elements also contain their own string value, such as the Track Name (Section 5.1.4.1.18) or the Chapter String (Section 5.1.7.1.4.10).

The following Matroska elements can also be defined with tags:

When both values exist in the file, the value found in Tags takes precedence over the value found in the original location of the element. For example, if you have a TrackEntry\Name element and a Tag TITLE for that track in a Matroska Segment, the Tag string SHOULD be used instead of the TrackEntry\Name string to identify the track.
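
The precedence rule can be sketched in a few lines. The function name and the use of `None` to mean "not present in the file" are assumptions made for this illustration.

```python
def track_display_name(track_entry_name, tag_title):
    """Pick the string used to identify a track.

    Hypothetical helper: `track_entry_name` is the TrackEntry\\Name string
    and `tag_title` is the TITLE tag targeting that track; either may be
    None when absent. The tag value wins when both exist.
    """
    if tag_title is not None:
        return tag_title
    return track_entry_name
```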

As the Tag element is optional, a lot of Matroska Readers do not handle it and will not use the tag value when it is found. For maximum compatibility, it is usually better to put the strings in the TrackEntry, ChapterAtom, and Attachment and keep the tags matching these values if tags are also used.

24.2.Tag Levels

Tag elements allow tagging information on multiple levels; each level has a TargetTypeValue (Section 5.1.8.1.1.1). An element for a given TargetTypeValue also applies to the lower levels denoted by smaller TargetTypeValue values. If an upper value doesn't apply to a level, but the actual value to use is not known, an empty TagString element (Section 5.1.8.1.2.5) or an empty TagBinary element (Section 5.1.8.1.2.6) MUST be used as the tag value for this level.

See [MatroskaTags] for more details on common tag names, types, and descriptions.

25.Implementation Recommendations

25.1.Cluster

It is RECOMMENDED that each individual Cluster Element contains no more than five seconds or five megabytes of content.
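
A muxer could apply this recommendation with a check like the one below. This is a sketch under stated assumptions: "five megabytes" is interpreted as 5,000,000 bytes, durations are tracked in nanoseconds, and the function name and parameters are hypothetical.

```python
MAX_CLUSTER_DURATION_NS = 5 * 1_000_000_000  # five seconds
MAX_CLUSTER_BYTES = 5 * 1_000_000            # assumption: decimal megabytes


def should_start_new_cluster(cluster_duration_ns, cluster_bytes,
                             frame_duration_ns, frame_bytes):
    """Return True if adding the next frame would exceed either limit."""
    return (
        cluster_duration_ns + frame_duration_ns > MAX_CLUSTER_DURATION_NS
        or cluster_bytes + frame_bytes > MAX_CLUSTER_BYTES
    )
```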

25.2.SeekHead

It is RECOMMENDED that the first SeekHead Element be followed by a Void Element to allow for the SeekHead Element to be expanded to cover new Top-Level Elements that could be added to the Matroska file, such as Tags, Chapters, and Attachments Elements.

The size of this Void Element should be adjusted depending on whether the Matroska file already has Tags, Chapters, and Attachments Elements.

25.3.Optimum Layouts

While there can be Top-Level Elements in any order, some orderings of Elements are better than others. The following sections detail a few optimum layouts for different use cases.

25.3.1.Optimum Layout for a Muxer

This is the basic layout muxers should be using for an efficient playback experience:

25.3.2.Optimum Layout after Editing Tags

When tags from the previous layout need to be extended, they are moved to the end with the extra information.The location where the old tags were located is voided.

25.3.3.Optimum Layout with Cues at the Front

Cues are usually a big chunk of data referencing a lot of locations in the file. Players that want to seek in the file need to seek to the end of the file to access these locations. It is often better if they are placed early in the file. On the other hand, that means players that don't intend to seek will have to read/skip this data no matter what.

Because the Cues reference locations further in the file, it's often complicated to allocate the proper space for that element before all the locations are known. Therefore, this layout is rarely used:

25.3.4.Optimum Layout for Livestreaming

In Livestreaming (Section 23.2), only a few elements make sense. For example, SeekHead and Cues are useless. All elements other than the Clusters MUST be placed before the Clusters.

  • Info
  • Tracks
  • Attachments (rare)
  • Tags
  • Clusters

26.Security Considerations

Matroska inherits security considerations from EBML.

Attacks on a Matroska Reader could include:

The same error handling done for EBML applies to Matroska files. Particular error handling is not covered in this specification, as this depends on the goal of the Matroska Readers. It is up to each Matroska Reader to decide how to handle errors and whether they are recoverable in its code. For example, if the checksum of the \Segment\Tracks is invalid, some could decide to try to read the data anyway, some will just reject the file, and most will not even check it.

Matroska Reader implementations need to be robust against malicious payloads; those that are related to denial of service are outlined in Section 2.1 of [RFC4732].

Although rarer, the same may apply to a Matroska Writer. Malicious stream data must not cause the Matroska Writer to misbehave, as this might allow an attacker access to transcoding gateways.

As an audio and visual container format, a Matroska file or stream will potentially encapsulate numerous byte streams created with a variety of codecs. Implementers will need to consider the security considerations of these encapsulated formats.

27.IANA Considerations

27.1.Matroska Element IDs Registry

This document creates a new IANA registry called the "Matroska Element IDs" registry.

To register a new Element ID in this registry, one needs an Element ID, a Change Controller (IETF or email of registrant), and an optional reference to a document describing the Element ID.

Element IDs are encoded using the VINT mechanism described in Section 4 of [RFC8794] and can be between one and five octets long. Five-octet-long Element IDs are possible only if they are declared in the EBML header.

Element IDs are described in Section 5 of [RFC8794] with [Err7189] and [Err7191].

One-octet Matroska Element IDs are to be allocated according to the "RFC Required" policy [RFC8126].

Two-octet Matroska Element IDs are to be allocated according to the "Specification Required" policy [RFC8126].

Three-octet and four-octet Matroska Element IDs are to be allocated according to the "First Come First Served" policy [RFC8126].
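
The relationship between an Element ID's octet length and its allocation policy can be sketched as follows. The function names are hypothetical, and the length check assumes the VINT marker-bit layout of [RFC8794], with the Element ID written including its marker (as in 0x1F43B675).

```python
ALLOCATION_POLICIES = {
    1: "RFC Required",
    2: "Specification Required",
    3: "First Come First Served",
    4: "First Come First Served",
}


def element_id_length(element_id: int) -> int:
    """Return the octet length of an Element ID given with its VINT marker."""
    if element_id <= 0:
        raise ValueError("Element ID must be a positive integer")
    length = (element_id.bit_length() + 7) // 8
    first_octet = element_id >> (8 * (length - 1))
    # The marker bit position in the first octet must agree with the length.
    if 9 - first_octet.bit_length() != length:
        raise ValueError("value is not a well-formed Element ID")
    return length


def allocation_policy(element_id: int) -> str:
    """Return the policy named in this section for one- to four-octet IDs."""
    length = element_id_length(element_id)
    if length not in ALLOCATION_POLICIES:
        raise ValueError("no policy is listed here for this ID length")
    return ALLOCATION_POLICIES[length]
```

For example, 0xA3 (SimpleBlock) is a one-octet ID and falls under "RFC Required", while 0x1F43B675 (Cluster) is a four-octet ID and falls under "First Come First Served".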

The allowed values in the Matroska Element IDs registry are similar to the ones found in the EBML Element IDs registry defined in Section 17.1 of [RFC8794].

EBML IDs defined for the EBML Header, as defined in Section 17.1 of [RFC8794], MUST NOT be used as Matroska Element IDs.

Given the scarcity of the one-octet Element IDs, they should only be created to save space for elements found many times in a file (for example, within a BlockGroup or Chapters). The four-octet Element IDs are mostly for synchronization of large elements. They should only be used for such high-level elements. Elements that are not expected to be used often should use three-octet Element IDs.

Elements found in Appendix A have an assigned Matroska Element ID for historical reasons. These elements are not in use and SHOULD NOT be reused unless there are no other IDs available with the desired size. Such IDs are considered as reclaimed to the IANA registry, as they could be used for other things in the future.

Values of Matroska Element IDs found in this document are assigned as initial values as follows:

Table 54:IDs and Names for Matroska Element IDs Assigned by RFC 9559
Element ID | Element Name | Reference
0x80 | ChapterDisplay | Described in Section 5.1.7.1.4.9
0x83 | TrackType | Described in Section 5.1.4.1.3
0x85 | ChapString | Described in Section 5.1.7.1.4.10
0x86 | CodecID | Described in Section 5.1.4.1.21
0x88 | FlagDefault | Described in Section 5.1.4.1.5
0x8E | Slices | Reclaimed (Appendix A.5)
0x91 | ChapterTimeStart | Described in Section 5.1.7.1.4.3
0x92 | ChapterTimeEnd | Described in Section 5.1.7.1.4.4
0x96 | CueRefTime | Described in Section 5.1.5.1.2.8
0x97 | CueRefCluster | Reclaimed (Appendix A.37)
0x98 | ChapterFlagHidden | Described in Section 5.1.7.1.4.5
0x9A | FlagInterlaced | Described in Section 5.1.4.1.28.1
0x9B | BlockDuration | Described in Section 5.1.3.5.3
0x9C | FlagLacing | Described in Section 5.1.4.1.12
0x9D | FieldOrder | Described in Section 5.1.4.1.28.2
0x9F | Channels | Described in Section 5.1.4.1.29.3
0xA0 | BlockGroup | Described in Section 5.1.3.5
0xA1 | Block | Described in Section 5.1.3.5.1
0xA2 | BlockVirtual | Reclaimed (Appendix A.3)
0xA3 | SimpleBlock | Described in Section 5.1.3.4
0xA4 | CodecState | Described in Section 5.1.3.5.6
0xA5 | BlockAdditional | Described in Section 5.1.3.5.2.2
0xA6 | BlockMore | Described in Section 5.1.3.5.2.1
0xA7 | Position | Described in Section 5.1.3.2
0xAA | CodecDecodeAll | Reclaimed (Appendix A.22)
0xAB | PrevSize | Described in Section 5.1.3.3
0xAE | TrackEntry | Described in Section 5.1.4.1
0xAF | EncryptedBlock | Reclaimed (Appendix A.15)
0xB0 | PixelWidth | Described in Section 5.1.4.1.28.6
0xB2 | CueDuration | Described in Section 5.1.5.1.2.4
0xB3 | CueTime | Described in Section 5.1.5.1.1
0xB5 | SamplingFrequency | Described in Section 5.1.4.1.29.1
0xB6 | ChapterAtom | Described in Section 5.1.7.1.4
0xB7 | CueTrackPositions | Described in Section 5.1.5.1.2
0xB9 | FlagEnabled | Described in Section 5.1.4.1.4
0xBA | PixelHeight | Described in Section 5.1.4.1.28.7
0xBB | CuePoint | Described in Section 5.1.5.1
0xC0 | TrickTrackUID | Reclaimed (Appendix A.28)
0xC1 | TrickTrackSegmentUID | Reclaimed (Appendix A.29)
0xC4 | TrickMasterTrackSegmentUID | Reclaimed (Appendix A.32)
0xC6 | TrickTrackFlag | Reclaimed (Appendix A.30)
0xC7 | TrickMasterTrackUID | Reclaimed (Appendix A.31)
0xC8 | ReferenceFrame | Reclaimed (Appendix A.12)
0xC9 | ReferenceOffset | Reclaimed (Appendix A.13)
0xCA | ReferenceTimestamp | Reclaimed (Appendix A.14)
0xCB | BlockAdditionID | Reclaimed (Appendix A.9)
0xCC | LaceNumber | Reclaimed (Appendix A.7)
0xCD | FrameNumber | Reclaimed (Appendix A.8)
0xCE | Delay | Reclaimed (Appendix A.10)
0xCF | SliceDuration | Reclaimed (Appendix A.11)
0xD7 | TrackNumber | Described in Section 5.1.4.1.1
0xDB | CueReference | Described in Section 5.1.5.1.2.7
0xE0 | Video | Described in Section 5.1.4.1.28
0xE1 | Audio | Described in Section 5.1.4.1.29
0xE2 | TrackOperation | Described in Section 5.1.4.1.30
0xE3 | TrackCombinePlanes | Described in Section 5.1.4.1.30.1
0xE4 | TrackPlane | Described in Section 5.1.4.1.30.2
0xE5 | TrackPlaneUID | Described in Section 5.1.4.1.30.3
0xE6 | TrackPlaneType | Described in Section 5.1.4.1.30.4
0xE7 | Timestamp | Described in Section 5.1.3.1
0xE8 | TimeSlice | Reclaimed (Appendix A.6)
0xE9 | TrackJoinBlocks | Described in Section 5.1.4.1.30.5
0xEA | CueCodecState | Described in Section 5.1.5.1.2.6
0xEB | CueRefCodecState | Reclaimed (Appendix A.39)
0xED | TrackJoinUID | Described in Section 5.1.4.1.30.6
0xEE | BlockAddID | Described in Section 5.1.3.5.2.3
0xF0 | CueRelativePosition | Described in Section 5.1.5.1.2.3
0xF1 | CueClusterPosition | Described in Section 5.1.5.1.2.2
0xF7 | CueTrack | Described in Section 5.1.5.1.2.1
0xFA | ReferencePriority | Described in Section 5.1.3.5.4
0xFB | ReferenceBlock | Described in Section 5.1.3.5.5
0xFD | ReferenceVirtual | Reclaimed (Appendix A.4)
0x41A4 | BlockAddIDName | Described in Section 5.1.4.1.17.2
0x41E4 | BlockAdditionMapping | Described in Section 5.1.4.1.17
0x41E7 | BlockAddIDType | Described in Section 5.1.4.1.17.3
0x41ED | BlockAddIDExtraData | Described in Section 5.1.4.1.17.4
0x41F0 | BlockAddIDValue | Described in Section 5.1.4.1.17.1
0x4254 | ContentCompAlgo | Described in Section 5.1.4.1.31.6
0x4255 | ContentCompSettings | Described in Section 5.1.4.1.31.7
0x437C | ChapLanguage | Described in Section 5.1.7.1.4.11
0x437D | ChapLanguageBCP47 | Described in Section 5.1.7.1.4.12
0x437E | ChapCountry | Described in Section 5.1.7.1.4.13
0x4444 | SegmentFamily | Described in Section 5.1.2.7
0x4461 | DateUTC | Described in Section 5.1.2.11
0x447A | TagLanguage | Described in Section 5.1.8.1.2.2
0x447B | TagLanguageBCP47 | Described in Section 5.1.8.1.2.3
0x4484 | TagDefault | Described in Section 5.1.8.1.2.4
0x4485 | TagBinary | Described in Section 5.1.8.1.2.6
0x4487 | TagString | Described in Section 5.1.8.1.2.5
0x4489 | Duration | Described in Section 5.1.2.10
0x44B4 | TagDefaultBogus | Reclaimed (Appendix A.43)
0x450D | ChapProcessPrivate | Described in Section 5.1.7.1.4.16
0x45A3 | TagName | Described in Section 5.1.8.1.2.1
0x45B9 | EditionEntry | Described in Section 5.1.7.1
0x45BC | EditionUID | Described in Section 5.1.7.1.1
0x45DB | EditionFlagDefault | Described in Section 5.1.7.1.2
0x45DD | EditionFlagOrdered | Described in Section 5.1.7.1.3
0x465C | FileData | Described in Section 5.1.6.1.4
0x4660 | FileMediaType | Described in Section 5.1.6.1.3
0x4661 | FileUsedStartTime | Reclaimed (Appendix A.41)
0x4662 | FileUsedEndTime | Reclaimed (Appendix A.42)
0x466E | FileName | Described in Section 5.1.6.1.2
0x4675 | FileReferral | Reclaimed (Appendix A.40)
0x467E | FileDescription | Described in Section 5.1.6.1.1
0x46AE | FileUID | Described in Section 5.1.6.1.5
0x47E1 | ContentEncAlgo | Described in Section 5.1.4.1.31.9
0x47E2 | ContentEncKeyID | Described in Section 5.1.4.1.31.10
0x47E3 | ContentSignature | Reclaimed (Appendix A.33)
0x47E4 | ContentSigKeyID | Reclaimed (Appendix A.34)
0x47E5 | ContentSigAlgo | Reclaimed (Appendix A.35)
0x47E6 | ContentSigHashAlgo | Reclaimed (Appendix A.36)
0x47E7 | ContentEncAESSettings | Described in Section 5.1.4.1.31.11
0x47E8 | AESSettingsCipherMode | Described in Section 5.1.4.1.31.12
0x4D80 | MuxingApp | Described in Section 5.1.2.13
0x4DBB | Seek | Described in Section 5.1.1.1
0x5031 | ContentEncodingOrder | Described in Section 5.1.4.1.31.2
0x5032 | ContentEncodingScope | Described in Section 5.1.4.1.31.3
0x5033 | ContentEncodingType | Described in Section 5.1.4.1.31.4
0x5034 | ContentCompression | Described in Section 5.1.4.1.31.5
0x5035 | ContentEncryption | Described in Section 5.1.4.1.31.8
0x535F | CueRefNumber | Reclaimed (Appendix A.38)
0x536E | Name | Described in Section 5.1.4.1.18
0x5378 | CueBlockNumber | Described in Section 5.1.5.1.2.5
0x537F | TrackOffset | Reclaimed (Appendix A.18)
0x53AB | SeekID | Described in Section 5.1.1.1.1
0x53AC | SeekPosition | Described in Section 5.1.1.1.2
0x53B8 | StereoMode | Described in Section 5.1.4.1.28.3
0x53B9 | OldStereoMode | Described in Section 5.1.4.1.28.5
0x53C0 | AlphaMode | Described in Section 5.1.4.1.28.4
0x54AA | PixelCropBottom | Described in Section 5.1.4.1.28.8
0x54B0 | DisplayWidth | Described in Section 5.1.4.1.28.12
0x54B2 | DisplayUnit | Described in Section 5.1.4.1.28.14
0x54B3 | AspectRatioType | Reclaimed (Appendix A.24)
0x54BA | DisplayHeight | Described in Section 5.1.4.1.28.13
0x54BB | PixelCropTop | Described in Section 5.1.4.1.28.9
0x54CC | PixelCropLeft | Described in Section 5.1.4.1.28.10
0x54DD | PixelCropRight | Described in Section 5.1.4.1.28.11
0x55AA | FlagForced | Described in Section 5.1.4.1.6
0x55AB | FlagHearingImpaired | Described in Section 5.1.4.1.7
0x55AC | FlagVisualImpaired | Described in Section 5.1.4.1.8
0x55AD | FlagTextDescriptions | Described in Section 5.1.4.1.9
0x55AE | FlagOriginal | Described in Section 5.1.4.1.10
0x55AF | FlagCommentary | Described in Section 5.1.4.1.11
0x55B0 | Colour | Described in Section 5.1.4.1.28.16
0x55B1 | MatrixCoefficients | Described in Section 5.1.4.1.28.17
0x55B2 | BitsPerChannel | Described in Section 5.1.4.1.28.18
0x55B3 | ChromaSubsamplingHorz | Described in Section 5.1.4.1.28.19
0x55B4 | ChromaSubsamplingVert | Described in Section 5.1.4.1.28.20
0x55B5 | CbSubsamplingHorz | Described in Section 5.1.4.1.28.21
0x55B6 | CbSubsamplingVert | Described in Section 5.1.4.1.28.22
0x55B7 | ChromaSitingHorz | Described in Section 5.1.4.1.28.23
0x55B8 | ChromaSitingVert | Described in Section 5.1.4.1.28.24
0x55B9 | Range | Described in Section 5.1.4.1.28.25
0x55BA | TransferCharacteristics | Described in Section 5.1.4.1.28.26
0x55BB | Primaries | Described in Section 5.1.4.1.28.27
0x55BC | MaxCLL | Described in Section 5.1.4.1.28.28
0x55BD | MaxFALL | Described in Section 5.1.4.1.28.29
0x55D0 | MasteringMetadata | Described in Section 5.1.4.1.28.30
0x55D1 | PrimaryRChromaticityX | Described in Section 5.1.4.1.28.31
0x55D2 | PrimaryRChromaticityY | Described in Section 5.1.4.1.28.32
0x55D3 | PrimaryGChromaticityX | Described in Section 5.1.4.1.28.33
0x55D4 | PrimaryGChromaticityY | Described in Section 5.1.4.1.28.34
0x55D5 | PrimaryBChromaticityX | Described in Section 5.1.4.1.28.35
0x55D6 | PrimaryBChromaticityY | Described in Section 5.1.4.1.28.36
0x55D7 | WhitePointChromaticityX | Described in Section 5.1.4.1.28.37
0x55D8 | WhitePointChromaticityY | Described in Section 5.1.4.1.28.38
0x55D9 | LuminanceMax | Described in Section 5.1.4.1.28.39
0x55DA | LuminanceMin | Described in Section 5.1.4.1.28.40
0x55EE | MaxBlockAdditionID | Described in Section 5.1.4.1.16
0x5654 | ChapterStringUID | Described in Section 5.1.7.1.4.2
0x56AA | CodecDelay | Described in Section 5.1.4.1.25
0x56BB | SeekPreRoll | Described in Section 5.1.4.1.26
0x5741 | WritingApp | Described in Section 5.1.2.14
0x5854 | SilentTracks | Reclaimed (Appendix A.1)
0x58D7 | SilentTrackNumber | Reclaimed (Appendix A.2)
0x61A7 | AttachedFile | Described in Section 5.1.6.1
0x6240 | ContentEncoding | Described in Section 5.1.4.1.31.1
0x6264 | BitDepth | Described in Section 5.1.4.1.29.4
0x63A2 | CodecPrivate | Described in Section 5.1.4.1.22
0x63C0 | Targets | Described in Section 5.1.8.1.1
0x63C3 | ChapterPhysicalEquiv | Described in Section 5.1.7.1.4.8
0x63C4 | TagChapterUID | Described in Section 5.1.8.1.1.5
0x63C5 | TagTrackUID | Described in Section 5.1.8.1.1.3
0x63C6 | TagAttachmentUID | Described in Section 5.1.8.1.1.6
0x63C9 | TagEditionUID | Described in Section 5.1.8.1.1.4
0x63CA | TargetType | Described in Section 5.1.8.1.1.2
0x6624 | TrackTranslate | Described in Section 5.1.4.1.27
0x66A5 | TrackTranslateTrackID | Described in Section 5.1.4.1.27.1
0x66BF | TrackTranslateCodec | Described in Section 5.1.4.1.27.2
0x66FC | TrackTranslateEditionUID | Described in Section 5.1.4.1.27.3
0x67C8 | SimpleTag | Described in Section 5.1.8.1.2
0x68CA | TargetTypeValue | Described in Section 5.1.8.1.1.1
0x6911 | ChapProcessCommand | Described in Section 5.1.7.1.4.17
0x6922 | ChapProcessTime | Described in Section 5.1.7.1.4.18
0x6924 | ChapterTranslate | Described in Section 5.1.2.8
0x6933 | ChapProcessData | Described in Section 5.1.7.1.4.19
0x6944 | ChapProcess | Described in Section 5.1.7.1.4.14
0x6955 | ChapProcessCodecID | Described in Section 5.1.7.1.4.15
0x69A5 | ChapterTranslateID | Described in Section 5.1.2.8.1
0x69BF | ChapterTranslateCodec | Described in Section 5.1.2.8.2
0x69FC | ChapterTranslateEditionUID | Described in Section 5.1.2.8.3
0x6D80 | ContentEncodings | Described in Section 5.1.4.1.31
0x6DE7 | MinCache | Reclaimed (Appendix A.16)
0x6DF8 | MaxCache | Reclaimed (Appendix A.17)
0x6E67 | ChapterSegmentUUID | Described in Section 5.1.7.1.4.6
0x6EBC | ChapterSegmentEditionUID | Described in Section 5.1.7.1.4.7
0x6FAB | TrackOverlay | Reclaimed (Appendix A.23)
0x7373 | Tag | Described in Section 5.1.8.1
0x7384 | SegmentFilename | Described in Section 5.1.2.2
0x73A4 | SegmentUUID | Described in Section 5.1.2.1
0x73C4 | ChapterUID | Described in Section 5.1.7.1.4.1
0x73C5 | TrackUID | Described in Section 5.1.4.1.2
0x7446 | AttachmentLink | Described in Section 5.1.4.1.24
0x75A1 | BlockAdditions | Described in Section 5.1.3.5.2
0x75A2 | DiscardPadding | Described in Section 5.1.3.5.7
0x7670 | Projection | Described in Section 5.1.4.1.28.41
0x7671 | ProjectionType | Described in Section 5.1.4.1.28.42
0x7672 | ProjectionPrivate | Described in Section 5.1.4.1.28.43
0x7673 | ProjectionPoseYaw | Described in Section 5.1.4.1.28.44
0x7674 | ProjectionPosePitch | Described in Section 5.1.4.1.28.45
0x7675 | ProjectionPoseRoll | Described in Section 5.1.4.1.28.46
0x78B5 | OutputSamplingFrequency | Described in Section 5.1.4.1.29.2
0x7BA9 | Title | Described in Section 5.1.2.12
0x7D7B | ChannelPositions | Reclaimed (Appendix A.27)
0x22B59C | Language | Described in Section 5.1.4.1.19
0x22B59D | LanguageBCP47 | Described in Section 5.1.4.1.20
0x23314F | TrackTimestampScale | Described in Section 5.1.4.1.15
0x234E7A | DefaultDecodedFieldDuration | Described in Section 5.1.4.1.14
0x2383E3 | FrameRate | Reclaimed (Appendix A.26)
0x23E383 | DefaultDuration | Described in Section 5.1.4.1.13
0x258688 | CodecName | Described in Section 5.1.4.1.23
0x26B240 | CodecDownloadURL | Reclaimed (Appendix A.21)
0x2AD7B1 | TimestampScale | Described in Section 5.1.2.9
0x2EB524 | UncompressedFourCC | Described in Section 5.1.4.1.28.15
0x2FB523 | GammaValue | Reclaimed (Appendix A.25)
0x3A9697 | CodecSettings | Reclaimed (Appendix A.19)
0x3B4040 | CodecInfoURL | Reclaimed (Appendix A.20)
0x3C83AB | PrevFilename | Described in Section 5.1.2.4
0x3CB923 | PrevUUID | Described in Section 5.1.2.3
0x3E83BB | NextFilename | Described in Section 5.1.2.6
0x3EB923 | NextUUID | Described in Section 5.1.2.5
0x1043A770 | Chapters | Described in Section 5.1.7
0x114D9B74 | SeekHead | Described in Section 5.1.1
0x1254C367 | Tags | Described in Section 5.1.8
0x1549A966 | Info | Described in Section 5.1.2
0x1654AE6B | Tracks | Described in Section 5.1.4
0x18538067 | Segment | Described in Section 5.1
0x1941A469 | Attachments | Described in Section 5.1.6
0x1C53BB6B | Cues | Described in Section 5.1.5
0x1F43B675 | Cluster | Described in Section 5.1.3

27.2.Chapter Codec IDs Registry

This document creates a new IANA registry called the "Matroska Chapter Codec IDs" registry. The values correspond to the unsigned integer ChapProcessCodecID value described in Section 5.1.7.1.4.15.

To register a new Chapter Codec ID in this registry, one needs a Chapter Codec ID, a Change Controller (IETF or email of registrant), and an optional reference to a document describing the Chapter Codec ID.

The Chapter Codec IDs are to be allocated according to the "First Come First Served" policy [RFC8126].

ChapProcessCodecID values of "0" and "1" are RESERVED to the IETF for future use.
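
The registration fields and the reserved values described above can be sketched as a small validation helper. This is only an illustration under stated assumptions: the class name, the field names, and the validation function are hypothetical and are not defined by this document or by IANA.

```python
from dataclasses import dataclass
from typing import List, Optional

# Values 0 and 1 are reserved to the IETF, per the text above.
RESERVED_CHAPTER_CODEC_IDS = frozenset({0, 1})


@dataclass(frozen=True)
class ChapterCodecRegistration:
    """Hypothetical record mirroring the three registration fields."""
    codec_id: int                    # unsigned ChapProcessCodecID value
    change_controller: str           # "IETF" or the registrant's email
    reference: Optional[str] = None  # optional document describing the ID


def validate_registration(reg: ChapterCodecRegistration) -> List[str]:
    """Return a list of problems; an empty list means the request looks well formed."""
    problems = []
    if reg.codec_id < 0:
        problems.append("codec_id must be an unsigned integer")
    elif reg.codec_id in RESERVED_CHAPTER_CODEC_IDS:
        problems.append("codec_id 0 and 1 are reserved to the IETF")
    if not reg.change_controller.strip():
        problems.append("change_controller is required")
    return problems
```

The helper only checks the shape of a request; whether a value is already allocated would have to be checked against the registry itself.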

27.3.Media Types

Matroska files and streams are found in three main forms: audio-video files, audio-only files, and, occasionally, files with stereoscopic video tracks.

Historically, Matroska files and streams have used the following media types with an "x-" prefix. For better compatibility, a system SHOULD be able to handle both formats. Newer systems SHOULD NOT use the historic format and use the format that follows [RFC6838] instead.

Please register three media types; the [RFC6838] templates are below:

27.3.1.For Files Containing Video Tracks

Type name:
video
Subtype name:
matroska
Required parameters:
N/A
Optional parameters:
N/A
Encoding considerations:
as per this document and RFC 8794
Security considerations:
See Section 26.
Interoperability considerations:
Due to the extensibility of Matroska, it is possible to encounter files with unknown but valid EBML Elements. Readers should be ready to handle this case. The fixed byte order, octet boundaries, and UTF-8 usage allow for broad interoperability.
Published specification:
RFC 9559
Applications that use this media type:
FFmpeg, VLC, ...
Fragment identifier considerations:
N/A
Additional information:


Deprecated alias names for this type:
video/x-matroska
Magic number(s):
N/A
File extension(s):
mkv
Macintosh file type code(s):
N/A
Person & email address to contact for further information:
IETF CELLAR WG cellar@ietf.org
Intended usage:
COMMON
Restrictions on usage:
None
Author:
IETF CELLAR WG
Change controller:
IETF

27.3.2.For Files Containing Audio Tracks with No Video Tracks

Type name:
audio
Subtype name:
matroska
Required parameters:
N/A
Optional parameters:
N/A
Encoding considerations:
as per this document and RFC 8794
Security considerations:
See Section 26.
Interoperability considerations:
Due to the extensibility of Matroska, it is possible to encounter files with unknown but valid EBML Elements. Readers should be ready to handle this case. The fixed byte order, octet boundaries, and UTF-8 usage allow for broad interoperability.
Published specification:
RFC 9559
Applications that use this media type:
FFmpeg, VLC, ...
Fragment identifier considerations:
N/A
Additional information:


Deprecated alias names for this type:
audio/x-matroska
Magic number(s):
N/A
File extension(s):
mka
Macintosh file type code(s):
N/A
Person & email address to contact for further information:
IETF CELLAR WG cellar@ietf.org
Intended usage:
COMMON
Restrictions on usage:
None
Author:
IETF CELLAR WG
Change controller:
IETF

27.3.3.For Files Containing a Stereoscopic Video Track

Type name:
video
Subtype name:
matroska-3d
Required parameters:
N/A
Optional parameters:
N/A
Encoding considerations:
as per this document and RFC 8794
Security considerations:
See Section 26.
Interoperability considerations:
Due to the extensibility of Matroska, it is possible to encounter files with unknown but valid EBML Elements. Readers should be ready to handle this case. The fixed byte order, octet boundaries, and UTF-8 usage allow for broad interoperability.
Published specification:
RFC 9559
Applications that use this media type:
FFmpeg, VLC, ...
Fragment identifier considerations:
N/A
Additional information:


Deprecated alias names for this type:
video/x-matroska-3d
Magic number(s):
N/A
File extension(s):
mk3d
Macintosh file type code(s):
N/A
Person & email address to contact for further information:
IETF CELLAR WG cellar@ietf.org
Intended usage:
COMMON
Restrictions on usage:
None
Author:
IETF CELLAR WG
Change controller:
IETF
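
The three templates above can be summarized as two lookup tables: one from file extension to registered media type and one from each deprecated "x-" alias to its registered replacement. The following is a minimal sketch; the function and variable names are hypothetical, and media type parameters (anything after a ";") are not handled.

```python
import os
from typing import Optional

# Registered media types keyed by file extension, as listed in the templates.
MEDIA_TYPE_BY_EXTENSION = {
    "mkv": "video/matroska",
    "mka": "audio/matroska",
    "mk3d": "video/matroska-3d",
}

# Deprecated "x-" aliases mapped to their registered replacements.
REGISTERED_BY_DEPRECATED_ALIAS = {
    "video/x-matroska": "video/matroska",
    "audio/x-matroska": "audio/matroska",
    "video/x-matroska-3d": "video/matroska-3d",
}


def normalize_media_type(media_type: str) -> str:
    """Accept either form on input and return the registered form."""
    key = media_type.strip().lower()
    return REGISTERED_BY_DEPRECATED_ALIAS.get(key, key)


def media_type_for_filename(filename: str) -> Optional[str]:
    """Guess the registered media type from the extension; None if unknown."""
    extension = os.path.splitext(filename)[1].lstrip(".").lower()
    return MEDIA_TYPE_BY_EXTENSION.get(extension)
```

Normalizing on input while emitting only the registered form is one way to handle both formats without producing the historic one.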

28.References

28.1.Normative References

[BCP47]
Phillips, A., Ed. and M. Davis, Ed., "Tags for Identifying Languages", BCP 47, RFC 5646, DOI 10.17487/RFC5646, <https://www.rfc-editor.org/info/rfc5646>.
[CIE-1931]
Wikipedia, "CIE 1931 color space", <https://en.wikipedia.org/wiki/CIE_1931_color_space>.
[ISO639-2]
International Organization for Standardization, "Codes for the Representation of Names of Languages", ISO 639-2, <https://www.loc.gov/standards/iso639-2/php/code_list.php>.
[ISO9899]
International Organization for Standardization, "Information technology -- Programming languages -- C", ISO/IEC 9899:2018, <https://www.iso.org/standard/74528.html>.
[ITU-H.273]
ITU-T, "Coding-independent code points for video signal type identification", ITU-T Recommendation H.273, <https://www.itu.int/rec/T-REC-H.273-202309-P/en>.
[RFC1950]
Deutsch, P. and J. Gailly, "ZLIB Compressed Data Format Specification version 3.3", RFC 1950, DOI 10.17487/RFC1950, <https://www.rfc-editor.org/info/rfc1950>.
[RFC2119]
Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, <https://www.rfc-editor.org/info/rfc2119>.
[RFC4122]
Leach, P., Mealling, M., and R. Salz, "A Universally Unique IDentifier (UUID) URN Namespace", RFC 4122, DOI 10.17487/RFC4122, <https://www.rfc-editor.org/info/rfc4122>.
[RFC6838]
Freed, N., Klensin, J., and T. Hansen, "Media Type Specifications and Registration Procedures", BCP 13, RFC 6838, DOI 10.17487/RFC6838, <https://www.rfc-editor.org/info/rfc6838>.
[RFC8081]
Lilley, C., "The "font" Top-Level Media Type", RFC 8081, DOI 10.17487/RFC8081, <https://www.rfc-editor.org/info/rfc8081>.
[RFC8126]
Cotton, M., Leiba, B., and T. Narten, "Guidelines for Writing an IANA Considerations Section in RFCs", BCP 26, RFC 8126, DOI 10.17487/RFC8126, <https://www.rfc-editor.org/info/rfc8126>.
[RFC8174]
Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174, <https://www.rfc-editor.org/info/rfc8174>.
[RFC8794]
Lhomme, S., Rice, D., and M. Bunkus, "Extensible Binary Meta Language", RFC 8794, DOI 10.17487/RFC8794, <https://www.rfc-editor.org/info/rfc8794>.

28.2.Informative References

[AVIFormat]
Microsoft Corporation, "AVI RIFF File Reference", <https://docs.microsoft.com/en-us/windows/win32/directshow/avi-riff-file-reference>.
[Blowfish]
Schneier, B., "The Blowfish Encryption Algorithm", <https://www.schneier.com/academic/blowfish/>.
[BZIP2]
Seward, J., "bzip2", <https://sourceware.org/bzip2/>.
[DivXTrickTrack]
"Smooth FF/RW", <https://web.archive.org/web/20101222001148/http://labs.divx.com/node/16601>.
[DivXWorldFonts]
"World Fonts", <https://web.archive.org/web/20110214132246/http://labs.divx.com/node/16602>.
[DVD-Video]
DVD Forum, "DVD-Books: Part 3 DVD-Video Book", <http://www.dvdforum.org/>.
[Err7189]
RFC Errata, Erratum ID 7189, RFC 8794, <https://www.rfc-editor.org/errata/eid7189>.
[Err7191]
RFC Errata, Erratum ID 7191, RFC 8794, <https://www.rfc-editor.org/errata/eid7191>.
[FIPS197]
National Institute of Standards and Technology (NIST), "Advanced Encryption Standard (AES)", FIPS PUB 197, DOI 10.6028/NIST.FIPS.197, <https://csrc.nist.gov/publications/detail/fips/197/final>.
[FIPS46-3]
National Institute of Standards and Technology (NIST), "Data Encryption Standard (DES)", FIPS PUB 46, <https://csrc.nist.gov/publications/detail/fips/46/3/archive/1999-10-25>.
[FourCC-RGB]
FOURCC, "RGB pixel formats", <https://web.archive.org/web/20160609214806/https://www.fourcc.org/rgb.php>.
[FourCC-YUV]
FOURCC, "YUV pixel formats", <https://web.archive.org/web/20160609214806/https://www.fourcc.org/yuv.php>.
[JPEG]
ITU, "INFORMATION TECHNOLOGY - DIGITAL COMPRESSION AND CODING OF CONTINUOUS-TONE STILL IMAGES - REQUIREMENTS AND GUIDELINES", ITU Recommendation T.81, <https://www.w3.org/Graphics/JPEG/itu-t81.pdf>.
[LZO]
Tarreau, W. and R. Rodgman, "LZO stream format as understood by Linux's LZO decompressor", <https://www.kernel.org/doc/Documentation/lzo.txt>.
[MatroskaCodec]
Lhomme, S., Bunkus, M., and D. Rice, "Matroska Media Container Codec Specifications", Work in Progress, Internet-Draft, draft-ietf-cellar-codec-12, <https://datatracker.ietf.org/doc/html/draft-ietf-cellar-codec-12>.
[MatroskaTags]
Lhomme, S., Bunkus, M., and D. Rice, "Matroska Media Container Tag Specifications", Work in Progress, Internet-Draft, draft-ietf-cellar-tags-12, <https://datatracker.ietf.org/doc/html/draft-ietf-cellar-tags-12>.
[MCF]
"MCF specification, introduction", <http://mukoli.free.fr/mcf/>.
[MSRGB]
Microsoft Corporation, "Compression Enumeration", <https://learn.microsoft.com/en-us/openspecs/windows_protocols/ms-wmf/4e588f70-bd92-4a6f-b77f-35d0feaf7a57>.
[MSYUV16]
Microsoft Corporation, "10-bit and 16-bit YUV Video Formats", <https://learn.microsoft.com/en-us/windows/win32/medfound/10-bit-and-16-bit-yuv-video-formats>.
[MSYUV8]
Microsoft Corporation, "Recommended 8-Bit YUV Formats for Video Rendering", <https://learn.microsoft.com/en-us/windows/win32/medfound/recommended-8-bit-yuv-formats-for-video-rendering>.
[RFC0959]
Postel, J. and J. Reynolds, "File Transfer Protocol", STD 9, RFC 959, DOI 10.17487/RFC0959, <https://www.rfc-editor.org/info/rfc959>.
[RFC2083]
Boutell, T., "PNG (Portable Network Graphics) Specification Version 1.0", RFC 2083, DOI 10.17487/RFC2083, <https://www.rfc-editor.org/info/rfc2083>.
[RFC3533]
Pfeiffer, S., "The Ogg Encapsulation Format Version 0", RFC 3533, DOI 10.17487/RFC3533, <https://www.rfc-editor.org/info/rfc3533>.
[RFC4732]
Handley, M., Ed., Rescorla, E., Ed., and IAB, "Internet Denial-of-Service Considerations", RFC 4732, DOI 10.17487/RFC4732, <https://www.rfc-editor.org/info/rfc4732>.
[RFC9110]
Fielding, R., Ed., Nottingham, M., Ed., and J. Reschke, Ed., "HTTP Semantics", STD 97, RFC 9110, DOI 10.17487/RFC9110, <https://www.rfc-editor.org/info/rfc9110>.
[SMB-CIFS]
Microsoft Corporation, "[MS-CIFS]: Common Internet File System (CIFS) Protocol", <https://winprotocoldoc.blob.core.windows.net/productionwindowsarchives/MS-CIFS/%5bMS-CIFS%5d.pdf>.
[SP800-38A]
National Institute of Standards and Technology (NIST), "Recommendation for Block Cipher Modes of Operation: Methods and Techniques", DOI 10.6028/NIST.SP.800-38A, NIST Special Publication 800-38A, <https://nvlpubs.nist.gov/nistpubs/Legacy/SP/nistspecialpublication800-38a.pdf>.
[SP800-67]
National Institute of Standards and Technology (NIST), "Recommendation for the Triple Data Encryption Algorithm (TDEA) Block Cipher", DOI 10.6028/NIST.SP.800-67r2, NIST Special Publication 800-67, <https://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.800-67r2.pdf>.
[Twofish]
Schneier, B., Kelsey, J., Whiting, D., Wagner, D., Hall, C., and N. Ferguson, "Twofish: A 128-Bit Block Cipher", <https://www.schneier.com/wp-content/uploads/2016/02/paper-twofish-paper.pdf>.
[WebM-Enc]
Galligan, F., "WebM Encryption", <https://www.webmproject.org/docs/webm-encryption/>.
[WebVTT]
Pieters, S., Pfeiffer, S., Ed., Jaegenstedt, P., and I. Hickson, "WebVTT: The Web Video Text Tracks Format", W3C Candidate Recommendation, <https://www.w3.org/TR/2019/CR-webvtt1-20190404/>.

Appendix A.Historic Deprecated Elements

As Matroska has evolved since 2002, many parts that were considered for use in the format were never used and were often incorrectly designed. Many of the elements defined at the time are not found in any known files but were part of public specifications. DivX also had a few custom elements that were designed for custom features.

This appendix lists the elements that have a known ID that SHOULD NOT be reused, to avoid colliding with existing files. They might be reassigned by IANA in the future if there are no more IDs for a given size. A short description of what each ID was used for is included, but the text is not normative.
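
To illustrate what "a given size" means for an ID, the sketch below computes the encoded length of an Element ID from its leading bits, following the EBML ID encoding of [RFC8794], and checks it against a few reclaimed IDs copied from this appendix. The function names and the partial lookup table are hypothetical, and the 4-octet limit is assumed to be the maximum ID length used by Matroska.

```python
# A few reclaimed IDs copied from this appendix; deliberately incomplete.
RECLAIMED_IDS = {
    0xA2: "BlockVirtual",
    0x5854: "SilentTracks",
    0x6DE7: "MinCache",
    0x2383E3: "FrameRate",
}


def element_id_length(element_id: int) -> int:
    """Return the encoded length in octets (1 to 4) of an Element ID.

    The ID is given with its length marker included, as in the tables
    (for example 0x6DE7). The number of leading zero bits in the first
    octet, plus one, gives the length.
    """
    if element_id <= 0:
        raise ValueError("Element ID must be positive")
    length = (element_id.bit_length() + 7) // 8
    if length > 4:
        raise ValueError("Element ID is longer than 4 octets")
    first_octet = element_id >> (8 * (length - 1))
    marker_length = 8 - first_octet.bit_length() + 1
    if marker_length != length:
        raise ValueError(f"0x{element_id:X} is not a well-formed Element ID")
    return length


def is_reclaimed(element_id: int) -> bool:
    """True if the ID appears in the (partial) reclaimed table above."""
    return element_id in RECLAIMED_IDS
```

For example, 0x6DE7 (MinCache) begins with the bits 01 and is therefore a 2-octet ID, so it competes for space only with other 2-octet IDs.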

A.1.SilentTracks Element

type / id:
master / 0x5854
path:
\Segment\Cluster\SilentTracks
documentation:
The list of tracks that are not used in that part of the stream. It is useful when using overlay tracks on seeking or deciding what track to use.

A.2.SilentTrackNumber Element

type / id:
uinteger / 0x58D7
path:
\Segment\Cluster\SilentTracks\SilentTrackNumber
documentation:
One of the track numbers that are not used from now on in the stream. It could change later if it is not specified as silent in a further Cluster.

A.3.BlockVirtual Element

type / id:
binary / 0xA2
path:
\Segment\Cluster\BlockGroup\BlockVirtual
documentation:
A Block with no data. It must be stored in the stream at the place that the real Block would be in display order.

A.4.ReferenceVirtual Element

type / id:
integer / 0xFD
path:
\Segment\Cluster\BlockGroup\ReferenceVirtual
documentation:
The Segment Position of the data that would otherwise be in position of the virtual block.

A.5.Slices Element

type / id:
master / 0x8E
path:
\Segment\Cluster\BlockGroup\Slices
documentation:
Contains slices description.

A.6.TimeSlice Element

type / id:
master / 0xE8
path:
\Segment\Cluster\BlockGroup\Slices\TimeSlice
documentation:
Contains extra time information about the data contained in the Block. Being able to interpret this Element is not required for playback.

A.7.LaceNumber Element

type / id:
uinteger / 0xCC
path:
\Segment\Cluster\BlockGroup\Slices\TimeSlice\LaceNumber
documentation:
The reverse number of the frame in the lace (0 is the last frame, 1 is the next to last, etc.). Being able to interpret this Element is not required for playback.

A.8.FrameNumber Element

type / id:
uinteger / 0xCD
path:
\Segment\Cluster\BlockGroup\Slices\TimeSlice\FrameNumber
documentation:
The number of the frame to generate from this lace with this delay (allows for the generation of many frames from the same Block/Frame).

A.9.BlockAdditionID Element

type / id:
uinteger / 0xCB
path:
\Segment\Cluster\BlockGroup\Slices\TimeSlice\BlockAdditionID
documentation:
The ID of the BlockAdditional Element (0 is the main Block).

A.10.Delay Element

type / id:
uinteger / 0xCE
path:
\Segment\Cluster\BlockGroup\Slices\TimeSlice\Delay
documentation:
The delay to apply to the Element expressed in Track Ticks; see Section 11.1.

A.11.SliceDuration Element

type / id:
uinteger / 0xCF
path:
\Segment\Cluster\BlockGroup\Slices\TimeSlice\SliceDuration
documentation:
The duration to apply to the Element expressed in Track Ticks; see Section 11.1.

A.12.ReferenceFrame Element

type / id:
master / 0xC8
path:
\Segment\Cluster\BlockGroup\ReferenceFrame
documentation:
Contains information about the last reference frame. See [DivXTrickTrack].

A.13.ReferenceOffset Element

type / id:
uinteger / 0xC9
path:
\Segment\Cluster\BlockGroup\ReferenceFrame\ReferenceOffset
documentation:
The relative offset, in bytes, from the previous BlockGroup element for this Smooth FF/RW video track to the containing BlockGroup element. See [DivXTrickTrack].

A.14.ReferenceTimestamp Element

type / id:
uinteger / 0xCA
path:
\Segment\Cluster\BlockGroup\ReferenceFrame\ReferenceTimestamp
documentation:
The timestamp of the BlockGroup pointed to by ReferenceOffset expressed in Track Ticks; see Section 11.1. See [DivXTrickTrack].

A.15.EncryptedBlock Element

type / id:
binary / 0xAF
path:
\Segment\Cluster\EncryptedBlock
documentation:
Similar to SimpleBlock (see Section 10.2), but the data inside the Block is Transformed (encrypted and/or signed).

A.16.MinCache Element

type / id:
uinteger / 0x6DE7
path:
\Segment\Tracks\TrackEntry\MinCache
documentation:
The minimum number of frames a player should be able to cache during playback. If set to 0, the reference pseudo-cache system is not used.

A.17.MaxCache Element

type / id:
uinteger / 0x6DF8
path:
\Segment\Tracks\TrackEntry\MaxCache
documentation:
The maximum cache size necessary to store the referenced frames and the current frame. 0 means no cache is needed.

A.18.TrackOffset Element

type / id:
integer / 0x537F
path:
\Segment\Tracks\TrackEntry\TrackOffset
documentation:
A value to add to the Block's Timestamp expressed in Matroska Ticks -- i.e., in nanoseconds; see Section 11.1. This can be used to adjust the playback offset of a track.

A.19.CodecSettings Element

type / id:
utf-8 / 0x3A9697
path:
\Segment\Tracks\TrackEntry\CodecSettings
documentation:
A string describing the encoding setting used.

A.20.CodecInfoURL Element

type / id:
string / 0x3B4040
path:
\Segment\Tracks\TrackEntry\CodecInfoURL
documentation:
A URL to find information about the codec used.

A.21.CodecDownloadURL Element

type / id:
string / 0x26B240
path:
\Segment\Tracks\TrackEntry\CodecDownloadURL
documentation:
A URL to download the codec used.

A.22.CodecDecodeAll Element

type / id:
uinteger / 0xAA
path:
\Segment\Tracks\TrackEntry\CodecDecodeAll
documentation:
Set to 1 if the codec can decode potentially damaged data.

A.23.TrackOverlay Element

type / id:
uinteger / 0x6FAB
path:
\Segment\Tracks\TrackEntry\TrackOverlay
documentation:
Specify that this track is an overlay track for the Track specified (in the u-integer). That means when this track has a gap on SilentTracks, the overlay track should be used instead. The order of multiple TrackOverlay matters; the first one is the one that should be used. If the first one is not found, it should be the second, etc.

A.24.AspectRatioType Element

type / id:
uinteger / 0x54B3
path:
\Segment\Tracks\TrackEntry\Video\AspectRatioType
documentation:
Specify the possible modifications to the aspect ratio.

A.25.GammaValue Element

type / id:
float / 0x2FB523
path:
\Segment\Tracks\TrackEntry\Video\GammaValue
documentation:
Gamma Value.

A.26.FrameRate Element

type / id:
float / 0x2383E3
path:
\Segment\Tracks\TrackEntry\Video\FrameRate
documentation:
Number of frames per second. This value is Informational only. It is intended for constant frame rate streams and should not be used for a variable frame rate TrackEntry.

A.27.ChannelPositions Element

type / id:
binary / 0x7D7B
path:
\Segment\Tracks\TrackEntry\Audio\ChannelPositions
documentation:
Table of horizontal angles for each successive channel.

A.28.TrickTrackUID Element

type / id:
uinteger / 0xC0
path:
\Segment\Tracks\TrackEntry\TrickTrackUID
documentation:
The TrackUID of the Smooth FF/RW video in the paired EBML structure corresponding to this video track. See [DivXTrickTrack].

A.29.TrickTrackSegmentUID Element

type / id:
binary / 0xC1
path:
\Segment\Tracks\TrackEntry\TrickTrackSegmentUID
documentation:
The SegmentUID of the Segment containing the track identified by TrickTrackUID. See [DivXTrickTrack].

A.30.TrickTrackFlag Element

type / id:
uinteger / 0xC6
path:
\Segment\Tracks\TrackEntry\TrickTrackFlag
documentation:
Set to 1 if this video track is a Smooth FF/RW track. If set to 1, MasterTrackUID and MasterTrackSegUID should be present and BlockGroups for this track must contain ReferenceFrame structures. Otherwise, TrickTrackUID and TrickTrackSegUID must be present if this track has a corresponding Smooth FF/RW track. See [DivXTrickTrack].

A.31.TrickMasterTrackUID Element

type / id:
uinteger / 0xC7
path:
\Segment\Tracks\TrackEntry\TrickMasterTrackUID
documentation:
The TrackUID of the video track in the paired EBML structure that corresponds to this Smooth FF/RW track. See [DivXTrickTrack].

A.32.TrickMasterTrackSegmentUID Element

type / id:
binary / 0xC4
path:
\Segment\Tracks\TrackEntry\TrickMasterTrackSegmentUID
documentation:
The SegmentUID of the Segment containing the track identified by MasterTrackUID. See [DivXTrickTrack].

A.33.ContentSignature Element

type / id:
binary / 0x47E3
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentSignature
documentation:
A cryptographic signature of the contents.

A.34.ContentSigKeyID Element

type / id:
binary / 0x47E4
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentSigKeyID
documentation:
This is the ID of the private key that the data was signed with.

A.35.ContentSigAlgo Element

type / id:
uinteger / 0x47E5
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentSigAlgo
documentation:
The algorithm used for the signature.

A.36.ContentSigHashAlgo Element

type / id:
uinteger / 0x47E6
path:
\Segment\Tracks\TrackEntry\ContentEncodings\ContentEncoding\ContentEncryption\ContentSigHashAlgo
documentation:
The hash algorithm used for the signature.

A.37.CueRefCluster Element

type / id:
uinteger / 0x97
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueReference\CueRefCluster
documentation:
The Segment Position of the Cluster containing the referenced Block.

A.38.CueRefNumber Element

type / id:
uinteger / 0x535F
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueReference\CueRefNumber
documentation:
Number of the referenced Block of Track X in the specified Cluster.

A.39.CueRefCodecState Element

type / id:
uinteger / 0xEB
path:
\Segment\Cues\CuePoint\CueTrackPositions\CueReference\CueRefCodecState
documentation:
The Segment Position of the Codec State corresponding to this referenced Element. 0 means that the data is taken from the initial Track Entry.

A.40.FileReferral Element

type / id:
binary / 0x4675
path:
\Segment\Attachments\AttachedFile\FileReferral
documentation:
A binary value that a track/codec can refer to when the attachment is needed.

A.41.FileUsedStartTime Element

type / id:
uinteger / 0x4661
path:
\Segment\Attachments\AttachedFile\FileUsedStartTime
documentation:
The timestamp at which this optimized font attachment comes into context and is expressed in Segment Ticks, which are based on TimestampScale. See [DivXWorldFonts].

A.42.FileUsedEndTime Element

type / id:
uinteger / 0x4662
path:
\Segment\Attachments\AttachedFile\FileUsedEndTime
documentation:
The timestamp at which this optimized font attachment goes out of context and is expressed in Segment Ticks, which are based on TimestampScale. See [DivXWorldFonts].

A.43.TagDefaultBogus Element

type / id:
uinteger / 0x44B4
path:
\Segment\Tags\Tag\+SimpleTag\TagDefaultBogus
documentation:
A variant of the TagDefault element with a bogus Element ID. See Section 5.1.8.1.2.4.

Authors' Addresses

Steve Lhomme
Email: slhomme@matroska.org
Moritz Bunkus
Email: moritz@bunkus.org
Dave Rice
Email: dave@dericed.com
