RFC 9512 | YAML Media Type | February 2024 |
Polli, et al. | Informational | [Page] |
This document registers theapplication/yaml
media type and the+yaml
structured syntax suffix with IANA. Both identify documentcomponents that are serialized according to the YAML specification.¶
This document is not an Internet Standards Track specification; it is published for informational purposes.¶
This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Not all documents approved by the IESG are candidates for any level of Internet Standard; see Section 2 of RFC 7841.¶
Information about the current status of this document, any errata, and how to provide feedback on it may be obtained athttps://www.rfc-editor.org/info/rfc9512.¶
Copyright (c) 2024 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.¶
YAML[YAML] is a data serialization format that is capable of conveying one or multiple documents in a single presentation stream (e.g., a file or a network resource). It is widely used on the Internet, including in the API sector (e.g., see[OAS]), but a corresponding media type and structured syntax suffix had not previously been registered by IANA.¶
To increase interoperability when exchanging YAML streams and leverage content negotiation mechanisms when exchanging YAML resources, this specification registers theapplication/yaml
media type and the+yaml
structured syntax suffix[MEDIATYPE].¶
Moreover, it provides security considerations and interoperability considerations related to[YAML], including its relation with[JSON].¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14[RFC2119][RFC8174] when, and only when, they appear in all capitals, as shown here.¶
The terms "content negotiation" and "resource" in this document are to be interpreted as in[HTTP].¶
The terms "fragment" and "fragment identifier" in this document are to be interpreted as in[URI].¶
The terms "presentation", "stream", "YAML document", "representation graph", "tag", "serialization detail", "node", "alias node", "anchor", and "anchor name" in this document are to be interpreted as in[YAML].¶
Figures containing YAML code always start with the%YAML
directive to improve readability.¶
A fragment identifies a node in a stream.¶
A fragment identifier starting with "*"is to be interpreted as a YAML alias node (seeSection 1.2.1).¶
For single-document YAML streams, a fragment identifier that is empty or that starts with "/" is to be interpreted as a JSON Pointer[JSON-POINTER] and is evaluated on the YAML representation graph, traversing alias nodes; in particular, the empty fragment identifier references the root node. This syntax can only reference the YAML nodes that are on a path that is made up of nodes interoperable with the JSON data model (seeSection 3.4).¶
A fragment identifier is not guaranteed to reference an existing node.Therefore, applicationsSHOULD define how an unresolved alias nodeought to be handled.¶
This section describes how to usealias nodes (see Sections 3.2.2.2 and 7.1 of[YAML])as fragment identifiers to designate nodes.¶
A YAML alias node can be represented in a URI fragment identifierby encoding it into bytes using UTF-8[UTF-8],but percent-encoding of those characters is not allowed by the fragment ruleinSection 3.5 of [URI].¶
If multiple nodes match a fragment identifier,the first occurrence of such a match is selected.¶
Users concerned with interoperability of fragment identifiers:¶
In the example resource below, the relative reference (seeSection 4.2 of [URI])file.yaml#*foo
identifies the first alias node*foo
pointing to the node with valuescalar
and not to the one in the second document, whereas the relative referencefile.yaml#*document_2
identifies the root node of the second document{one: [a, sequence]}
.¶
%YAML 1.2 --- one: &foo scalar two: &bar - some - sequence - items ... %YAML 1.2 --- &document_2 one: &foo [a, sequence]
This section includes the information required for IANA to register theapplication/yaml
media type and the+yaml
structured syntax suffix per[MEDIATYPE].¶
application/yaml
The media type for YAML isapplication/yaml
;the following information serves as the registration form for this media type.¶
application¶
yaml¶
N/A¶
N/A; unrecognized parameters should be ignored.¶
binary¶
Applications that need a human-friendly, cross-language, and Unicode-based data serialization language designed around the common data types of dynamic programming languages.¶
SeeSection 1.2 of this document.¶
See the Authors' Addresses section of this document.¶
COMMON¶
None¶
See the Authors' Addresses section of this document.¶
IETF¶
+yaml
Structured Syntax SuffixThe suffix+yaml
MAY be used with any media type whose representation followsthat established forapplication/yaml
.The structured syntax suffix registration form follows. See[MEDIATYPE] for definitions of each part of the registration form.¶
YAML Ain't Markup Language (YAML)¶
+yaml
¶
Same asapplication/yaml
¶
Same asapplication/yaml
¶
Unlikeapplication/yaml
,there is no fragment identification syntax definedfor+yaml
.¶
A specificxxx/yyy+yaml
media typeneeds to define the syntax and semantics for fragment identifiersbecause the ones defined forapplication/yaml
do not apply unless explicitly expressed.¶
Same asapplication/yaml
¶
httpapi@ietf.org or art@ietf.org¶
See the Authors' Addresses section of this document.¶
IETF¶
YAML is an evolving language, and over time,some features have been added and others removed.¶
Theapplication/yaml
media type registration is independent of the YAML version.This allows content negotiation of version-independent YAML resources.¶
Implementers concerned about features related to a specific YAML versioncan specify it in YAML documents using the%YAML
directive(see Section 6.8.1 of[YAML]).¶
A YAML stream can contain zero or more YAML documents.¶
When receiving a multi-document stream,an application that only expects single-document streamsshould signal an error instead of ignoring the extra documents.¶
Current implementations consider different documents in a stream independent,similarly to JSON text sequences (see[RFC7464]);elements such as anchors are not guaranteed to be referenceableacross different documents.¶
The "yaml" filename extension is the preferred one;it is the most popular and widely used on the web.The "yml" filename extension is still used.The simultaneous usage of two filename extensions in the same contextmight cause interoperability issues(e.g., when both a "config.yaml" and a "config.yml" are present).¶
When using flow collection styles (see Section 7.4 of[YAML]),a YAML document could look like JSON[JSON];thus, similar interoperability considerations apply.¶
When using YAML as a more efficient formatto serialize information intended to be consumed as JSON,information not reflected in the representation graphand classified as presentation or serialization details(see Section 3.2 of[YAML]) can be discarded.This includes comments (see Section 3.2.3.3 of[YAML]),directives, and alias nodes (see Section 7.1 of[YAML])that do not have a JSON counterpart.¶
%YAML 1.2 --- # This comment will be lost # when serializing in JSON. Title: type: string maxLength: &text_limit 64 Name: type: string maxLength: *text_limit # Replaced by the value 64.
Implementers need to ensure that relevantinformation will not be lost during processing.For example, they might consider alias nodes being replaced by static values as acceptable.¶
In some cases, an implementer may want todefine a list of allowed YAML features,taking into account that the following features might have interoperabilityissues with[JSON]:¶
.inf
and.nan
float values, since JSON does not support them¶!!timestamp
that were included in the default schema of older YAML versions¶!!python/object
and!mytag
(see Section 2.4 of[YAML])¶%YAML 1.2 --- non-json-keys: 0: a number [0, 1]: a sequence ? {k: v} : a map --- non-json-keys: !date 2020-01-01: a timestamp non-json-value: !date 2020-01-01 ...
To allow fragment identifiers to traverse alias nodes, the YAML representation graph needs to be generated before the fragment identifier evaluation. It is important that this evaluation does not cause the issues mentioned in Sections3.4 and4, such as infinite loops and unexpected code execution.¶
Implementers need to consider that the YAML version and supported features (e.g., merge keys)can affect the generation of the representation graph (seeFigure 9).¶
InSection 1.2, this document extends the use of specifications based onthe JSON data model with support for YAML fragment identifiers.This is to improve the interoperability of already-consolidated practices,such as writingOpenAPI documents [OAS] in YAML.¶
Appendix A provides a non-exhaustive list of examples to helpreaders understand interoperability issues related to fragment identifiers.¶
Security requirements for both media types and media type suffixesare discussed inSection 4.6 of [MEDIATYPE].¶
Care should be used when using YAML tags because their resolution might trigger unexpected code execution.¶
Code execution in deserializers should be disabled by defaultand only be enabled explicitly.In the latter case, the implementation should ensure (for example, via specific functions)that the code execution results in strictly bounded time/memory limits.¶
Many implementations provide safe deserializers that address these issues.¶
YAML documents are rooted, connected, directed graphsand can contain reference cycles,so they can't be treated as simple trees (see Section 3.2.1 of[YAML]).An implementation that treats them as simple treesrisks going into an infinite loop while traversing the YAML representation graph.This can happen:¶
%YAML 1.2 --- x: &x y: *x
Even if a representation graph is not cyclic, treating it as a simple tree could lead to improper behaviors, such as triggering an Exponential Data Expansion (e.g., a Billion Laughs Attack).¶
%YAML 1.2 --- x1: &a1 ["a", "a"] x2: &a2 [*a1, *a1] x3: &a3 [*a2, *a2]
This can be addressed using processors that limit the anchor recursion depthand validate the input before processing it;even in these cases, it is importantto carefully test the implementation you are going to use.The same considerations apply when serializing a YAML representation graphin a format that does not support reference cycles (seeSection 3.4).¶
Incremental parsing and processing of a YAML stream can produce partial resultsand later indicate failure to parse the remainder of the stream;to prevent partial processing, implementers might prefer validating and processing all the documents in a stream at the same time.¶
Repeated parsing and re-encoding of a YAML stream can resultin the addition or removal of document delimiters (e.g.,---
or...
)as well as the modification of anchor names and other serialization details that can break signature validation.¶
Section 10.3.2 of[YAML] specifies that only the scalars matching theregular expressiontrue|True|TRUE|false|False|FALSE
are interpreted as booleans.Older YAML versions were more tolerant (e.g., interpretingNO
andN
asFalse
and interpretingYES
andY
asTrue
).When the older syntax is used, a YAML implementation could then interpret{insecure: n}
as{insecure: "n"}
instead of{insecure: false}
.Using the syntax defined in Section 10.3.2 of[YAML] prevents these issues.¶
IANA has updated the"Media Types" registry with the registration information inSection 2.1 for the media typeapplication/yaml
.¶
IANA has updated the"Structured Syntax Suffixes" registry with the registration information inSection 2.2 for the structured syntax suffix+yaml
.¶
This example shows a couple of YAML nodes that cannot be referenced based on the JSON data model since their mapping keys are not strings.¶
%YAML 1.2 --- a-map-cannot: ? {be: expressed} : with a JSON Pointer 0: no numeric mapping keys in JSON
In this example, the fragment#/0
does not reference an existing node.¶
%YAML 1.2 --- 0: "JSON Pointer `#/0` references a string mapping key."
In this YAML document, the#/foo/bar/baz
fragment identifiertraverses the representation graph and references the stringyou
.Moreover, the presence of a cyclic reference implies thatthere are infinite fragment identifiers#/foo/bat/../bat/bar
referencing the&anchor
node.¶
%YAML 1.2 --- anchor: &anchor baz: you foo: &foo bar: *anchor bat: *foo
Many YAML implementations will resolvethe merge key "<<:" defined in YAML 1.1in the representation graph.This means that the fragment#/book/author/given_name
references the stringFederico
and that the fragment#/book/<<
will not reference any existing node.¶
%YAML 1.1 --- # Many implementations use merge keys. the-viceroys: &the-viceroys title: The Viceroys author: given_name: Federico family_name: De Roberto book: <<: *the-viceroys title: The Illusion
Thanks toErik Wilde andDavid Biesack for being the initial contributors to this specificationand toDarrel Miller andRich Salz for their support during the adoption phase.¶
In addition, this document owes a lot to the extensive discussion inside and outside the HTTPAPI Working Group. The following contributors helped improve this specification by opening pull requests, reporting bugs, asking smart questions, drafting or reviewing text, and evaluating open issues:Tina (tinita) Müller,Ben Hutton,Carsten Bormann,Manu Sporny, andJason Desrosiers.¶