Please check theerrata for any errors or issues reported since publication.
The English version of this specification is the only normative version. Non-normativetranslations may also be available.
Copyright © 2010-2014W3C® (MIT,ERCIM,Keio,Beihang), All Rights Reserved.W3Cliability,trademark anddocument use rules apply.
This document defines a textual syntax for RDF called TriGthat allows an RDF dataset to be completely written in a compact andnatural text form, with abbreviations for common usage patterns anddatatypes. TriG is an extension of theTurtle [TURTLE] format.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of currentW3C publications and the latest revision of this technical report can be found in theW3C technical reports index at http://www.w3.org/TR/.
This document is part of the RDF 1.1 document suite.TriG is intended the meet the charter requirement of theRDF Working Group todefine an RDF syntax for multiple graphs. TriG is an extension of theTurtlesyntax for RDF [TURTLE]. The current document is based onthe original proposal by Chris Bizer and Richard Cyganiak.
This document was published by theRDF Working Group as a Recommendation. If you wish to make comments regarding this document, please send them topublic-rdf-comments@w3.org (subscribe,archives). All comments are welcome.
Please see the Working Group'simplementation report.
This document has been reviewed byW3C Members, by software developers, and by otherW3C groups and interested parties, and is endorsed by the Director as aW3C Recommendation. It is a stable document and may be used as reference material or cited from another document.W3C's role in making the Recommendation is to draw attention to the specification and to promote its widespread deployment. This enhances the functionality and interoperability of the Web.
This document was produced by a group operating under the5 February 2004W3C Patent Policy.W3C maintains apublic list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes containsEssential Claim(s) must disclose the information in accordance withsection 6 of theW3C Patent Policy.
This document defines TriG, a concrete syntax for RDF as defined in theRDF Concepts and Abstract Syntax document[RDF11-CONCEPTS]. TriG is an extension ofTurtle [TURTLE], extendedto support representing a complete RDF Dataset.
This section is non-normative.
A TriG document allows writing down an RDF Dataset in a compacttextual form. It consists of a sequence of directives, triple statements, graph statements which contain triple-generating statements and optional blank lines.Comments may be given after a#
that is not part of anotherlexical token and continue to the end of the line.
Graph statements are a pair of an IRI or blank node label and a group of triple statementssurrounded by{}
. The IRI or blank node label of the graph statement may be used in another graph statement which implies taking the union of the tripes generatedby each graph statement. An IRI or blank node label used as a graph label may also reoccur as part of any triple statement.Optionally a graph statement may not not be labeled with an IRI. Such agraph statement corresponds to the Default Graph of an RDF Dataset.
The construction of an RDF Dataset from a TriG document is defined insection4.TriG Grammar andsection5.Parsing.
As TriG is an extention of the Turtle language it allows for any constructs from theTurtle language.Simple Triples,Predicate Lists, andObject Lists can all be used either inside a graph statement, or on their own as in a Turtle document. When outside a graph statement, the triples are considered to be part of the default graph of the RDF Dataset.
A graph statement pairs an IRI or blank node with a RDF graph. The triple statements that make up the graph are enclosed in{}
.
In a TriG document a graph IRI or blank node may be used as label for more than one graph statements. The graph label of a graph statement may be omitted. In this case the graph is considered the default graph of the RDF Dataset.
A RDF Dataset might contain only a single graph.
# This document encodes one graph.@prefix ex: <http://www.example.org/vocabulary#> .@prefix : <http://www.example.org/exampleDocument#> .:G1 { :Monica a ex:Person ; ex:name "Monica Murphy" ; ex:homepage <http://www.monicamurphy.org> ; ex:email <mailto:monica@monicamurphy.org> ; ex:hasSkill ex:Management , ex:Programming . }
A RDF Dataset may contain a default graph, and named graphs.
# This document contains a default graph and two named graphs.@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix dc: <http://purl.org/dc/terms/> .@prefix foaf: <http://xmlns.com/foaf/0.1/> .# default graph { <http://example.org/bob> dc:publisher "Bob" . <http://example.org/alice> dc:publisher "Alice" . }<http://example.org/bob> { _:a foaf:name "Bob" . _:a foaf:mbox <mailto:bob@oldcorp.example.org> . _:a foaf:knows _:b . }<http://example.org/alice> { _:b foaf:name "Alice" . _:b foaf:mbox <mailto:alice@work.example.org> . }
TriG provides various alternative ways to write graphsand triples, giving the data writer choices for clarity:
# This document contains a same data as theprevious example.@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix dc: <http://purl.org/dc/terms/> .@prefix foaf: <http://xmlns.com/foaf/0.1/> .# default graph - no {} used.<http://example.org/bob> dc:publisher "Bob" .<http://example.org/alice> dc:publisher "Alice" .# GRAPH keyword to highlight a named graph# Abbreviation of triples using ;GRAPH <http://example.org/bob>{ [] foaf:name "Bob" ; foaf:mbox <mailto:bob@oldcorp.example.org> ; foaf:knows _:b .}GRAPH <http://example.org/alice>{ _:b foaf:name "Alice" ; foaf:mbox <mailto:alice@work.example.org>}
All other terms and directives come from Turtle.
BlankNodes sharing the same label in differently labeled graph statements are considered to be the same BlankNode.
As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.
The key wordsMUST,MUST NOT,REQUIRED,SHOULD,SHOULD NOT,RECOMMENDED,MAY, andOPTIONAL in this specification are to be interpreted as described in [RFC2119].
This specification defines conformance criteria for:
A conformingTriG document is a Unicode string that conforms to the grammar and additional constraints defined insection4.TriG Grammar, starting with thetrigDoc
production. A TriG document serializes an RDF dataset.
A conformingTriG parser is a system capable of reading TriG documents on behalf of an application. It makes the serialized RDF dataset, as defined insection5.Parsing, available to the application, usually through some form of API.
The IRI that identifies the TriG language is:http://www.w3.org/ns/formats/TriG
This specification does not define how TriG parsers handle non-conforming input documents.
The media type of TriG isapplication/trig
. The content encoding of TriG content is always UTF-8.
A TriG document is a Unicode [UNICODE] character string encoded in UTF-8. Unicode characters only in the range U+0000 to U+10FFFF inclusive are allowed.
White space (productionWS) is used to separate two terminals which would otherwise be (mis-)recognized as one terminal. Rule names below in capitals indicate where white space is significant; these form a possible choice of terminals for constructing a TriG parser.
White space is significant in the productionString.
Comments in TriG take the form of '#', outside anIRI or astring, and continue to the end of line (marked by characters U+000D or U+000A) or end of file if there is no end of line after the comment marker. Comments are treated as white space.
Relative IRIs are resolved with base IRIs as perUniform Resource Identifier (URI): Generic Syntax [RFC3986] using only the basic algorithm in section 5.2. Neither Syntax-Based Normalization nor Scheme-Based Normalization (described in sections 6.2.2 and 6.2.3 of RFC3986) are performed. Characters additionally allowed in IRI references are treated in the same way that unreserved characters are treated in URI references, per section 6.5 ofInternationalized Resource Identifiers (IRIs) [RFC3987].
The@base
directive defines the Base IRI used to resolve relative IRIs per RFC3986 section 5.1.1, "Base URI Embedded in Content". Section 5.1.2, "Base URI from the Encapsulating Entity" defines how the In-Scope Base IRI may come from an encapsulating document, such as a SOAP envelope with an xml:base directive or a mime multipart document with a Content-Location header. The "Retrieval URI" identified in 5.1.3, Base "URI from the Retrieval URI", is the URL from which a particular TriG document was retrieved. If none of the above specifies the Base URI, the default Base URI (section 5.1.4, "Default Base URI") is used. Each@base
directive sets a new In-Scope Base URI, relative to the previous one.
There are three forms of escapes used in TriG documents:
numeric escape sequences represent Unicode code points:
Escape sequence | Unicode code point |
---|---|
'\u'hexhexhexhex | A Unicode character in the range U+0000 to U+FFFF inclusive corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit. |
'\U'hexhexhexhexhexhexhexhex | A Unicode character in the range U+0000 to U+10FFFF inclusive corresponding to the value encoded by the eight hexadecimal digits interpreted from most significant to least significant digit. |
whereHEX is a hexadecimal character
HEX ::= [0-9] | [A-F] | [a-f]
string escape sequences represent the characters traditionally escaped in string literals:
Escape sequence | Unicode code point |
---|---|
'\t' | U+0009 |
'\b' | U+0008 |
'\n' | U+000A |
'\r' | U+000D |
'\f' | U+000C |
'\"' | U+0022 |
'\'' | U+0027 |
'\\' | U+005C |
reserved character escape sequences consist of a '\' followed by one of~.-!$&'()*+,;=/?#@%_
and represent the character to the right of the '\'.
numeric escapes | string escapes | reserved character escapes | |
---|---|---|---|
IRIs, used asRDF terms or as in@prefix or@base declarations | yes | no | no |
local names | no | no | yes |
Strings | yes | yes | no |
%-encoded sequences are in thecharacter range for IRIs and areexplicitly allowed in local names. These appear as a '%' followed by two hex characters and represent that same sequence of three characters. These sequences arenot decoded during processing. A term written as<http://a.example/%66oo-bar>
in TriG designates the IRIhttp://a.example/%66oo-bar
and not IRIhttp://a.example/foo-bar
. A term written asex:%66oo-bar
with a prefix@prefix ex: <http://a.example/>
also designates the IRIhttp://a.example/%66oo-bar
.
TheEBNF used here is defined in XML 1.0 [EBNF-NOTATION]. Production labels consisting of a number and a final 'g' are unique to TriG. All Production labels consisting of only a number reference the production with that number in theTurtle grammar[TURTLE]. Production labels consisting of a number and a final 's', e.g. [60s], reference the production with that number in the documentSPARQL 1.1 Query Language grammar [SPARQL11-QUERY].
Notes:
@base
', '@prefix
', 'a
', 'true
', 'false
') are case-sensitive. Keywords in double quotes ( "BASE
", "PREFIX
" "GRAPH
" ) are case-insensitive.\u
,\U
and those inECHAR
are case sensitive.trigDoc
.ANON
::='[
'WS*
']
' token allows any amount of white space and comments between[]
s. The single space version is used in the grammar for clarity.@prefix
' and '@base
' match the pattern forLANGTAG, though neither "prefix
" nor "base
" areregistered languagesubtags. This specification does not define whether a quoted literal followed by either of these tokens (e.g."Z"@base
) is in the TriG language.[1g] | trigDoc | ::= | (directive| block)* |
[2g] | block | ::= | triplesOrGraph| wrappedGraph| triples2| "GRAPH "labelOrSubjectwrappedGraph |
[3g] | triplesOrGraph | ::= | labelOrSubject (wrappedGraph| predicateObjectList '. ') |
[4g] | triples2 | ::= | blankNodePropertyListpredicateObjectList? '. '| collectionpredicateObjectList '. ' |
[5g] | wrappedGraph | ::= | '{ 'triplesBlock? '} ' |
[6g] | triplesBlock | ::= | triples ('. 'triplesBlock?)? |
[7g] | labelOrSubject | ::= | iri| BlankNode |
[3] | directive | ::= | prefixID| base| sparqlPrefix| sparqlBase |
[4] | prefixID | ::= | '@prefix 'PNAME_NSIRIREF '. ' |
[5] | base | ::= | '@base 'IRIREF '. ' |
[5s] | sparqlPrefix | ::= | "PREFIX "PNAME_NSIRIREF |
[6s] | sparqlBase | ::= | "BASE "IRIREF |
[6] | triples | ::= | subjectpredicateObjectList| blankNodePropertyListpredicateObjectList? |
[7] | predicateObjectList | ::= | verbobjectList ('; ' (verbobjectList)?)* |
[8] | objectList | ::= | object (', 'object)* |
[9] | verb | ::= | predicate| 'a ' |
[10] | subject | ::= | iri| blank |
[11] | predicate | ::= | iri |
[12] | object | ::= | iri| blank| blankNodePropertyList| literal |
[13] | literal | ::= | RDFLiteral| NumericLiteral| BooleanLiteral |
[14] | blank | ::= | BlankNode| collection |
[15] | blankNodePropertyList | ::= | '[ 'predicateObjectList '] ' |
[16] | collection | ::= | '( 'object* ') ' |
[17] | NumericLiteral | ::= | INTEGER| DECIMAL| DOUBLE |
[128s] | RDFLiteral | ::= | String (LANGTAG| '^^ 'iri)? |
[133s] | BooleanLiteral | ::= | 'true '| 'false ' |
[18] | String | ::= | STRING_LITERAL_QUOTE| STRING_LITERAL_SINGLE_QUOTE| STRING_LITERAL_LONG_SINGLE_QUOTE| STRING_LITERAL_LONG_QUOTE |
[135s] | iri | ::= | IRIREF| PrefixedName |
[136s] | PrefixedName | ::= | PNAME_LN| PNAME_NS |
[137s] | BlankNode | ::= | BLANK_NODE_LABEL| ANON |
Productions for terminals | |||
[19] | IRIREF | ::= | '< ' ([^#x00-#x20<>"{}|^`\ ]| UCHAR)* '> ' |
[139s] | PNAME_NS | ::= | PN_PREFIX? ': ' |
[140s] | PNAME_LN | ::= | PNAME_NSPN_LOCAL |
[141s] | BLANK_NODE_LABEL | ::= | '_: ' (PN_CHARS_U| [0-9 ]) ((PN_CHARS| '. ')* PN_CHARS)? |
[144s] | LANGTAG | ::= | '@ ' [a-zA-Z ]+ ('- ' [a-zA-Z0-9 ]+ )* |
[20] | INTEGER | ::= | [+- ]? [0-9 ]+ |
[21] | DECIMAL | ::= | [+- ]? ([0-9 ]* '. ' [0-9 ]+ ) |
[22] | DOUBLE | ::= | [+- ]? ([0-9 ]+ '. ' [0-9 ]* EXPONENT| '. ' [0-9 ]+ EXPONENT| [0-9 ]+ EXPONENT) |
[154s] | EXPONENT | ::= | [eE ] [+- ]? [0-9 ]+ |
[23] | STRING_LITERAL_QUOTE | ::= | '" ' ([^#x22#x5C#xA#xD ]| ECHAR| UCHAR)* '" ' |
[24] | STRING_LITERAL_SINGLE_QUOTE | ::= | "' " ([^#x27#x5C#xA#xD ]| ECHAR| UCHAR)* "' " |
[25] | STRING_LITERAL_LONG_SINGLE_QUOTE | ::= | "''' " (("' "| "'' ")? ([^'\ ]| ECHAR| UCHAR))* "''' " |
[26] | STRING_LITERAL_LONG_QUOTE | ::= | '""" ' (('" '| '"" ')? ([^"\ ]| ECHAR| UCHAR))* '""" ' |
[27] | UCHAR | ::= | '\u 'HEXHEXHEXHEX| '\U 'HEXHEXHEXHEXHEXHEXHEXHEX |
[159s] | ECHAR | ::= | '\ ' [tbnrf"'\ ] |
[160s] | NIL | ::= | '( 'WS* ') ' |
[161s] | WS | ::= | #x20 | #x9 | #xD | #xA |
[162s] | ANON | ::= | '[ 'WS* '] ' |
[163s] | PN_CHARS_BASE | ::= | [A-Z ]| [a-z ]| [#00C0-#00D6 ]| [#00D8-#00F6 ]| [#00F8-#02FF ]| [#0370-#037D ]| [#037F-#1FFF ]| [#200C-#200D ]| [#2070-#218F ]| [#2C00-#2FEF ]| [#3001-#D7FF ]| [#F900-#FDCF ]| [#FDF0-#FFFD ]| [#10000-#EFFFF ] |
[164s] | PN_CHARS_U | ::= | PN_CHARS_BASE| '_ ' |
[166s] | PN_CHARS | ::= | PN_CHARS_U| '- '| [0-9 ]| #00B7 | [#0300-#036F ]| [#203F-#2040 ] |
[167s] | PN_PREFIX | ::= | PN_CHARS_BASE ((PN_CHARS| '. ')* PN_CHARS)? |
[168s] | PN_LOCAL | ::= | (PN_CHARS_U| ': '| [0-9 ]| PLX) ((PN_CHARS| '. '| ': '| PLX)* (PN_CHARS| ': '| PLX))? |
[169s] | PLX | ::= | PERCENT| PN_LOCAL_ESC |
[170s] | PERCENT | ::= | '% 'HEXHEX |
[171s] | HEX | ::= | [0-9 ]| [A-F ]| [a-f ] |
[172s] | PN_LOCAL_ESC | ::= | '\ ' ('_ '| '~ '| '. '| '- '| '! '| '$ '| '& '| "' "| '( '| ') '| '* '| '+ '| ', '| '; '| '= '| '/ '| '? '| '# '| '@ '| '% ') |
The RDF Concepts and Abstract Syntax [RDF11-CONCEPTS]specification defines three types ofRDFTerm:IRIs,literals andblank nodes. Literals are composed of alexical form and an optionallanguage tag [BCP47] or datatype IRI. An extra type,prefix
, is used during parsing to map string identifiers to namespace IRIs. This section maps a string conforming to the grammar insection4.5Grammar to a set of triples by mapping strings matching productions and lexical tokens to RDF terms or their components (e.g. language tags, lexical forms of literals). Grammar productions change the parser state and emit triples.
Parsing TriG requires a state of six items:
baseURI
— When thebase production is reached, the second rule argument,IRIREF
, is the base URI used for relative IRI resolution.namespaces
— The second and third rule arguments (PNAME_NS
andIRIREF
) in theprefixID production assign a namespace name (IRIREF
) for the prefix (PNAME_NS
). Outside of aprefixID
production, anyPNAME_NS
is substituted with the namespace. Note that the prefix may be an empty string, per thePNAME_NS,
production:(PN_PREFIX)? ":"
.bnodeLabels
— A mapping from string to blank node.curSubject
— ThecurSubject
is bound to thesubject
production.curPredicate
— ThecurPredicate
is bound to theverb
production. If token matched was "a
",curPredicate
is bound to the IRIhttp://www.w3.org/1999/02/22-rdf-syntax-ns#type
.curGraph
— ThecurGraph
is bound to the label of the graph that is the destination of triples produced in parsing. When undefined, triples are destined for the default graph.This table maps productions and lexical tokens toRDF terms
or components ofRDF terms
listed insection5.Parsing:
production | type | procedure |
---|---|---|
IRIREF | IRI | The characters between "<" and ">" are taken, with thenumeric escape sequences unescaped, to form the unicode string of the IRI. Relative IRI resolution is performed persection4.3IRI References. |
PNAME_NS | prefix | When used in aprefixID orsparqlPrefix production, theprefix is the potentially empty unicode string matching the first argument of the rule is a key into thenamespaces map. |
IRI | When used in aPrefixedName production, theiri is the value in thenamespaces map corresponding to the first argument of the rule. | |
PNAME_LN | IRI | A potentially emptyprefix is identified by the first sequence,PNAME_NS . Thenamespaces mapMUST have a correspondingnamespace . The unicode string of the IRI is formed by unescaping thereserved characters in the second argument,PN_LOCAL , and concatenating this onto thenamespace . |
STRING_LITERAL_SINGLE_QUOTE | lexical form | The characters between the outermost "'"s are taken, withnumeric andstring escape sequences unescaped, to form the unicode string of a lexical form. |
STRING_LITERAL_QUOTE | lexical form | The characters between the outermost '"'s are taken, withnumeric andstring escape sequences unescaped, to form the unicode string of a lexical form. |
STRING_LITERAL_LONG_SINGLE_QUOTE | lexical form | The characters between the outermost "'''"s are taken, withnumeric andstring escape sequences unescaped, to form the unicode string of a lexical form. |
STRING_LITERAL_LONG_QUOTE | lexical form | The characters between the outermost '"""'s are taken, withnumeric andstring escape sequences unescaped, to form the unicode string of a lexical form. |
LANGTAG | language tag | The characters following the@ form the unicode string of the language tag. |
RDFLiteral | literal | The literal has a lexical form of the first rule argument,String , and either a language tag ofLANGTAG or a datatype IRI ofiri , depending on which rule matched the input. If theLANGTAG rule matched, the datatype isrdf:langString and the language tag isLANGTAG . If neither a language tag nor a datatype IRI is provided, the literal has a datatype ofxsd:string . |
INTEGER | literal | The literal has a lexical form of the input string, and a datatype ofxsd:integer . |
DECIMAL | literal | The literal has a lexical form of the input string, and a datatype ofxsd:decimal . |
DOUBLE | literal | The literal has a lexical form of the input string, and a datatype ofxsd:double . |
BooleanLiteral | literal | The literal has a lexical form of thetrue orfalse , depending on which matched the input, and a datatype ofxsd:boolean . |
BLANK_NODE_LABEL | blank node | The string matching the second argument,PN_LOCAL , is a key inbnodeLabels. If there is no corresponding blank node in the map, one is allocated. |
ANON | blank node | A blank node is generated. |
blankNodePropertyList | blank node | A blank node is generated. Note the rules forblankNodePropertyList in the next section. |
collection | blank node | For non-empty lists, a blank node is generated. Note the rules forcollection in the next section. |
IRI | For empty lists, the resulting IRI isrdf:nil . Note the rules forcollection in the next section. |
A TriG document defines anRDF Dataset composed of one default graph and zero or more named graphs. Each graph is composed of a set ofRDF triples.
The statecurGraph
is initially unset. It records the label of the graph for triples produced during parsing. If undefined, the default graph is used.
The rulelabelOrSubject
sets bothcurGraph
andcurSubject
(only one of these will be used).
The following grammar production clauses setcurGraph
to be undefined, indicating the default graph:
The grammar productionlabelOrSubject predicateObjectList '.'
unsetscurGraph
before handlingpredicateObjectLists
in ruletriplesOrGraph
.
Each RDF triple produced is added tocurGraph
, or the default graph ifcurGraph
is not set at that point in the parsing process.
Thesubject
production sets thecurSubject
. Theverb
production sets thecurPredicate
.
Triples are produced at the following points in the parsing process and each RDF triple produced is added to the graph identified bycurGraph
.
EachobjectN
in the document produces an RDF triple:curSubject
curPredicate
N
.
Beginning theblankNodePropertyList
production records thecurSubject
andcurPredicate
, and setscurSubject
to a novelblank node
B
. Finishing theblankNodePropertyList
production restorescurSubject
andcurPredicate
. The node produced by matchingblankNodePropertyList
is the blank nodeB
.
Beginning thecollection
production records thecurSubject
andcurPredicate
. Eachobject
in thecollection
production has acurSubject
set to a novelblank node
B
and acurPredicate
set tordf:first
. For each objectobjectn
after the first produces a triple:objectn-1
rdf:rest
objectn
. Finishing thecollection
production creates an additional triplecurSubject rdf:rest rdf:nil
. and restorescurSubject
andcurPredicate
The node produced by matchingcollection
is the first blank nodeB
for non-empty lists andrdf:nil
for empty lists.
This section is non-normative.
The editors gratefully acknowledge the work of Chris Bizer and Richard Cyganiak in creating the original TriG specification. Valuable contributions to this version were made by Gregg Kellogg, Eric Prud'hommeaux and Sandro Hawke.
The document was improved through the review process by the wider community.
This section is non-normative.
This section describes the main differences between TriG, as defined in this document, and earlier forms.
{
...}
.=
graph naming operator or optional "." after each graph.BASE
,PREFIX
as in [TURTLE].GRAPH
keyword is allowed to aid SPARQL alignment.The Internet Media Type / MIME Type for TriG is "application/trig".
It is recommended that TriG files have the extension ".trig" (all lowercase) on all platforms.
It is recommended that TriG files stored on Macintosh HFS file systems be given a file type of "TEXT".
This information that follows will be submitted to the IESG for review, approval, and registration with IANA.
Error in grammar productions [24] and [25] fixed.