Movatterモバイル変換

Abstract

RDF is a directed, labeled graph data model for representing information in the Web. This specification defines the syntax and semantics of the SPARQL Query Language for RDF. SPARQL can be used to express queries across diverse data sources, whether the data is stored natively as RDF or viewed as RDF via middleware. SPARQL contains capabilities for querying required and optional graph patterns along with their conjunctions and disjunctions. SPARQL also supports aggregation, subqueries, negation, creating values by expressions, extensible value testing, and constraining queries by source RDF graph. The results of SPARQL queries can be result sets or RDF graphs.

Status of This Document

This section describes the status of this document at the time of its publication. A list of currentW3C publications and the latest revision of this technical report can be found in the W3C standards and drafts index.

This specification is published by theRDF Star Working Group as part of the update of specifications for format and errata.

This document was published by theRDF & SPARQL Working Group as a Working Draft using theRecommendation track.

Publication as a Working Draft does not imply endorsement byW3C and its Members.

This is a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to cite this document as other than a work in progress. Future updates to this specification may incorporate new features.

This document was produced by a group operating under the W3C Patent Policy.W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent that the individual believes containsEssential Claim(s) must disclose the information in accordance withsection 6 of the W3C Patent Policy.

This document is governed by the03 November 2023 W3C Process Document.

Set of Documents

This document is one of twelve SPARQL 1.2 documents produced by theRDF-star Working Group.

List of documents

SPARQL 1.2 Documents:

1.Introduction

RDF is a directed, labeled graph data model for representing information in the Web. RDF is often used to represent, among other things, personal information, social networks, metadata about digital artifacts, as well as to provide a means of integration over disparate sources of information. This specification defines the syntax and semantics of the SPARQL Query Language for RDF.

The SPARQL Query Language for RDF is designed to meet the use cases and requirements identified by the RDF Data Access Working Group in [RDF-DAWG-UC], the SPARQL 1.1 Working Group in [SPARQL-FEATURES], and the RDF-star Working Group.

1.1Document Outline

Unless otherwise noted in the section heading, all sections and appendices in this document are normative.

This section of the document,section 1, introduces the SPARQL Query Language specification. It presents the organization of this specification document and the conventions used throughout the specification.

Section 2 of the specification introduces the SPARQL query language itself via a series of example queries and query results.Section 3 continues the introduction of the SPARQL query language with more examples that demonstrate SPARQL's ability to express constraints on the RDF terms that appear in a query's results.

Section 4 presents details of the SPARQL query language's syntax. It is a companion to the full grammar of the language and defines how grammatical constructs represent IRIs, blank nodes, literals, and variables. Section 4 also defines the meaning of several grammatical constructs that serve as syntactic sugar for more verbose expressions.

Section 5 introduces basic graph patterns and group graph patterns, the building blocks from which more complex SPARQL query patterns are constructed. Sections 6, 7, and 8 present constructs that combine SPARQL graph patterns into larger graph patterns. In particular,Section 6 introduces the ability to make portions of a query optional;Section 7 introduces the ability to express the disjunction of alternative graph patterns; andSection 8 introduces patterns to test for the absense of information.

Section 9 adds property paths to graph pattern matching, giving a compact representation of queries and also the ability to match arbitrary length paths in the graph.

Section 10 describes the forms of assignment possible in SPARQL.

Sections 11 introduces the mechanism to group and aggregate results, which can be incorporated as subqueries as described inSection 12.

Section 13 introduces the ability to constrain portions of a query to particular source graphs. Section 13 also presents SPARQL's mechanism for defining the source graphs for a query.

Section 14 refers to the separate documentSPARQL 1.1 Federated Query.

Section 15 defines the constructs that affect the solutions of a query by ordering, slicing, projecting, limiting, and removing duplicates from a sequence of solutions.

Section 16 defines the four types of SPARQL queries that produce results in different forms.

Section 17 defines SPARQL's extensible value testing and expression framework. It presents the functions and operators that can be used to constrain the values that appear in a query's results and also calculate new values to be returned by a query.

Section 18 is a formal definition of the evaluation of SPARQL graph patterns and solution modifiers.

Section 19 contains the normative definition of the syntax for the SPARQL query andSPARQL 1.1 Update languages, as given by a grammar expressed in EBNF notation.

1.2Document Conventions

1.2.1Namespaces

In this document, examples assume the following namespace prefix definitions unless otherwise stated:

Prefix	IRI
`rdf:`	`http://www.w3.org/1999/02/22-rdf-syntax-ns#`
`rdfs:`	`http://www.w3.org/2000/01/rdf-schema#`
`xsd:`	`http://www.w3.org/2001/XMLSchema#`
`fn:`	`http://www.w3.org/2005/xpath-functions#`
`sfn:`	`http://www.w3.org/ns/sparql#`

1.2.2Data Descriptions

This document uses theRDF 1.1 Turtle [TURTLE] data format to show each triple explicitly. Turtle allows IRIs to be abbreviated with prefixes:

PREFIX dc:   <http://purl.org/dc/elements/1.1/>PREFIX :     <http://example.org/book/>:book1  dc:title  "SPARQL Tutorial" .

1.2.3Result Descriptions

Result sets are illustrated in tabular form.

x	y	z
"Alice"	`<http://example/a>`

A 'binding' is a pair (variable,RDF term). In this result set, there are three variables:x,y andz (shown as column headers). Each solution is shown as one row in the body of the table. Here, there is a single solution, in which variablex is bound to"Alice", variabley is bound to<http://example/a>, and variablez is not bound to an RDF term. Variables are not required to be bound in a solution.

1.2.4Terminology

The SPARQL language includes IRIs. Note that all IRIs in SPARQL queries are absolute; they may or may not include a fragment identifier [RFC3987], section 3.1. IRIs include URIs [RFC3986] and URLs. The abbreviated forms (relative IRIs and prefixed names) in the SPARQL syntax are resolved to produce absolute IRIs.

The following terms are defined inRDF 1.2 Concepts and Abstract Syntax [RDF12-CONCEPTS] and used in SPARQL:

Blank node identifiers arepart of SPARQL and RDF concrete serializations. In this document, the syntax form "_:abc" is used where theblank node identifier isabc. and the "_:" is the Turtle and SPARQL syntax used to introduce blank nodes with identifiers.

2.Making Simple Queries (Informative)

Most forms of SPARQL query contain a set of triple patterns called abasic graph pattern. Triple patterns are like RDF triples except that each of the subject, predicate and object may be a variable. A basic graph patternmatches a subgraph of the RDF data when anRDF term from that subgraph may be substituted for the variables and the result is RDF graph equivalent to the subgraph.

2.1Writing a Simple Query

The example below shows a SPARQL query to find the title of a book from the given data graph. The query consists of two parts: theSELECT clause identifies the variables to appear in the query results, and theWHERE clause provides the basic graph pattern to match against the data graph. The basic graph pattern in this example consists of a single triple pattern with a single variable (?title) in the object position.

Data:

<http://example.org/book/book1> <http://purl.org/dc/elements/1.1/title> "SPARQL Tutorial" .

Query:

SELECT ?titleWHERE{    <http://example.org/book/book1> <http://purl.org/dc/elements/1.1/title> ?title .}

This query, on the data above, has one solution:

Query Result:

title
"SPARQL Tutorial"

2.2Multiple Matches

The result of a query is asolution sequence, corresponding to the ways in which the query's graph pattern matches the data. There may be zero, one or multiple solutions to a query.

Data:

PREFIX foaf:  <http://xmlns.com/foaf/0.1/> ._:a  foaf:name   "Johnny Lee Outlaw" ._:a  foaf:mbox   <mailto:jlow@example.com> ._:b  foaf:name   "Peter Goodguy" ._:b  foaf:mbox   <mailto:peter@example.org> ._:c  foaf:mbox   <mailto:carol@example.org> .

Query:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE{ ?x foaf:name ?name .  ?x foaf:mbox ?mbox }

Query Result:

name	mbox
"Johnny Lee Outlaw"	<mailto:jlow@example.com>
"Peter Goodguy"	<mailto:peter@example.org>

Each solution gives one way in which the selected variables can be bound to RDF terms so that the query pattern matches the data. The result set gives all the possible solutions. In the above example, the following two subsets of the data provided the two matches.

_:a foaf:name  "Johnny Lee Outlaw" ._:a foaf:box   <mailto:jlow@example.com> .

_:b foaf:name  "Peter Goodguy" ._:b foaf:box   <mailto:peter@example.org> .

This is abasic graph pattern match; all the variables used in the query pattern must be bound in every solution.

2.3Matching RDF Literals

The data below contains three RDF literals:

PREFIX dt:   <http://example.org/datatype#>PREFIX ns:   <http://example.org/ns#>PREFIX :     <http://example.org/ns#>PREFIX xsd:  <http://www.w3.org/2001/XMLSchema#>:x   ns:p     "cat"@en .:y   ns:p     "42"^^xsd:integer .:z   ns:p     "abc"^^dt:specialDatatype .

Note that, in Turtle,"cat"@en is an RDF literal with a lexical form "cat" and a language tag "en";"42"^^xsd:integer is a literal with the datatypehttp://www.w3.org/2001/XMLSchema#integer; and"abc"^^dt:specialDatatype is a literal with the datatypehttp://example.org/datatype#specialDatatype.

This RDF data is the data graph for the query examples in sections 2.3.1–2.3.3.

2.3.1Matching Literals with Language Tags

Language tags in SPARQL are expressed using@ and the language tag, as defined inTags for Identifying Languages [BCP47].

This following query has no solution because"cat" is not the same RDF literal as"cat"@en:

SELECT ?v WHERE { ?v ?p "cat" }

v

but the query below will find a solution where variablev is bound to:x because the language tag is specified and matches the given data:

SELECT ?v WHERE { ?v ?p "cat"@en }

v
<http://example.org/ns#x>

2.3.2Matching Literals with Numeric Types

Integers in a SPARQL query indicate an RDF literal with the datatypexsd:integer. For example:42 is a shortened form of "42"^^<http://www.w3.org/2001/XMLSchema#integer>.

The pattern in the following query has a solution with variablev bound to:y.

SELECT ?v WHERE { ?v ?p 42 }

v
<http://example.org/ns#y>

Section 4.1.2 defines SPARQL shortened forms forxsd:float andxsd:double.

2.3.3Matching Literals with Arbitrary Datatypes

The following query has a solution with variablev bound to:z. The query processor does not have to have any understanding of the values in the space of the datatype. Because the lexical form and datatype IRI both match, the literal matches.

SELECT ?v WHERE { ?v ?p "abc"^^<http://example.org/datatype#specialDatatype> }

v
<http://example.org/ns#z>

2.4Blank Node Identifiers in Query Results

Query results can contain blank nodes. Blank nodes in the example result sets in this document are written in the form "_:" followed by ablank node identifier.

Blank node identifiers are scoped to a result set (see "SPARQL Query Results XML Format (Second Edition)" and "SPARQL 1.1 Query Results JSON Format") or, for theCONSTRUCT query form, the result graph. Use of the same identifier within a result set indicates the same blank node.

Data:

PREFIX foaf:  <http://xmlns.com/foaf/0.1/>_:a  foaf:name   "Alice" ._:b  foaf:name   "Bob" .

Query:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>SELECT ?x ?nameWHERE  { ?x foaf:name ?name }

x	name
_:c	"Alice"
_:d	"Bob"

The results above could equally be given with different blank node identifiers because the blank node identifiers in the results only indicate whether RDF terms in the solutions are the same or different.

x	name
_:r	"Alice"
_:s	"Bob"

These two results have the same information: the blank nodes used to match the query are different in the two solutions. There need not be any relation between a blank node identifier_:a in the result set and a blank node identifier used in the syntax for the data.

An application writer should not expect blank node identifiers in a query to refer to a particular blank node in the data.

2.5Creating Values with Expressions

SPARQL 1.2 allows values to be created from complex expressions. The queries below show how theCONCAT function can be used to concatenate first names and last names from FOAF data, then assign the value using anexpression in theSELECT clause and also assign the value by using theBIND form.

Data:

PREFIX foaf:  <http://xmlns.com/foaf/0.1/>            _:a  foaf:givenName   "John" ._:a  foaf:surname  "Doe" .

Query:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>SELECT ( CONCAT(?G, " ", ?S) AS ?name )WHERE  { ?P foaf:givenName ?G ; foaf:surname ?S }

Query:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>SELECT ?nameWHERE  {     ?P foaf:givenName ?G ;        foaf:surname ?S     BIND(CONCAT(?G, " ", ?S) AS ?name)}

name
"John Doe"

2.6Building RDF Graphs

SPARQL has severalquery forms. TheSELECT query form returns variable bindings. TheCONSTRUCT query form returns an RDF graph. The graph is built based on a template which is used to generate RDF triples based on the results of matching the graph pattern of the query.

Data:

PREFIX org:    <http://example.com/ns#>_:a  org:employeeName   "Alice" ._:a  org:employeeId     12345 ._:b  org:employeeName   "Bob" ._:b  org:employeeId     67890 .

Query:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>PREFIX org:    <http://example.com/ns#>CONSTRUCT { ?x foaf:name ?name }WHERE  { ?x org:employeeName ?name }

Results:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>                _:x foaf:name "Alice" ._:y foaf:name "Bob" .

which can be serialized inRDF/XML as:

<rdf:RDF   xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"   xmlns:foaf="http://xmlns.com/foaf/0.1/" >  <rdf:Description>    <foaf:name>Alice</foaf:name>  </rdf:Description>  <rdf:Description>    <foaf:name>Bob</foaf:name>  </rdf:Description></rdf:RDF>

3.RDF Term Constraints (Informative)

Graph pattern matching produces a solution sequence, where each solution has a set of bindings of variables to RDF terms. SPARQLFILTERs restrict solutions to those for which the filter expression evaluates toTRUE.

This section provides an informal introduction to SPARQLFILTERs; their semantics are defined in section 'Expressions and Testing Values' where there is acomprehensive function library. The examples in this section share one input graph:

Data:

PREFIX dc:   <http://purl.org/dc/elements/1.1/>PREFIX :     <http://example.org/book/>PREFIX ns:   <http://example.org/ns#>:book1  dc:title  "SPARQL Tutorial" .:book1  ns:price  42 .:book2  dc:title  "The Semantic Web" .:book2  ns:price  23 .

3.1Restricting the Value of Strings

SPARQLFILTER functions likeregex can test RDF literals.regex matches onlystring literals.regex can be used to match the lexical forms of other literals by using thestr function.

Query:

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>SELECT  ?titleWHERE   {     ?x dc:title ?title    FILTER regex(?title, "^SPARQL") }

Query Result:

title
"SPARQL Tutorial"

Regular expression matches may be made case-insensitive with the "i" flag.

Query:

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>SELECT  ?titleWHERE   {     ?x dc:title ?title    FILTER regex(?title, "web", "i" ) }

Query Result:

title
"The Semantic Web"

The regular expression language isdefined by XQuery and XPath Functions and Operators and is based onXML Schema Regular Expressions.

3.2Restricting Numeric Values

SPARQLFILTERs can restrict on arithmetic expressions.

Query:

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>PREFIX  ns:  <http://example.org/ns#>SELECT  ?title ?priceWHERE   {    ?x ns:price ?price .    FILTER (?price < 30.5)    ?x dc:title ?title . }

Query Result:

title	price
"The Semantic Web"	23

By constraining theprice variable, only:book2 matches the query because only:book2 has a price less than30.5, as the filter condition requires.

3.3Other Term Constraints

In addition tonumeric types, SPARQL supports typesxsd:string,xsd:boolean andxsd:dateTime (seeOperand Data Types). SectionOperator Mapping describes the operators and sectionFunction Definitions the functions that can be that can be applied to RDF terms.

4.SPARQL Syntax

This section covers the syntax used by SPARQL forRDF terms andtriple patterns. The full grammar is given insection 19.

4.1RDF Term Syntax

4.1.1Syntax for IRIs

Theiri production designates the set of IRIs [RFC3987]; IRIs are a generalization of URIs [RFC3986] and are fully compatible with URIs and URLs. ThePrefixedName production designates a prefixed name. The mapping from a prefixed name to an IRI is described below. IRI references (relative or absolute IRIs) are designated by theIRIREF production, where the '<' and '>' delimiters do not form part of the IRI reference. Relative IRIs match theirelative-ref reference in section 2.2 ABNF for IRI References and IRIs in [RFC3987] and are resolved to IRIs as described below.

4.1.1.1Prefixed Names

ThePREFIX keyword associates a prefix label with an IRI. A prefixed name is a prefix label and a local part, separated by a colon ":". A prefixed name is mapped to an IRI by concatenating the IRI associated with the prefix and the local part. The prefix label or the local part may be empty. Note thatSPARQL local names allow leading digits whileXML local names do not.SPARQL local names also allow the non-alphanumeric characters allowed in IRIs via backslash character escapes (e.g.ns:id\=123).SPARQL local names have more syntactic restrictions thanCURIEs.

4.1.1.2Relative IRIs

Relative IRIs are combined with base IRIs as perUniform Resource Identifier (URI): Generic Syntax [RFC3986] using only the basic algorithm in section 5.2. Neither Syntax-Based Normalization nor Scheme-Based Normalization (described in sections 6.2.2 and 6.2.3 of [RFC3986]) are performed. Characters additionally allowed in IRI references are treated in the same way that unreserved characters are treated in URI references, per section 6.5 ofInternationalized Resource Identifiers (IRIs) [RFC3987].

TheBASE keyword defines the Base IRI used to resolve relative IRIs per [RFC3986] section 5.1.1, "Base URI Embedded in Content". Section 5.1.2, "Base URI from the Encapsulating Entity" defines how the Base IRI may come from an encapsulating document, such as a SOAP envelope with an xml:base directive or a mime multipart document with a Content-Location header. The "Retrieval URI" identified in 5.1.3, Base "URI from the Retrieval URI", is the URL from which a particular SPARQL query was retrieved. If none of the above specifies the Base URI, the default Base URI (section 5.1.4, "Default Base URI") is used.

The following fragments are some of the different ways to write the same IRI:

<http://example.org/book/book1>

BASE <http://example.org/book/><book1>

PREFIX book: <http://example.org/book/>book:book1

4.1.2Syntax for Literals

The general syntax for literals is a string (enclosed in either double quotes,"...", or single quotes,'...'), with either an optional language tag (introduced by@) or an optional datatype IRI or prefixed name (introduced by^^).

As a convenience, integers can be written directly (without quotation marks and an explicit datatype IRI) and are interpreted as literals with datatypexsd:integer; decimal numbers for which there is '.' in the number but no exponent are interpreted asxsd:decimal; and numbers with exponents are interpreted asxsd:double. Values of typexsd:boolean can also be written astrue orfalse.

To facilitate writing literal values which themselves contain quotation marks or which are long and contain newline characters, SPARQL provides an additional quoting construct in which literals are enclosed in three single- or double-quotation marks.

Examples of literal syntax in SPARQL include:

"chat"
'chat'@fr with language tag "fr"
"xyz"^^<http://example.org/ns/userDatatype>
"abc"^^appNS:appDataType
'''The librarian said, "Perhaps you would enjoy 'War and Peace'."'''
1, which is the same as"1"^^xsd:integer
1.3, which is the same as"1.3"^^xsd:decimal
1.300, which is the same as"1.300"^^xsd:decimal
1.0e6, which is the same as"1.0e6"^^xsd:double
true, which is the same as"true"^^xsd:boolean
false, which is the same as"false"^^xsd:boolean

Tokens matching the productionsINTEGER,DECIMAL,DOUBLE orBooleanLiteral are equivalent to a typed literal with the lexical value of the token and the corresponding datatype (xsd:integer,xsd:decimal,xsd:double,xsd:boolean).

4.1.3Syntax for Query Variables

A query variable is marked by the use of either "?" or "$"; the "?" or "$" is not part of the variable name. In a query,$abc and?abc identify the same variable. Thepossible names for variables are given in theSPARQL grammar.

4.1.4Syntax for Blank Nodes

Blank nodes in graph patterns act as variables, not as references to specific blank nodes in the data being queried. Blank nodes are indicated by either the identifier form, such as "_:abc", or an abbreviation form using "[]" or "[...]".

Blank node identifiers are written as "_:abc" for a blank node with identifier "abc". The same blank node identifier cannot be used in two different basic graph patterns in the same query.

A blank node that is used in only one place in the query syntax can be indicated with[]. A unique blank node will be used to form the triple pattern.

The[:p :v] construct can be used to create triple patterns with a unique blank node as the subject of contained predicate-object pairs.

The following two forms

[ :p "v" ] .

[] :p "v" .

allocate a unique blank node (here, illustrated by "_:b57") and both are equivalent to writing:

_:b57 :p "v" .

The allocated blank node can be used as the subject or object of further triple patterns. For example, as a subject:

[ :p "v" ] :q "w" .

which is equivalent to the two triples:

_:b57 :p "v" ._:b57 :q "w" .

and as an object:

:x :q [ :p "v" ] .

which is equivalent to the two triples:

:x  :q _:b57 ._:b57 :p "v" .

Abbreviated blank node syntax can be combined with other abbreviations forcommon subjects andcommon predicates.

[ foaf:name  ?name ;  foaf:mbox  <mailto:alice@example.org> ]

This is the same as writing the following basic graph pattern using a blank node identifer instead.

_:b18  foaf:name  ?name ._:b18  foaf:mbox  <mailto:alice@example.org> .

4.2Syntax for Triple Patterns

Triple Patterns are written as subject, predicate and object; there are abbreviated ways of writing some common triple pattern constructs.

The following examples express the same query:

PREFIX  dc: <http://purl.org/dc/elements/1.1/>SELECT  ?titleWHERE   { <http://example.org/book/book1> dc:title ?title }

PREFIX  dc: <http://purl.org/dc/elements/1.1/>PREFIX  : <http://example.org/book/>SELECT  $titleWHERE   { :book1  dc:title  $title }

BASE    <http://example.org/book/>PREFIX  dc: <http://purl.org/dc/elements/1.1/>SELECT  $titleWHERE   { <book1>  dc:title  ?title }

4.2.1Predicate-Object Lists

Triple patterns with a common subject can be written so that the subject is only written once and is used for more than one triple pattern by employing the ";" notation.

?x  foaf:name  ?name ;foaf:mbox  ?mbox .

This is the same as writing the triple patterns:

?x  foaf:name  ?name .?x  foaf:mbox  ?mbox .

4.2.2Object Lists

If triple patterns share both subject and predicate, the objects may be separated by ",".

?x foaf:nick  "Alice" , "Alice_" .

is the same as writing the triple patterns:

?x  foaf:nick  "Alice" .?x  foaf:nick  "Alice_" .

Object lists can be combined with predicate-object lists:

?x  foaf:name ?name ; foaf:nick  "Alice" , "Alice_" .

is equivalent to:

?x  foaf:name  ?name .?x  foaf:nick  "Alice" .?x  foaf:nick  "Alice_" .

4.2.3RDF Collections

RDF collections can be written in triple patterns using the syntax "(element1 element2 ...)". The form "()" is an alternative for the IRIhttp://www.w3.org/1999/02/22-rdf-syntax-ns#nil. When used with collection elements, such as(1 ?x 3 4), triple patterns with blank nodes are allocated for the collection. The blank node at the head of the collection can be used as a subject or object in other triple patterns. The blank nodes allocated by the collection syntax do not occur elsewhere in the query.

(1 ?x 3 4) :p "w" .

is syntactic sugar for (noting thatb0,b1,b2 andb3 do not occur anywhere else in the query):

_:b0  rdf:first  1 ;      rdf:rest   _:b1 ._:b1  rdf:first  ?x ;      rdf:rest   _:b2 ._:b2  rdf:first  3 ;      rdf:rest   _:b3 ._:b3  rdf:first  4 ;      rdf:rest   rdf:nil ._:b0  :p         "w" .

RDF collections can be nested and can involve other syntactic forms:

(1 [:p :q] ( 2 ) ) .

is syntactic sugar for:

_:b0  rdf:first  1 ;      rdf:rest   _:b1 ._:b1  rdf:first  _:b2 ._:b2  :p         :q ._:b1  rdf:rest   _:b3 ._:b3  rdf:first  _:b4 ._:b4  rdf:first  2 ;      rdf:rest   rdf:nil ._:b3  rdf:rest   rdf:nil .

4.2.4rdf:type

The keyword "a" can be used as a predicate in a triple pattern and is an alternative for the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#type. This keyword is case-sensitive.

?x  a  :Class1 .[ a :appClass ] :p "v" .

is syntactic sugar for:

?x    rdf:type  :Class1 ._:b0  rdf:type  :appClass ._:b0  :p        "v" .

4.3Version Announcement

To cope with the language evolution of SPARQL, theVERSION directive can be used. When writing SPARQL queries with new features such astriple terms orfunctions on triple terms, authorsMAY announce the use of the new syntax forms by including this directive.

VERSION "1.2"PREFIX : <http://example/>PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>SELECT ?s ?date {    ?s ?p ?o .    BIND( <<( ?s ?p ?o )>> AS ?tt )    :myreifier rdf:reifies ?tt .    :myreifier :tripleAdded ?date .}

5.Graph Patterns

SPARQL is based around graph pattern matching. More complex graph patterns can be formed by combining smaller patterns in various ways:

Basic Graph Patterns, where a set of triple patterns must match
Group Graph Pattern, where a set of graph patterns must all match
Optional Graph patterns, where additional patterns may extend the solution
Alternative Graph Pattern, where two or more possible patterns are tried
Patterns on Named Graphs, where patterns are matched against named graphs

In this section we describe the two forms that combine patterns by conjunction: basic graph patterns, which combine triples patterns, and group graph patterns, which combine all other graph patterns.

The outer-most graph pattern in a query is called the query pattern. It is grammatically identified byGroupGraphPattern in

[17] WhereClause ::= 'WHERE'?GroupGraphPattern

5.1Basic Graph Patterns

Basic graph patterns are sets of triple patterns. SPARQL graph pattern matching is defined in terms of combining the results from matching basic graph patterns.

A sequence of triple patterns, with optional filters, comprises a single basic graph pattern. Any other graph pattern terminates a basic graph pattern.

5.1.1Blank Node Identifiers

When using blank nodes of the form_:abc, identifiers for blank nodes are scoped to the basic graph pattern. Ablank node identifier can only be used in one basic graph pattern in any query.

5.1.2Extending Basic Graph Pattern Matching

SPARQL evaluates basic graph patterns using subgraph matching, which is defined for simple entailment. SPARQL can be extended to other forms of entailment givencertain conditions as described below. The documentSPARQL 1.1 Entailment Regimes describes several specific entailment regimes.

5.2Group Graph Patterns

In a SPARQL query string, a group graph pattern is delimited with braces:{}. For example, this query's query pattern is a group graph pattern of one basic graph pattern.

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE  {    ?x foaf:name ?name .    ?x foaf:mbox ?mbox .}

The same solutions would be obtained from a query that grouped the triple patterns into two basic graph patterns. For example, the query below has a different structure but would yield the same solutions as the previous query:

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE  {    { ?x foaf:name ?name . }   { ?x foaf:mbox ?mbox . }}

5.2.1Empty Group Pattern

The group pattern:

{ }

matches any graph (including the empty graph) with one solution that does not bind any variables. For example:

SELECT ?xWHERE {}

matches with one solution in which variablex is not bound.

5.2.2Scope of Filters

A constraint, expressed by the keywordFILTER, is a restriction on solutions over the whole group in which the filter appears. The following patterns all have the same solutions:

{  ?x foaf:name ?name .   ?x foaf:mbox ?mbox .   FILTER regex(?name, "Smith")}

{  FILTER regex(?name, "Smith")   ?x foaf:name ?name .   ?x foaf:mbox ?mbox .}

{  ?x foaf:name ?name .   FILTER regex(?name, "Smith")   ?x foaf:mbox ?mbox .}

5.2.3Group Graph Pattern Examples

{ ?x foaf:name ?name .  ?x foaf:mbox ?mbox .}

is a group of one basic graph pattern and that basic graph pattern consists of two triple patterns.

{  ?x foaf:name ?name . FILTER regex(?name, "Smith")  ?x foaf:mbox ?mbox .}

is a group of one basic graph pattern and a filter, and that basic graph pattern consists of two triple patterns; the filter does not break the basic graph pattern into two basic graph patterns.

{  ?x foaf:name ?name .  {}  ?x foaf:mbox ?mbox .}

is a group of three elements, a basic graph pattern of one triple pattern, an empty group, and another basic graph pattern of one triple pattern.

6.Including Optional Values

Basic graph patterns allow applications to make queries where the entire query pattern must match for there to be a solution. For every solution of a query containing only group graph patterns with at least one basic graph pattern, every variable is bound to an RDF Term in a solution. However, regular, complete structures cannot be assumed in all RDF graphs. It is useful to be able to have queries that allow information to be added to the solution where the information is available, but do not reject the solution because some part of the query pattern does not match. Optional matching provides this facility: if the optional part does not match, it creates no bindings but does not eliminate the solution.

6.1Optional Pattern Matching

Optional parts of the graph pattern may be specified syntactically with theOPTIONAL keyword applied to a graph pattern:

pattern OPTIONAL {pattern }

The syntactic form:

{ OPTIONAL {pattern } }

is equivalent to:

{ { } OPTIONAL {pattern } }

TheOPTIONAL keyword is left-associative :

pattern OPTIONAL {pattern } OPTIONAL { pattern }

is the same as:

{pattern OPTIONAL {pattern } } OPTIONAL { pattern }

In an optional match, either the optional graph pattern matches a graph, thereby defining and adding bindings to one or more solutions, or it leaves a solution unchanged without adding any additional bindings.

Data:

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>PREFIX rdf:        <http://www.w3.org/1999/02/22-rdf-syntax-ns#>_:a  rdf:type        foaf:Person ._:a  foaf:name       "Alice" ._:a  foaf:mbox       <mailto:alice@example.com> ._:a  foaf:mbox       <mailto:alice@work.example> ._:b  rdf:type        foaf:Person ._:b  foaf:name       "Bob" .

Query:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE  {    ?x foaf:name  ?name .    OPTIONAL { ?x  foaf:mbox  ?mbox }}

With the data above, the query result is:

name	mbox
"Alice"	<mailto:alice@example.com>
"Alice"	<mailto:alice@work.example>
"Bob"

There is no value ofmbox in the solution where the name is"Bob".

This query finds the names of people in the data. If there is a triple with predicatembox and the same subject, a solution will contain the object of that triple as well. In this example, only a single triple pattern is given in the optional match part of the query but, in general, the optional part may be any graph pattern. The entire optional graph pattern must match for the optional graph pattern to affect the query solution.

6.2Constraints in Optional Pattern Matching

Constraints can be given in an optional graph pattern. For example:

PREFIX dc:   <http://purl.org/dc/elements/1.1/>PREFIX :     <http://example.org/book/>PREFIX ns:   <http://example.org/ns#>:book1  dc:title  "SPARQL Tutorial" .:book1  ns:price  42 .:book2  dc:title  "The Semantic Web" .:book2  ns:price  23 .

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>PREFIX  ns:  <http://example.org/ns#>SELECT  ?title ?priceWHERE   {     ?x dc:title ?title .    OPTIONAL { ?x ns:price ?price . FILTER (?price < 30) }}

title	price
"SPARQL Tutorial"
"The Semantic Web"	23

No price appears for the book with title "SPARQL Tutorial" because the optional graph pattern did not lead to a solution involving the variable "price".

6.3Multiple Optional Graph Patterns

Graph patterns are defined recursively. A graph pattern may have zero or more optional graph patterns, and any part of a query pattern may have an optional part. In this example, there are two optional graph patterns.

Data:

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>_:a  foaf:name       "Alice" ._:a  foaf:homepage   <http://work.example.org/alice/> ._:b  foaf:name       "Bob" ._:b  foaf:mbox       <mailto:bob@work.example> .

Query:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?name ?mbox ?hpageWHERE  {    ?x foaf:name  ?name .    OPTIONAL { ?x foaf:mbox ?mbox } .    OPTIONAL { ?x foaf:homepage ?hpage }}

Query result:

name	mbox	hpage
"Alice"		<http://work.example.org/alice/>
"Bob"	<mailto:bob@work.example>

7.Matching Alternatives

SPARQL provides a means of combining graph patterns so that one of several alternative graph patterns may match. If more than one of the alternatives matches, all the possible pattern solutions are found.

Pattern alternatives are syntactically specified with theUNION keyword.

Data:

PREFIX dc10:  <http://purl.org/dc/elements/1.0/>PREFIX dc11:  <http://purl.org/dc/elements/1.1/>_:a  dc10:title     "SPARQL Query Language Tutorial" ._:a  dc10:creator   "Alice" ._:b  dc11:title     "SPARQL Protocol Tutorial" ._:b  dc11:creator   "Bob" ._:c  dc10:title     "SPARQL" ._:c  dc11:title     "SPARQL (updated)" .

Query:

PREFIX dc10:  <http://purl.org/dc/elements/1.0/>PREFIX dc11:  <http://purl.org/dc/elements/1.1/>SELECT ?titleWHERE  { { ?book dc10:title  ?title } UNION { ?book dc11:title  ?title } }

Query result:

title
"SPARQL Protocol Tutorial"
"SPARQL"
"SPARQL (updated)"
"SPARQL Query Language Tutorial"

This query finds titles of the books in the data, whether the title is recorded usingDublin Core properties from version 1.0 or version 1.1. To determine exactly how the information was recorded, a query could use different variables for the two alternatives:

PREFIX dc10:  <http://purl.org/dc/elements/1.0/>PREFIX dc11:  <http://purl.org/dc/elements/1.1/>SELECT ?x ?yWHERE  { { ?book dc10:title ?x } UNION { ?book dc11:title  ?y } }

x	y
	"SPARQL (updated)"
	"SPARQL Protocol Tutorial"
"SPARQL"
"SPARQL Query Language Tutorial"

This will return results with the variablex bound for solutions from the left branch of theUNION, andy bound for the solutions from the right branch. If neither part of theUNION pattern matched, then the graph pattern would not match.

TheUNION pattern combines graph patterns; each alternative possibility can contain more than one triple pattern:

PREFIX dc10:  <http://purl.org/dc/elements/1.0/>PREFIX dc11:  <http://purl.org/dc/elements/1.1/>SELECT ?title ?authorWHERE {    { ?book dc10:title ?title .  ?book dc10:creator ?author }      UNION    { ?book dc11:title ?title .  ?book dc11:creator ?author }}

title	author
"SPARQL Query Language Tutorial"	"Alice"
"SPARQL Protocol Tutorial"	"Bob"

This query will only match a book if it has both a title and creator predicate from the same version of Dublin Core.

8.Negation

The SPARQL query language incorporates two styles of negation, one based on filtering results depending on whether a graph pattern does or does not match in the context of the query solution being filtered, and one based on removing solutions related to another pattern.

8.1Filtering Using Graph Patterns

Filtering of query solutions is done within aFILTER expression usingNOT EXISTS andEXISTS. Note that the filter scope rulesapply to the whole group in which the filter appears.

8.1.1Testing For the Absence of a Pattern

TheNOT EXISTS filter expression tests whether a graph pattern does not match the dataset, given the values of variables in the group graph pattern in which the filter occurs. It does not generate any additional bindings.

Data:

PREFIX  :       <http://example/>PREFIX  rdf:    <http://www.w3.org/1999/02/22-rdf-syntax-ns#>PREFIX  foaf:   <http://xmlns.com/foaf/0.1/>:alice  rdf:type   foaf:Person .:alice  foaf:name  "Alice" .:bob    rdf:type   foaf:Person .

Query:

PREFIX  rdf:    <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX  foaf:   <http://xmlns.com/foaf/0.1/> SELECT ?personWHERE {    ?person rdf:type  foaf:Person .    FILTER NOT EXISTS { ?person foaf:name ?name }}

Query Result:

person
<http://example/bob>

8.1.2Testing For the Presence of a Pattern

The filter expressionEXISTS is also provided. It tests whether the pattern can be found in the data; it does not generate any additional bindings.

Query:

PREFIX  rdf:    <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX  foaf:   <http://xmlns.com/foaf/0.1/> SELECT ?personWHERE {    ?person rdf:type  foaf:Person .    FILTER EXISTS { ?person foaf:name ?name }}

Query Result:

person
<http://example/alice>

8.2Removing Possible Solutions

The other style of negation provided in SPARQL isMINUS which evaluates both its arguments, then calculates solutions in the left-hand side that are not compatible with the solutions on the right-hand side.

Data:

PREFIX :       <http://example/>PREFIX foaf:   <http://xmlns.com/foaf/0.1/>:alice  foaf:givenName "Alice" ;        foaf:familyName "Smith" .:bob    foaf:givenName "Bob" ;        foaf:familyName "Jones" .:carol  foaf:givenName "Carol" ;        foaf:familyName "Smith" .

Query:

PREFIX :       <http://example/>PREFIX foaf:   <http://xmlns.com/foaf/0.1/>SELECT DISTINCT ?sWHERE {    ?s ?p ?o .    MINUS {        ?s foaf:givenName "Bob" .    }}

Results:

s
<http://example/carol>
<http://example/alice>

8.3Relationship and differences between NOT EXISTS and MINUS

NOT EXISTS andMINUS represent two ways of thinking about negation, one based on testing whether a pattern exists in the data, given the bindings already determined by the query pattern, and one based on removing matches based on the evaluation of two patterns. In some cases they can produce different answers.

8.3.1Example: Sharing of variables

PREFIX : <http://example/>:a :b :c .

SELECT * {     ?s ?p ?o    FILTER NOT EXISTS { ?x ?y ?z }}

evaluates to a result set with no solutions because{ ?x ?y ?z } matches given any?s ?p ?o, soNOT EXISTS { ?x ?y ?z } eliminates any solutions.

s	p	o

whereas withMINUS, there is no shared variable between the first part (?s ?p ?o) and the second (?x ?y ?z) so no bindings are eliminated.

SELECT * {     ?s ?p ?o     MINUS     { ?x ?y ?z }}

Results:

s	p	o
<http://example/a>	<http://example/b>	<http://example/c>

8.3.2Example: Fixed pattern

Another case is where there is a concrete pattern (no variables) in the example:

PREFIX : <http://example/>SELECT * {      ?s ?p ?o     FILTER NOT EXISTS { :a :b :c }}

evaluates to a result set with no query solutions:

Results:

s	p	o

whereas

PREFIX : <http://example/>SELECT * {     ?s ?p ?o     MINUS { :a :b :c }}

evaluates to result set with one query solution:

Results:

s	p	o
<http://example/a>	<http://example/b>	<http://example/c>

because there is no match of bindings and so no solutions are eliminated.

8.3.3Example: Inner FILTERs

Differences also arise because in a filter, variables from the group arein scope. In this example, theFILTER inside theNOT EXISTS has access to the value of?n for the solution being considered.

PREFIX : <http://example.com/>:a :p 1 .:a :q 1 .:a :q 2 .:b :p 3.0 .:b :q 4.0 .:b :q 5.0 .

When usingFILTER NOT EXISTS, the test is on each possible solution to?x :p ?n:

PREFIX : <http://example.com/>SELECT * WHERE {    ?x :p ?n    FILTER NOT EXISTS {        ?x :q ?m .        FILTER(?n = ?m)    }}

x	n
<http://example.com/b>	3.0

whereas withMINUS, theFILTER inside the pattern does not have a value for ?n and it is always unbound:

PREFIX : <http://example/>SELECT * WHERE {    ?x :p ?n    MINUS {        ?x :q ?m .        FILTER(?n = ?m)    }}

x	n
<http://example.com/b>	3.0
<http://example.com/a>	1

9.Property Paths

A property path is a possible route through a graph between two graph nodes. A trivial case is a property path of length exactly 1, which is a triple pattern. The ends of the path may be RDF terms or variables. Variables can not be used as part of the path itself, only the ends.

Property paths allow for more concise expressions for some SPARQL basic graph patterns and they also add the ability to match connectivity of two resources by an arbitrary length path.

9.1Property Path Syntax

In the description below,iri is eitheran IRI written in full or abbreviated by a prefixed name, or the keyworda.elt is a path element, which may itself be composed of path constructs.

Syntax Form	Property Path Expression Name	Matches
`iri`	PredicatePath	An IRI. A path of length one.
`^elt`	InversePath	Inverse path (object to subject).
`elt1 /elt2`	SequencePath	A sequence path of`elt1` followed by`elt2`.
`elt1 \|elt2`	AlternativePath	A alternative path of`elt1` or`elt2` (all possibilities are tried).
`elt*`	ZeroOrMorePath	A path that connects the subject and object of the path by zero or more matches of`elt`.
`elt+`	OneOrMorePath	A path that connects the subject and object of the path by one or more matches of`elt`.
`elt?`	ZeroOrOnePath	A path that connects the subject and object of the path by zero or one matches of`elt`.
`!iri` or`!(iri₁\| ...\|iri_n)`	NegatedPropertySet	Negated property set. An IRI which is not one of`iri_i`.`!iri` is short for`!(iri)`.
`!^iri` or`!(^iri₁\| ...\|^iri_n)`	NegatedPropertySet	Negated property set where the excluded matches are based on reversed path. That is, not one ofiri₁...iri_n as reverse paths.`!^iri` is short for`!(^iri)`.
`!(iri₁\| ...\|iri_j\|^iri_j+1\| ...\|^iri_n)`	NegatedPropertySet	A combination of forward and reverse properties in a negated property set.
`(elt)`		A group path`elt`, brackets control precedence.

The order of IRIs, and reverse IRIs, in a negated property set is not significant and they can occur in a mixed order.

The precedence of the syntax forms is, from highest to lowest:

IRI, prefixed names
Negated property sets
Groups
Unary operators*,? and+
Unary ^ inverse links
Binary operator/
Binary operator|

Precedence is left-to-right within groups.

9.2Examples

Alternatives: Match one or both possibilities

{ :book1 dc:title|rdfs:label ?displayString }

which could have written:

{    :book1 <http://purl.org/dc/elements/1.1/title> | <http://www.w3.org/2000/01/rdf-schema#label> ?displayString}

Sequence: Find the name of any people that Alice knows.

{    ?x foaf:mbox <mailto:alice@example> .    ?x foaf:knows/foaf:name ?name .}

Sequence: Find the names of people 2 "foaf:knows" links away.

{     ?x foaf:mbox <mailto:alice@example> .    ?x foaf:knows/foaf:knows/foaf:name ?name .}

This is the same as the SPARQL query:

SELECT ?x ?name {    ?x  foaf:mbox <mailto:alice@example> .    ?x  foaf:knows [ foaf:knows [ foaf:name ?name ]]. }

or, with explicit variables:

SELECT ?x ?name {    ?x  foaf:mbox <mailto:alice@example> .    ?x  foaf:knows ?a1 .    ?a1 foaf:knows ?a2 .    ?a2 foaf:name ?name .}

Filtering duplicates: Because someone Alice knows may well know Alice, the example above may include Alice herself. This could be avoided with:

{ ?x foaf:mbox <mailto:alice@example> .  ?x foaf:knows/foaf:knows ?y .  FILTER ( ?x != ?y )  ?y foaf:name ?name }

Inverse Property Paths: These two are the same query: the second is just reversing the property direction which swaps the roles of subject and object.

{ ?x foaf:mbox <mailto:alice@example> }

{ <mailto:alice@example> ^foaf:mbox ?x }

Inverse Path Sequence: Find all the people who know someone?x knows.

{  ?x foaf:knows/^foaf:knows ?y .    FILTER(?x != ?y)}

which is equivalent to (?gen1 is a system generated variable):

{  ?x foaf:knows ?gen1 .  ?y foaf:knows ?gen1 .    FILTER(?x != ?y)}

Arbitrary length match: Find the names of all the people that can be reached from Alice byfoaf:knows:

{  ?x foaf:mbox <mailto:alice@example> .  ?x foaf:knows+/foaf:name ?name .}

Alternatives in an arbitrary length path:

{ ?ancestor (ex:motherOf|ex:fatherOf)+ <#me> }

Arbitrary length path match: Some forms of limited inference are possible as well. For example, for RDFS, all types and supertypes of a resource:

{ <http://example/thing> rdf:type/rdfs:subClassOf* ?type }

All resources and all their inferred types:

{ ?x rdf:type/rdfs:subClassOf* ?type }

Subproperty:

{ ?x ?p ?v . ?p rdfs:subPropertyOf* :property }

Negated Property Paths: Find nodes connected but not by rdf:type (either way round):

{ ?x !(rdf:type|^rdf:type) ?y }

Elements in an RDF collection:

{ :list rdf:rest*/rdf:first ?element }

Note: This path expression does not guarantee the order of the results.

9.3Property Paths and Equivalent Patterns

SPARQL property paths treat the RDF triples as a directed, possibly cyclic, graph with named edges. Evaluation of a property path expression can lead to duplicates because any variables introduced in the equivalent pattern are not part of the results and are not already used elsewhere. They are hidden by implicit projection of the results to just the variables given in the query.

For example, on the data:

PREFIX :       <http://example/>:order  :item :z1 .:order  :item :z2 .:z1 :name "Small" .:z1 :price 5 .:z2 :name "Large" .:z2 :price 5 .

Query:

PREFIX :   <http://example/>SELECT * {  ?s :item/:price ?x . }

Results:

s	x
<http://example/order>	5
<http://example/order>	5

whereas if the query were written out to include the intermediate variable (?_a), no rows in the results are duplicates:

PREFIX :   <http://example/>SELECT * {  ?s :item ?_a .   ?_a :price ?x .}

Results:

s	_a	x
<http://example/order>	<http://example/z1>	5
<http://example/order>	<http://example/z2>	5

The equivalence to graphs patterns is particularly significant when query also involves an aggregation operation. The total cost of the order can be found with

PREFIX :   <http://example/>SELECT (sum(?x) AS ?total) {     :order :item/:price ?x}

total
10

9.4Arbitrary Length Path Matching

Connectivity between the subject and object by a property path of arbitrary length can be found using the "zero or more" property path operator,*, and the "one or more" property path operator,+. There is also a "zero or one" connectivity property path operator,?.

Each of these operators uses the property path expression to try to find a connection between subject and object, using the path step a number of times, as restricted by the operator.

For example, finding all the the possible types of a resource, including supertypes of resources, can be achieved with:

PREFIX  rdfs:   <http://www.w3.org/2000/01/rdf-schema#> . PREFIX  rdf:    <http://www.w3.org/1999/02/22-rdf-syntax-ns#>SELECT ?x ?type{     ?x rdf:type/rdfs:subClassOf* ?type}

Similarly, finding all the people:x connects to via thefoaf:knows relationship,

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX :     <http://example/>SELECT ?person{     :x foaf:knows+ ?person}

Such connectivity matching does not introduce duplicates (it does not incorporate any count of the number of ways the connection can be made) even if the repeated path itself would otherwise result in duplicates.

The graph matched may include cycles. Connectivity matching is defined so that matching cycles does not lead to undefined or infinite results.

10.Assignment

The value of an expression can be added to a solution mapping by binding a new variable to the value of the expression, which is an RDF term. The variable can then be used in the query and also can be returned in results.

Three syntax forms allow this: theBIND keyword,expressions in theSELECT clause andexpressions in theGROUP BY clause. The assignment form is(expression AS ?var).

If the evaluation of the expression produces an error, the variable remains unbound for that solution but the query evaluation continues.

Data can also be directly included in a query usingVALUES for inline data.

10.1BIND: Assigning to Variables

TheBIND form allows a value to be assigned to a variable from a basic graph pattern or property path expression. Use ofBIND ends the preceding basic graph pattern. The variable introduced by theBIND clause must not have been used in the group graph pattern up to the point of use inBIND.

Example:

Data:

PREFIX dc:   <http://purl.org/dc/elements/1.1/>PREFIX :     <http://example.org/book/>PREFIX ns:   <http://example.org/ns#>:book1  dc:title     "SPARQL Tutorial" .:book1  ns:price     42 .:book1  ns:discount  0.2 .:book2  dc:title     "The Semantic Web" .:book2  ns:price     23 .:book2  ns:discount  0.25 .

Query:

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>PREFIX  ns:  <http://example.org/ns#>SELECT  ?title ?price{       ?x ns:price ?p .    ?x ns:discount ?discount    BIND (?p*(1-?discount) AS ?price)    FILTER(?price < 20)    ?x dc:title ?title . }

Equivalent query (BIND ends the basic graph pattern; theFILTER applies to the whole group graph pattern):

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>PREFIX  ns:  <http://example.org/ns#>SELECT  ?title ?price{  { ?x ns:price ?p .     ?x ns:discount ?discount     BIND (?p*(1-?discount) AS ?price)    }    {?x dc:title ?title . }    FILTER(?price < 20)}

Results:

title	price
"The Semantic Web"	17.25

10.2VALUES: Providing inline data

Data can be directly written in a graph pattern or added to a query usingVALUES.VALUES provides inline data as asolution sequence which are combined with the results of query evaluation by ajoin operation. It can be used by an application to provide specific requirements on query results and also by SPARQL query engine implementations that providefederated query through theSERVICE keyword to send a more constrained query to a remote query service.

10.2.1VALUES syntax

VALUES allows multiple variables to be specified in the data block; there is a special syntax for the common case of specifying just one variable and some values.

In the following example, there is a table of two variables,?x and?y. The second row has no value for?y.

VALUES (?x ?y) {    (:uri1 1)    (:uri2 UNDEF)}

Optionally, when there is a single variable and some values:

VALUES ?z { "abc" "def" }

which is the same as using the general form:

VALUES (?z) { ("abc") ("def") }

10.2.2VALUES Examples

AVALUES block of data can appear in a query pattern or at the end of aSELECT query, including asubquery.

Data:

PREFIX dc:   <http://purl.org/dc/elements/1.1/>PREFIX :     <http://example.org/book/>PREFIX ns:   <http://example.org/ns#>:book1  dc:title  "SPARQL Tutorial" .:book1  ns:price  42 .:book2  dc:title  "The Semantic Web" .:book2  ns:price  23 .

Query:

PREFIX dc:   <http://purl.org/dc/elements/1.1/> PREFIX :     <http://example.org/book/> PREFIX ns:   <http://example.org/ns#> SELECT ?book ?title ?price{    VALUES ?book { :book1 :book3 }    ?book dc:title ?title ;          ns:price ?price .}

Result:

book	title	price
<http://example.org/book/book1>	"SPARQL Tutorial"	42

If a variable has no value for a particular solution in theVALUES clause, the keywordUNDEF is used instead of an RDF term.

PREFIX dc:   <http://purl.org/dc/elements/1.1/> PREFIX :     <http://example.org/book/> PREFIX ns:   <http://example.org/ns#> SELECT ?book ?title ?price{    ?book dc:title ?title ;          ns:price ?price .    VALUES (?book ?title) {        (UNDEF "SPARQL Tutorial")        (:book2 UNDEF)    }}

book	title	price
<http://example.org/book/book1>	"SPARQL Tutorial"	42
<http://example.org/book/book2>	"The Semantic Web"	23

In this example, theVALUES might have been specified to execute over the results of theSELECT query:

PREFIX dc:   <http://purl.org/dc/elements/1.1/> PREFIX :     <http://example.org/book/> PREFIX ns:   <http://example.org/ns#> SELECT ?book ?title ?price {    ?book dc:title ?title ;          ns:price ?price .}VALUES (?book ?title) {    (UNDEF "SPARQL Tutorial")    (:book2 UNDEF)}

This is a different query but, in the example situation, has the same results.

11.Aggregates

Aggregates apply expressions over groups of solutions. By default a solution set consists of a single group, containing all solutions.

Grouping may be specified using theGROUP BY syntax.

Aggregates defined in version 1.1 of SPARQL areCOUNT,SUM,MIN,MAX,AVG,GROUP_CONCAT, andSAMPLE.

Aggregates are used where the querier wishes to see a result which is computed over a group of solutions, rather than a single solution. For example the maximum value that a particular variable takes, rather than each value individually.

11.1Aggregate Example

Data:

PREFIX : <http://books.example/>:org1 :affiliates :auth1, :auth2 .:auth1 :writesBook :book1, :book2 .:book1 :price 9 .:book2 :price 5 .:auth2 :writesBook :book3 .:book3 :price 7 .:org2 :affiliates :auth3 .:auth3 :writesBook :book4 .:book4 :price 7 .

Query:

PREFIX : <http://books.example/>SELECT (SUM(?lprice) AS ?totalPrice)WHERE {    ?org :affiliates ?auth .    ?auth :writesBook ?book .    ?book :price ?lprice .}GROUP BY ?orgHAVING (SUM(?lprice) > 10)

Results:

totalPrice
21

This example demonstrates two features of aggregates:GROUP BY, which groups query solutions according to one or more expressions (in this case?org), andHAVING, which is analogous to aFILTER expression, but operates over groups, rather than individual solutions.

The example is produced by grouping solutions according to theGROUP BY expression (i.e. all solutions where?org takes a particular value appear within the same group), and evaluating the Set FunctionSUM over that group. The groups are then filtered by theHAVING expression, which removes all groups whereSUM(?lprice) is not greater than 10.

In aggregate queries and sub-queries, variables that appear in the query pattern, but are not in theGROUP BY clause, can only be projected or used in select expressions if they are aggregated. TheSAMPLE aggregate may be used for this purpose. For details see the section onProjection Restrictions.

It should be noted thatas per functions, aggregate expressions are required to be aliased (again, similar to theBIND clause, using the keywordAS) in order to project them from queries or subqueries. In the example above this is done using the variable?totalPrice. It is an error for aggregates to project variables with a name already used in other aggregate projections, or in theWHERE clause.

11.2GROUP BY

In order to calculate aggregate values for a solution, the solution is first divided into one or more groups, and the aggregate value is calculated for each group.

If aggregates are used in the query level inSELECT,HAVING orORDER BY but theGROUP BY term is not used, then this is taken to be a single implicit group, to which all solutions belong.

WithinGROUP BY clauses the binding keyword,AS, may be used, such asGROUP BY (?x + ?y AS ?z). This is equivalent to{ ... BIND (?x + ?y AS ?z) } GROUP BY ?z.

For example, given a solution sequenceS,( {?x→2, ?y→3}, {?x→2, ?y→5}, {?x→6, ?y→7} ), we might wish to group the solutions according to the value of?x, and calculate the average of the values of?y for each group.

This could be written as:

SELECT (AVG(?y) AS ?avg)WHERE {    ?a :x ?x ;    :y ?y .}GROUP BY ?x

11.3HAVING

HAVING operates over grouped solution sets, in the same way thatFILTER operates over un-grouped ones.

HAVING expressions have the same evaluation rules as projections from grouped queries, as described in the following section.

An example of the use ofHAVING is given below.

PREFIX : <http://data.example/>SELECT (AVG(?size) AS ?asize)WHERE {    ?x :size ?size}GROUP BY ?xHAVING(AVG(?size) > 10)

This will return average sizes, grouped by the subject, but only where the mean size is greater than 10.

11.4Aggregate Projection Restrictions

In a query level which uses aggregates, only expressions consisting of aggregates and constants may be projected, with one exception. WhenGROUP BY is given with one or more simple expressions consisting of just a variable, those variables may be projected from the level.

For example, the following query is legal as ?x is given as aGROUP BY term.

PREFIX : <http://example.com/data/#>SELECT ?x (MIN(?y) * 2 AS ?min)WHERE {    ?x :p ?y .    ?x :q ?z .} GROUP BY ?x (STR(?z))

Note that it would not be legal to projectSTR(?z) as this is not a simple variable expression. However, withGROUP BY (STR(?z) AS ?strZ) it would be possible to project?strZ.

Other expressions, not usingGROUP BY variables, or aggregates may have non-deterministic values projected from their groups using theSAMPLE aggregate.

11.5Aggregate Example (with errors)

This section shows an example query using aggregation, which demonstrates how errors are handled in results, in the presence of aggregates.

Data:

PREFIX : <http://example.com/data/#>:x :p 1, 2, 3, 4 .:y :p 1, _:b2, 3, 4 .:z :p 1.0, 2.0, 3.0, 4 .

Query:

PREFIX : <http://example.com/data/#>SELECT ?g (AVG(?p) AS ?avg) ((MIN(?p) + MAX(?p)) / 2 AS ?c)WHERE {    ?g :p ?p .}GROUP BY ?g

Result:

g	avg	c
<http://example.com/data/#x>	2.5	2.5
<http://example.com/data/#y>
<http://example.com/data/#z>	2.5	2.5

Note that the bindings for the :y group is not included in the results as the evaluation of Avg({1, _:b2, 3, 4}), and (_:b2 + 4) / 2 is an error, removing the bindings from the solution.

12.Subqueries

Subqueries are a way to embed SPARQL queries within other queries, normally to achieve results which cannot otherwise be achieved, such as limiting the number of results from some sub-expression within the query.

Due to the bottom-up nature of SPARQL query evaluation, the subqueries are evaluated logically first, and the results are projected up to the outer query.

Note that only variables projected out of the subquery will be visible, orin scope, to the outer query.

12.1Example

Data:

PREFIX : <http://people.example/>:alice :name "Alice", "Alice Foo", "A. Foo" .:alice :knows :bob, :carol .:bob :name "Bob", "Bob Bar", "B. Bar" .:carol :name "Carol", "Carol Baz", "C. Baz" .

Return a name (the one with the lowest sort order) for all the people that know Alice and have a name.

Query:

PREFIX : <http://people.example/>PREFIX : <http://people.example/>SELECT ?y ?minNameWHERE {    :alice :knows ?y .    {      SELECT ?y (MIN(?name) AS ?minName)      WHERE {          ?y :name ?name .      } GROUP BY ?y    }}

Results:

y	minName
:bob	"B. Bar"
:carol	"C. Baz"

This result is achieved by first evaluating the inner query:

SELECT ?y (MIN(?name) AS ?minName)WHERE {    ?y :name ?name .} GROUP BY ?y

This produces the following solution sequence:

y	minName
:alice	"A. Foo"
:bob	"B. Bar"
:carol	"C. Baz"

Which is joined with the results of the outer query:

y
:bob
:carol

13.RDF Dataset

The RDF data model expresses information as graphs consisting of triples with subject, predicate and object. Many RDF data stores hold multiple RDF graphs and record information about each graph, allowing an application to make queries that involve information from more than one graph.

A SPARQL query is executed against anRDF Dataset [RDF12-CONCEPTS] which represents a collection of graphs. An RDF Dataset comprises one graph, the default graph, which does not have a name, and zero or more named graphs, where each named graph is identified by an IRI or a blank node. A SPARQL query can match different parts of the query pattern against different graphs as described in section13.3 Querying the Dataset.

An RDF Dataset may contain zero named graphs; an RDF Dataset always contains one default graph. A query does not need to involve matching the default graph; the query can just involve matching named graphs.

The graph that is used for matching a basic graph pattern is theactive graph. In the previous sections, all queries have been shown executed against a single graph, the default graph of an RDF dataset as the active graph. TheGRAPH keyword is used to make the active graph one of all of the named graphs in the dataset for part of the query.

13.1Examples of RDF Datasets

Thedefinition of RDF Dataset [RDF12-CONCEPTS] does not restrict the relationships of named and default graphs. Information can be repeated in different graphs; relationships between graphs can be exposed. Two useful arrangements are:

to have information in the default graph that includes provenance information about the named graphs
to include the information in the named graphs in the default graph as well.

Example 1:

PREFIX dc: <http://purl.org/dc/elements/1.1/><http://example.org/bob>    dc:publisher  "Bob" .<http://example.org/alice>  dc:publisher  "Alice" .GRAPH <http://example.org/bob> {    _:a foaf:name "Bob" .    _:a foaf:mbox <mailto:bob@oldcorp.example.org> .}GRAPH <http://example.org/alice> {    _:a foaf:name "Alice" .    _:a foaf:mbox <mailto:alice@work.example.org> .}

In this example, the default graph contains the names of the publishers of two named graphs. The triples in the named graphs are not visible in the default graph in this example.

Example 2:

RDF data can be combined by theRDF merge [RDF12-SEMANTICS] of graphs. One possible arrangement of graphs in an RDF Dataset is to have the default graph be the RDF merge of some or all of the information in the named graphs.

In this next example, the named graphs contain the same triples as before. The RDF dataset includes anRDF merge of the named graphs in the default graph, which keeps blank nodes distinct.

PREFIX foaf: <http://xmlns.com/foaf/0.1/>_:x foaf:name "Bob" ._:x foaf:mbox <mailto:bob@oldcorp.example.org> ._:y foaf:name "Alice" ._:y foaf:mbox <mailto:alice@work.example.org> .GRAPH <http://example.org/bob> {    _:a foaf:name "Bob" .    _:a foaf:mbox <mailto:bob@oldcorp.example.org> .}GRAPH <http://example.org/alice> {    _:a foaf:name "Alice" .    _:a foaf:mbox <mailto:alice@work.example> .}

In an RDF merge, blank nodes in the merged graph are not shared with blank nodes from the graphs being merged.

13.2Specifying RDF Datasets

A SPARQL query may specify the dataset to be used for matching by using theFROM clause and theFROM NAMED clause to describe the RDF dataset. If a query provides such a dataset description, then it is used in place of any dataset that the query service would use if no dataset description is provided in a query. The RDF dataset may also bespecified in a SPARQL protocol request, in which case the protocol description overrides any description in the query itself. A query service may refuse a query request if the dataset description is not acceptable to the service.

TheFROM andFROM NAMED keywords allow a query to specify an RDF dataset by reference; they indicate that the dataset should include graphs that are obtained from representations of the resources identified by the given IRIs (i.e. the absolute form of the given IRI references). The dataset resulting from a number ofFROM andFROM NAMED clauses is:

a default graph consisting of the RDF merge of the graphs referred to in theFROM clauses, and
a set of (IRI, graph) pairs, one from eachFROM NAMED clause.

If there is noFROM clause, but there is one or moreFROM NAMED clauses, then the dataset includes an empty graph for the default graph.

13.2.1Specifying the Default Graph

EachFROM clause contains an IRI that indicates a graph to be used to form the default graph. This does not put the graph in as a named graph.

In this example, the RDF Dataset contains a single default graph and no named graphs:

# Default graph (located at http://example.org/foaf/aliceFoaf)PREFIX  foaf:  <http://xmlns.com/foaf/0.1/>_:a  foaf:name     "Alice" ._:a  foaf:mbox     <mailto:alice@work.example> .

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT  ?nameFROM    <http://example.org/foaf/aliceFoaf>WHERE   { ?x foaf:name ?name }

name
"Alice"

If a query provides more than oneFROM clause, providing more than one IRI to indicate the default graph, then the default graph is theRDF merge of the graphs obtained from representations of the resources identified by the given IRIs.

13.2.2Specifying Named Graphs

A query can supply IRIs for the named graphs in the RDF Dataset using theFROM NAMED clause. Each IRI is used to provide one named graph in the RDF Dataset. Using the same IRI in two or moreFROM NAMED clauses results in one named graph with that IRI appearing in the dataset.

# Graph: http://example.org/bobPREFIX foaf: <http://xmlns.com/foaf/0.1/>_:a foaf:name "Bob" ._:a foaf:mbox <mailto:bob@oldcorp.example.org> .

# Graph: http://example.org/alicePREFIX foaf: <http://xmlns.com/foaf/0.1/>_:a foaf:name "Alice" ._:a foaf:mbox <mailto:alice@work.example> .

...FROM NAMED <http://example.org/alice>FROM NAMED <http://example.org/bob>...

TheFROM NAMED syntax suggests that the IRI identifies the corresponding graph, but the relationship between an IRI and a graph in an RDF dataset is indirect. The IRI identifies a resource, and the resource is represented by a graph (or, more precisely: by a document that serializes a graph). Forfurther details see [WEBARCH].

13.2.3Combining FROM and FROM NAMED

TheFROM clause andFROM NAMED clause can be used in the same query.

#Default graph (located at http://example.org/dft.ttl)PREFIX dc: <http://purl.org/dc/elements/1.1/><http://example.org/bob>    dc:publisher  "Bob Hacker" .<http://example.org/alice>  dc:publisher  "Alice Hacker" .

#Named graph: http://example.org/bobPREFIX foaf: <http://xmlns.com/foaf/0.1/>_:a foaf:name "Bob" ._:a foaf:mbox <mailto:bob@oldcorp.example.org> .

#Named graph: http://example.org/alicePREFIX foaf: <http://xmlns.com/foaf/0.1/>_:a foaf:name "Alice" ._:a foaf:mbox <mailto:alice@work.example.org> .

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX dc: <http://purl.org/dc/elements/1.1/>SELECT ?who ?g ?mboxFROM <http://example.org/dft.ttl>FROM NAMED <http://example.org/alice>FROM NAMED <http://example.org/bob>WHERE{    ?g dc:publisher ?who .    GRAPH ?g { ?x foaf:mbox ?mbox }}

The RDF Dataset for this query contains a default graph and two named graphs. TheGRAPH keyword is described below.

The actions required to construct the dataset are not determined by the dataset description alone. If an IRI is given twice in a dataset description, either by using twoFROM clauses, or aFROM clause and aFROM NAMED clause, then it does not assume that exactly one or exactly two attempts are made to obtain an RDF graph associated with the IRI. Therefore, no assumptions can be made about blank node identity in triples obtained from the two occurrences in the dataset description. In general, no assumptions can be made about the equivalence of the graphs.

13.3Querying the Dataset

When querying a collection of graphs, theGRAPH keyword is used to match patterns against named graphs.GRAPH can provide an IRI to select one graph or use a variable which will range over the IRI of all the named graphs in the query's RDF dataset.

The use ofGRAPH changes the active graph for matching graph patterns within that part of the query. Outside the use ofGRAPH, matching is done using the default graph.

The following two graphs will be used in examples:

# Named graph: http://example.org/foaf/aliceFoafPREFIX  foaf:     <http://xmlns.com/foaf/0.1/>PREFIX  rdf:      <http://www.w3.org/1999/02/22-rdf-syntax-ns#>PREFIX  rdfs:     <http://www.w3.org/2000/01/rdf-schema#>_:a  foaf:name     "Alice" ._:a  foaf:mbox     <mailto:alice@work.example> ._:a  foaf:knows    _:b ._:b  foaf:name     "Bob" ._:b  foaf:mbox     <mailto:bob@work.example> ._:b  foaf:nick     "Bobby" ._:b  rdfs:seeAlso  <http://example.org/foaf/bobFoaf> .<http://example.org/foaf/bobFoaf>rdf:type      foaf:PersonalProfileDocument .

# Named graph: http://example.org/foaf/bobFoafPREFIX  foaf:     <http://xmlns.com/foaf/0.1/>PREFIX  rdf:      <http://www.w3.org/1999/02/22-rdf-syntax-ns#>PREFIX  rdfs:     <http://www.w3.org/2000/01/rdf-schema#>_:z  foaf:mbox     <mailto:bob@work.example> ._:z  rdfs:seeAlso  <http://example.org/foaf/bobFoaf> ._:z  foaf:nick     "Robert" .<http://example.org/foaf/bobFoaf>        rdf:type      foaf:PersonalProfileDocument .

13.3.1Accessing Graph Names

The query below matches the graph pattern against each of the named graphs in the dataset and forms solutions which have thesrc variable bound to IRIs of the graph being matched. The graph pattern is matched with the active graph being each of the named graphs in the dataset.

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?src ?bobNickFROM NAMED <http://example.org/foaf/aliceFoaf>FROM NAMED <http://example.org/foaf/bobFoaf>WHERE{    GRAPH ?src    { ?x foaf:mbox <mailto:bob@work.example> .      ?x foaf:nick ?bobNick    }}

The query result gives the name of the graphs where the information was found and the value for Bob's nick:

src	bobNick
<http://example.org/foaf/aliceFoaf>	"Bobby"
<http://example.org/foaf/bobFoaf>	"Robert"

13.3.2Restricting by Graph IRI

The query can restrict the matching applied to a specific graph by supplying the graph IRI. This sets the active graph to the graph named by the IRI. This query looks for Bob's nick as given in the graphhttp://example.org/foaf/bobFoaf.

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX data: <http://example.org/foaf/>SELECT ?nickFROM NAMED <http://example.org/foaf/aliceFoaf>FROM NAMED <http://example.org/foaf/bobFoaf>WHERE{    GRAPH data:bobFoaf {        ?x foaf:mbox <mailto:bob@work.example> .        ?x foaf:nick ?nick     }}

which yields a single solution:

nick
"Robert"

13.3.3Restricting Possible Graph IRIs

A variable used in theGRAPH clause may also be used in anotherGRAPH clause or in a graph pattern matched against the default graph in the dataset.

The query below uses the graph with IRIhttp://example.org/foaf/aliceFoaf to find the profile document for Bob; it then matches another pattern against that graph. The pattern in the secondGRAPH clause finds the blank node (variablew) for the person with the same mail box (given by variablembox) as found in the firstGRAPH clause (variablewhom), because the blank node used to match for variablewhom from Alice's FOAF file is not the same as the blank node in the profile document (they are in different graphs).

PREFIX  data:  <http://example.org/foaf/>PREFIX  foaf:  <http://xmlns.com/foaf/0.1/>PREFIX  rdfs:  <http://www.w3.org/2000/01/rdf-schema#>SELECT ?mbox ?nick ?ppdFROM NAMED <http://example.org/foaf/aliceFoaf>FROM NAMED <http://example.org/foaf/bobFoaf>WHERE {    GRAPH data:aliceFoaf {        ?alice foaf:mbox <mailto:alice@work.example> ;               foaf:knows ?whom .        ?whom  foaf:mbox ?mbox ;               rdfs:seeAlso ?ppd .        ?ppd  a foaf:PersonalProfileDocument .    }    GRAPH ?ppd {        ?w foaf:mbox ?mbox ;           foaf:nick ?nick    }}

mbox	nick	ppd
<mailto:bob@work.example>	"Robert"	<http://example.org/foaf/bobFoaf>

Any triple in Alice's FOAF file giving Bob'snick is not used to provide a nick for Bob because the pattern involving variablenick is restricted byppd to a particular Personal Profile Document.

13.3.4Named and Default Graphs

Query patterns can involve both the default graph and the named graphs. In this example, an aggregator has read in a Web resource on two different occasions. Each time a graph is read into the aggregator, it is given an IRI by the local system. The graphs are nearly the same but the email address for "Bob" has changed.

In this example, the default graph is being used to record the provenance information and the RDF data actually read is kept in two separate graphs, each of which is given a different IRI by the system. The RDF dataset consists of two named graphs and the information about them.

RDF Dataset:

#Default graphPREFIX dc: <http://purl.org/dc/elements/1.1/>PREFIX g:  <tag:example.org,2005-06-06:>PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>g:graph1 dc:publisher "Bob" .g:graph1 dc:date "2004-12-06"^^xsd:date .g:graph2 dc:publisher "Bob" .g:graph2 dc:date "2005-01-10"^^xsd:date .

#Graph: locally allocated IRI: tag:example.org,2005-06-06:graph1PREFIX foaf: <http://xmlns.com/foaf/0.1/>_:a foaf:name "Alice" ._:a foaf:mbox <mailto:alice@work.example> ._:b foaf:name "Bob" ._:b foaf:mbox <mailto:bob@oldcorp.example.org> .

#Graph: locally allocated IRI: tag:example.org,2005-06-06:graph2PREFIX foaf: <http://xmlns.com/foaf/0.1/>_:a foaf:name "Alice" ._:a foaf:mbox <mailto:alice@work.example> ._:b foaf:name "Bob" ._:b foaf:mbox <mailto:bob@newcorp.example.org> .

This query finds email addresses, detailing the name of the person and the date the information was discovered.

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX dc:   <http://purl.org/dc/elements/1.1/>SELECT ?name ?mbox ?dateWHERE {   ?g dc:publisher ?name ;      dc:date ?date .   GRAPH ?g {        ?person foaf:name ?name ; foaf:mbox ?mbox   }}

The results show that the email address for "Bob" has changed.

name	mbox	date
"Bob"	<mailto:bob@oldcorp.example.org>	"2004-12-06"^^xsd:date
"Bob"	<mailto:bob@newcorp.example.org>	"2005-01-10"^^xsd:date

14.Basic Federated Query

This document incorporates the syntax for SPARQL federation extensions.

This feature is defined in the documentSPARQL 1.1 Federated Query.

15.Solution Sequences and Modifiers

Query patterns generate an unordered collection of solutions, eachsolution being a partial function from variables to RDF terms. These solutions are then treated as a sequence (a solution sequence), initially in no specific order; any sequence modifiers are then applied to create another sequence. Finally, this latter sequence is used to generate one of the results of aSPARQL query form.

Asolution sequence modifier is one of:

Order modifier: put the solutions in order
Projection modifier: choose certain variables
Distinct modifier: ensure solutions in the sequence are unique
Reduced modifier: permit elimination of some non-distinct solutions
Offset modifier: control where the solutions start from in the overall sequence of solutions
Limit modifier: restrict the number of solutions

Modifiers are applied in the order given by the list above.

15.1ORDER BY

TheORDER BY clause establishes the order of a solution sequence.

Following theORDER BY clause is a sequence of order comparators, composed of an expression and an optional order modifier (eitherASC() orDESC()). Each ordering comparator is either ascending (indicated by theASC() modifier or by no modifier) or descending (indicated by theDESC() modifier).

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?nameWHERE { ?x foaf:name ?name }ORDER BY ?name

PREFIX     :    <http://example.org/ns#>PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?nameWHERE { ?x foaf:name ?name ; :empId ?emp }ORDER BY DESC(?emp)

PREFIX     :    <http://example.org/ns#>PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?nameWHERE { ?x foaf:name ?name ; :empId ?emp }ORDER BY ?name DESC(?emp)

The"<" operator (see theOperator Mapping and17.3.1 Operator Extensibility) defines the relative order of pairs ofnumerics,xsd:strings,xsd:booleans andxsd:dateTimes. Pairs of IRIs are ordered by comparing them as literals with datatypexsd:string.

SPARQL also fixes an order between some kinds of RDF terms that would not otherwise be ordered:

(Lowest) no value assigned to the variable or expression in this solution.
Blank nodes
IRIs
RDF literals

SPARQL does not define a total ordering of all possible RDF terms. Here are a few examples of pairs of terms for which the relative order is undefined:

"a" and "a"@en_gb (a literal with datatypexsd:string and a literal with a language tag)
"a"@en_gb and "b"@en_gb (two literals with language tags)
"a" and "1"^^xsd:integer (a literal with datatypexsd:string and a literal with a supported datatype)
"1"^^my:integer and "2"^^my:integer (two unsupported datatypes)
"1"^^xsd:integer and "2"^^my:integer (a supported datatype and an unsupported datatype)

This list of variable bindings is in ascending order:

RDF Term	Reason
	Unbound results sort earliest.
`_:z`	Blank nodes follow unbound.
`_:a`	There is no relative ordering of blank nodes.
`<http://script.example/Latin>`	IRIs follow blank nodes.
`<http://script.example/Кириллица>`	The character in the 23rd position, "К", has a unicode codepoint 0x41A, which is higher than 0x4C ("L").
`<http://script.example/漢字>`	The character in the 23rd position, "漢", has a unicode codepoint 0x6F22, which is higher than 0x41A ("К").
`"http://script.example/Latin"`	`xsd:strings` follow IRIs.

The ascending order of two solutions with respect to an ordering comparator is established by substituting the solution bindings into the expressions and comparing them with the"<" operator. The descending order is the reverse of the ascending order.

The relative order of two solutions is the relative order of the two solutions with respect to the first ordering comparator in the sequence. For solutions where the substitutions of the solution bindings produce the same RDF term, the order is the relative order of the two solutions with respect to the next ordering comparator. The relative order of two solutions is undefined if no order expression evaluated for the two solutions produces distinct RDF terms.

Ordering a sequence of solutions always results in a sequence with the same number of solutions in it.

UsingORDER BY on a solution sequence for aCONSTRUCT orDESCRIBE query has no direct effect because onlySELECT returns a sequence of results. Used in combination withLIMIT andOFFSET,ORDER BY can be used to return results generated from a different slice of the solution sequence. AnASK query does not includeORDER BY,LIMIT orOFFSET.

15.2Projection

The solution sequence can be transformed into one involving only a subset of the variables. For each solution in the sequence, a new solution is formed using a specified selection of the variables using the SELECT query form.

The following example shows a query to extract just the names of people described in an RDF graph using FOAF properties.

PREFIX foaf:        <http://xmlns.com/foaf/0.1/>_:a  foaf:name       "Alice" ._:a  foaf:mbox       <mailto:alice@work.example> ._:b  foaf:name       "Bob" ._:b  foaf:mbox       <mailto:bob@work.example> .

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>SELECT ?nameWHERE{ ?x foaf:name ?name }

name
"Bob"
"Alice"

15.3Duplicate Solutions

A solution sequence with noDISTINCT orREDUCED query modifier will preserve duplicate solutions.

Data:

PREFIX  foaf:  <http://xmlns.com/foaf/0.1/>_:x    foaf:name   "Alice" ._:x    foaf:mbox   <mailto:alice@example.com> ._:y    foaf:name   "Alice" ._:y    foaf:mbox   <mailto:asmith@example.com> ._:z    foaf:name   "Alice" ._:z    foaf:mbox   <mailto:alice.smith@example.com> .

Query:

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?name WHERE { ?x foaf:name ?name }

Results:

name
"Alice"
"Alice"
"Alice"

The modifiersDISTINCT andREDUCED affect whether duplicates are included in the query results.

DISTINCT

TheDISTINCT solution modifier eliminates duplicate solutions. Only one solution solution that binds the same variables to the same RDF terms is returned from the query.

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT DISTINCT ?name WHERE { ?x foaf:name ?name }

name
"Alice"

Note that, per theorder of solution sequence modifiers, duplicates are eliminated before either limit or offset is applied.

REDUCED

While theDISTINCT modifier ensures that duplicate solutions are eliminated from the solution set,REDUCED simply permits them to be eliminated. The multiplicity of any solution in aREDUCED solution set is at least one and not more than the multiplicity of the solution within the solution set with noDISTINCT orREDUCED modifier. For example, using the data above, the query

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT REDUCED ?name WHERE { ?x foaf:name ?name }

may have one, two (shown here) or three solutions:

name
"Alice"
"Alice"

15.4OFFSET

OFFSET causes the solutions generated to start after the specified number of solutions. AnOFFSET of zero has no effect.

UsingLIMIT andOFFSET to select different subsets of the query solutions will not be useful unless the order is made predictable by usingORDER BY.

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT  ?nameWHERE   { ?x foaf:name ?name }ORDER BY ?nameLIMIT   5OFFSET  10

15.5LIMIT

TheLIMIT clause puts an upper bound on the number of solutions returned. If the number of actual solutions, afterOFFSET is applied, is greater than the limit, then at most the limit number of solutions will be returned.

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?nameWHERE { ?x foaf:name ?name }LIMIT 20

ALIMIT of 0 would cause no results to be returned. A limit may not be negative.

16.Query Forms

SPARQL has four query forms. These query forms use the solutions from pattern matching to form result sets or RDF graphs. The query forms are:

SELECT
Returns all, or a subset of, the variables bound in a query pattern match.
CONSTRUCT
Returns an RDF graph constructed by substituting variables in a set of triple templates.
ASK
Returns a boolean indicating whether a query pattern matches or not.
DESCRIBE
Returns an RDF graph that describes the resources found.

Formats such asSPARQL 1.1 Query Results JSON Format,SPARQL Query Results XML Format (Second Edition) orSPARQL 1.1 Query Results CSV and TSV Formats can be used to serialize the result set from aSELECT query or the boolean result of anASK query.

16.1SELECT

The SELECT form of results returns variables and their bindings directly. It combines the operations of projecting the required variables with introducing new variable bindings into a query solution.

16.1.1Projection

Specific variables and their bindings are returned when a list of variable names is given in the SELECT clause. The syntaxSELECT * is an abbreviation that selects all of the variables that arein-scope at that point in the query. It excludes variables only used inFILTER, in the right-hand side ofMINUS, and takes account of subqueries.

Use ofSELECT * is only permitted when the query does not have aGROUP BY clause.

PREFIX  foaf:  <http://xmlns.com/foaf/0.1/>_:a    foaf:name   "Alice" ._:a    foaf:knows  _:b ._:a    foaf:knows  _:c ._:b    foaf:name   "Bob" ._:c    foaf:name   "Clare" ._:c    foaf:nick   "CT" .

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>SELECT ?nameX ?nameY ?nickYWHERE{ ?x foaf:knows ?y ;  foaf:name ?nameX .  ?y foaf:name ?nameY .  OPTIONAL { ?y foaf:nick ?nickY }}

nameX	nameY	nickY
"Alice"	"Bob"
"Alice"	"Clare"	"CT"

Result sets can be accessed by a local API but also can be serialized into either JSON, XML, CSV or TSV.

SPARQL 1.1 Query Results JSON Format:

{    "head": {        "vars": [ "nameX" , "nameY" , "nickY" ]    } ,    "results": {        "bindings": [          {            "nameX": { "type": "literal" , "value": "Alice" } ,            "nameY": { "type": "literal" , "value": "Bob" }          } ,          {            "nameX": { "type": "literal" , "value": "Alice" } ,            "nameY": { "type": "literal" , "value": "Clare" } ,            "nickY": { "type": "literal" , "value": "CT" }          }        ]    }}

SPARQL Query Results XML Format (Second Edition):

<?xml version="1.0"?><sparql xmlns="http://www.w3.org/2005/sparql-results#"><head><variable name="nameX"/><variable name="nameY"/><variable name="nickY"/></head><results><result><binding name="nameX"><literal>Alice</literal></binding><binding name="nameY"><literal>Bob</literal></binding></result><result><binding name="nameX"><literal>Alice</literal></binding><binding name="nameY"><literal>Clare</literal></binding><binding name="nickY"><literal>CT</literal></binding></result></results></sparql>

16.1.2SELECT Expressions

As well as choosing which variables from the pattern matching are included in the results, the SELECT clause can also introduce new variables. The rules of assignment in SELECT expression are the same as for assignment in BIND. The expression combines variable bindings already in the query solution, or defined earlier in the SELECT clause, to produce a binding in the query solution.

The scoping for(expr AS v) applies immediately. InSELECT expressions, the variable may be used in an expression later in the sameSELECT clause and may not be be assigned again in the sameSELECT clause.

Example:

Data:

PREFIX dc:   <http://purl.org/dc/elements/1.1/>PREFIX :     <http://example.org/book/>PREFIX ns:   <http://example.org/ns#>:book1  dc:title  "SPARQL Tutorial" .:book1  ns:price  42 .:book1  ns:discount 0.2 .:book2  dc:title  "The Semantic Web" .:book2  ns:price  23 .:book2  ns:discount 0.25 .

Query:

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>PREFIX  ns:  <http://example.org/ns#>SELECT  ?title (?p*(1-?discount) AS ?price){ ?x ns:price ?p .  ?x dc:title ?title .   ?x ns:discount ?discount }

Results:

title	price
"The Semantic Web"	17.25
"SPARQL Tutorial"	33.6

New variables can also be used in expressions if they are introduced earlier, syntactically, in the same SELECT clause:

PREFIX  dc:  <http://purl.org/dc/elements/1.1/>PREFIX  ns:  <http://example.org/ns#>SELECT  ?title (?p AS ?fullPrice) (?fullPrice*(1-?discount) AS ?customerPrice){ ?x ns:price ?p .  ?x dc:title ?title .   ?x ns:discount ?discount }

Results:

title	fullPrice	customerPrice
"The Semantic Web"	23	17.25
"SPARQL Tutorial"	42	33.6

16.2CONSTRUCT

TheCONSTRUCT query form returns a single RDF graph specified by a graph template. The result is an RDF graph formed by taking each query solution in the solution sequence, substituting for the variables in the graph template, and combining the triples into a single RDF graph by set union.

If any such instantiation produces a triple containing an unbound variable or an illegal RDF construct, such as a literal in subject or predicate position, then that triple is not included in the output RDF graph. The graph template can contain triples with no variables (known as ground or explicit triples), and these also appear in the output RDF graph returned by the CONSTRUCT query form.

Note

The construction of the result graph by "set union" does not enforce whether or not duplicated triples appear in the graph serialization. Implementations are allowed to produce duplicate triples or to deduplicate them.

PREFIX  foaf:  <http://xmlns.com/foaf/0.1/>_:a    foaf:name   "Alice" ._:a    foaf:mbox   <mailto:alice@example.org> .

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>PREFIX vcard:   <http://www.w3.org/2001/vcard-rdf/3.0#>CONSTRUCT   { <http://example.org/person#Alice> vcard:FN ?name }WHERE       { ?x foaf:name ?name }

creates vcard properties from the FOAF information:

PREFIX vcard: <http://www.w3.org/2001/vcard-rdf/3.0#><http://example.org/person#Alice> vcard:FN "Alice" .

16.2.1Templates with Blank Nodes

A template can create an RDF graph containing blank nodes. The blank node identifiers inside the template are scoped to the template for each solution, while blank nodes from query solutions are not scoped. If the same identifier occurs twice in a template, every occurrence is replaced by the same blank node which is created for each query solution, and there will be different blank nodes for triples generated by different query solutions.

PREFIX  foaf:  <http://xmlns.com/foaf/0.1/>_:a    foaf:givenname   "Alice" ._:a    foaf:family_name "Hacker" ._:b    foaf:firstname   "Bob" ._:b    foaf:surname     "Hacker" .

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>PREFIX vcard:   <http://www.w3.org/2001/vcard-rdf/3.0#>CONSTRUCT {     ?x  vcard:N _:v .    _:v vcard:givenName ?gname .    _:v vcard:familyName ?fname} WHERE {    { ?x foaf:firstname ?gname } UNION  { ?x foaf:givenname   ?gname } .    { ?x foaf:surname   ?fname } UNION  { ?x foaf:family_name ?fname } .}

creates vcard properties corresponding to the FOAF information:

PREFIX vcard: <http://www.w3.org/2001/vcard-rdf/3.0#>_:a vcard:N         _:v1 ._:v1 vcard:givenName  "Alice" ._:v1 vcard:familyName "Hacker" ._:b vcard:N         _:v2 ._:v2 vcard:givenName  "Bob" ._:v2 vcard:familyName "Hacker" .

The blank node with identifier_:v in the template will be replaced by a different blank node when the template is applied to each of the two query solutions. In this example, this will cause the template to generate blank nodes with identifier_:v1 and_:v2 in the results graph.

The blank nodes in the query solutions, shown with identifiers_:a and_:b, originate from the underlying RDF dataset and will not be altered.

16.2.2Accessing Graphs in the RDF Dataset

UsingCONSTRUCT, it is possible to extract parts or the whole of graphs from the target RDF dataset. This first example returns the graph (if it is in the dataset) with IRI labelhttp://example.org/aGraph; otherwise, it returns an empty graph.

CONSTRUCT { ?s ?p ?o } WHERE { GRAPH <http://example.org/aGraph> { ?s ?p ?o } . }

The access to the graph can be conditional on other information. For example, if the default graph contains metadata about the named graphs in the dataset, then a query like the following one can extract one graph based on information about the named graph:

PREFIX  dc: <http://purl.org/dc/elements/1.1/>PREFIX app: <http://example.org/ns#>PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>CONSTRUCT { ?s ?p ?o } WHERE{    GRAPH ?g { ?s ?p ?o } .    ?g dc:publisher <http://www.w3.org/> .    ?g dc:date ?date .    FILTER ( app:customDate(?date) > "2005-02-28T00:00:00Z"^^xsd:dateTime ) .}

whereapp:customDate identifies anextension function to turn the date format into anxsd:dateTime RDF term.

16.2.3Solution Modifiers and CONSTRUCT

The solution modifiers of a query affect the results of aCONSTRUCT query. In this example, the output graph from theCONSTRUCT template is derived from just two of the solutions from graph pattern matching. The query outputs a graph with the names of the people with the top two sites, rated by hits. The triples in the RDF graph are not ordered.

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX site: <http://example.org/stats#>_:a foaf:name "Alice" ._:a site:hits 2349 ._:b foaf:name "Bob" ._:b site:hits 105 ._:c foaf:name "Eve" ._:c site:hits 181 .

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX site: <http://example.org/stats#>CONSTRUCT { [] foaf:name ?name }WHERE{ [] foaf:name ?name ;  site:hits ?hits .}ORDER BY desc(?hits)LIMIT 2

PREFIX foaf: <http://xmlns.com/foaf/0.1/>_:x foaf:name "Alice" ._:y foaf:name "Eve" .

16.2.4CONSTRUCT WHERE

A short form for the CONSTRUCT query form is provided for the case where the template and the pattern are the same and the pattern is just a basic graph pattern (noFILTERs and no complex graph patterns are allowed in the short form). The keywordWHERE is required in the short form.

The following two queries are the same; the first is a short form of the second.

PREFIX foaf: <http://xmlns.com/foaf/0.1/>CONSTRUCT WHERE { ?x foaf:name ?name }

PREFIX foaf: <http://xmlns.com/foaf/0.1/>CONSTRUCT { ?x foaf:name ?name } WHERE{ ?x foaf:name ?name }

16.3ASK

Applications can use theASK form to test whether or not a query pattern has a solution. No information is returned about the possible query solutions, just whether or not a solution exists.

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>_:a  foaf:name       "Alice" ._:a  foaf:homepage   <http://work.example.org/alice/> ._:b  foaf:name       "Bob" ._:b  foaf:mbox       <mailto:bob@work.example> .

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>ASK  { ?x foaf:name  "Alice" }

true

TheSPARQL Query Results XML Format (Second Edition) form of this result set gives:

<?xml version="1.0"?>                <sparql xmlns="http://www.w3.org/2005/sparql-results#">                <head></head>                <boolean>true</boolean>                </sparql>

On the same data, the following returns no match because Alice'smbox is not mentioned.

PREFIX foaf:    <http://xmlns.com/foaf/0.1/>ASK  {   ?x foaf:name  "Alice" ;      foaf:mbox  <mailto:alice@work.example>}

false

16.4DESCRIBE (Informative)

TheDESCRIBE form returns a single result RDF graph containing RDF data about resources. This data is not prescribed by a SPARQL query, where the query client would need to know the structure of the RDF in the data source, but, instead, is determined by the SPARQL query processor. The query pattern is used to create a result set. TheDESCRIBE form takes each of the resources identified in a solution, together with any resources directly named by IRI, and assembles a single RDF graph by taking a "description" which can come from any information available including the target RDF Dataset. The description is determined by the query service. The syntaxDESCRIBE * is an abbreviation that describes all of the variables in a query.

16.4.1Explicit IRIs

TheDESCRIBE clause itself can take IRIs to identify the resources. The simplestDESCRIBE query is just an IRI in theDESCRIBE clause:

DESCRIBE <http://example.org/>

16.4.2Identifying Resources

The resources to be described can also be taken from the bindings to a query variable in a result set. This enables description of resources whether they are identified by IRI or by blank node in the dataset:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>DESCRIBE ?xWHERE    { ?x foaf:mbox <mailto:alice@org> }

The propertyfoaf:mbox is defined as being an inverse functional property in the FOAF vocabulary. If treated as such, this query will return information about at most one person. If, however, the query pattern has multiple solutions, the RDF data for each is the union of all RDF graph descriptions.

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>DESCRIBE ?xWHERE    { ?x foaf:name "Alice" }

More than one IRI or variable can be given:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>DESCRIBE ?x ?y <http://example.org/>WHERE    {?x foaf:knows ?y}

16.4.3Descriptions of Resources

The RDF returned is determined by the information publisher. It may be information the service deems relevant to the resources being described. It may include information about other resources: for example, the RDF data for a book may also include details about the author.

A simple query such as

PREFIX ent:  <http://org.example.com/employees#>DESCRIBE ?x WHERE { ?x ent:employeeId "1234" }

might return a description of the employee and some other potentially useful details:

PREFIX foaf:   <http://xmlns.com/foaf/0.1/>PREFIX vcard:  <http://www.w3.org/2001/vcard-rdf/3.0>PREFIX exOrg:  <http://org.example.com/employees#>PREFIX rdf:    <http://www.w3.org/1999/02/22-rdf-syntax-ns#>PREFIX owl:    <http://www.w3.org/2002/07/owl#>_:a     exOrg:employeeId    "1234" ;        foaf:mbox_sha1sum   "bee135d3af1e418104bc42904596fe148e90f033" ;        vcard:N          [ vcard:Family       "Smith" ;            vcard:Given        "John"  ] .foaf:mbox_sha1sum  rdf:type  owl:InverseFunctionalProperty .

which includes the blank node closure for thevCard vocabularyvcard:N. Other possible mechanisms for deciding what information to return include Concise Bounded Descriptions [CBD].

For a vocabulary such as FOAF, where the resources are typically blank nodes, returning sufficient information to identify a node such as the InverseFunctionalPropertyfoaf:mbox_sha1sum as well as information like name and other details recorded would be appropriate. In the example, the match to theWHERE clause was returned, but this is not required.

17.Expressions and Testing Values

SPARQLFILTERs restrict the solutions of a graph pattern match according to a givenconstraint. Specifically,FILTERs eliminate any solutions that, when substituted into the expression, either result in an effective boolean value offalse or produce an error. Effective boolean values are defined in section17.2.2Effective Boolean Value and errors are defined inXQuery 3.1: An XML Query Language [XQUERY-31] section2.3.1,Kinds of Errors. These errors have no effect outside ofFILTER evaluation.

RDF Literals have datatypes that determine the value of the literal.

PREFIX a:          <http://www.w3.org/2000/10/annotation-ns#>PREFIX dc:         <http://purl.org/dc/elements/1.1/>_:a   a:annotates   <http://www.w3.org/TR/rdf-sparql-query/> ._:a   dc:date       "2004-12-31T19:00:00-05:00" ._:b   a:annotates   <http://www.w3.org/TR/rdf-sparql-query/> ._:b   dc:date       "2004-12-31T19:01:00-05:00"^^<http://www.w3.org/2001/XMLSchema#dateTime> .

The object of the firstdc:date triple is a literal that has a datatype ofxsd:string. The second has the datatypexsd:dateTime. They are differentRDF terms with different values.

SPARQL expressions are constructed according to the grammar and provide access to functions (named by IRI) and operator functions (invoked by keywords and symbols in the SPARQL grammar). SPARQL operators can be used to compare the values of literals:

PREFIX a:      <http://www.w3.org/2000/10/annotation-ns#>PREFIX dc:     <http://purl.org/dc/elements/1.1/>PREFIX xsd:    <http://www.w3.org/2001/XMLSchema#>SELECT ?annotWHERE { ?annot  a:annotates  <http://www.w3.org/TR/rdf-sparql-query/> .        ?annot  dc:date      ?date .        FILTER ( ?date > "2005-01-01T00:00:00Z"^^xsd:dateTime ) }

The SPARQL operators are listed insection 17.3 and are associated with their productions in the grammar.

In addition, SPARQL provides the ability to invoke arbitrary functions, including a subset of the XPath casting functions, listed insection 17.5. These functions are invoked by name (an IRI) within a SPARQL query. For example:

... FILTER ( xsd:dateTime(?date) < xsd:dateTime("2005-01-01T00:00:00Z") ) ...

Typographical convention in this section: XPath operators are labeled with the prefixop:. XPath operators have no namespace;op: is a labeling convention.

17.1Operand Data Types

SPARQL functions and operators operate on RDF terms and SPARQL variables. A subset of these functions and operators are taken from theXPath and XQuery Functions and Operators 3.1 [XPATH-FUNCTIONS-31] and have XML Schematyped value arguments and return types. RDFliterals passed as arguments to these functions and operators are mapped to XML Schema typed values with astring value of thelexical form and anatomic datatype corresponding to thedatatype IRI. The returned typed values are mapped back to RDFliterals the same way.

SPARQL has additional operators which operate on specific subsets of RDF terms. When referring to a type, the following terms denote aliteral with the correspondingW3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes [XMLSCHEMA11-2]datatype IRI:

The following terms identify additional types used in SPARQL value tests:

numeric denotesliterals with datatypesxsd:integer,xsd:decimal,xsd:float, orxsd:double.
RDF term denotes the typesIRI,literal,blank node, ortriple term.
variable denotes a SPARQL variable.

The following types are derived fromnumeric types and are valid arguments to functions and operators takingnumeric arguments:

SPARQL language extensions may treat additional types as being derived from XML schema datatypes.

17.2Filter Evaluation

SPARQL provides a subset of the functions and operators defined by XQueryOperator Mapping. The XQuery sectionExpression Processing describes the invocation of XPath functions. The following rules accommodate the differences in the data and execution models between XQuery and SPARQL:

Unlike XPath/XQuery, SPARQL functions do not process node sequences. When interpreting the semantics of XPath functions, assume that each argument is a sequence of a single node.
Functions invoked with an argument of the wrong type will produce atype error. Effective boolean value arguments (labeled "xsd:boolean (EBV)" in the operator mapping table below), are coerced toxsd:boolean using theEBV rules in section 17.2.2.
Apart from thefunctional forms BOUND,COALESCE,IF,IN,NOT IN,logical-or (||),logical-and (&&),NOT EXISTS, andEXISTS, all functions operate on RDF Terms and will produce a type error if any arguments are unbound.
Any expression other thanlogical-or (||) orlogical-and (&&) that encounters an error will produce that error.
Alogical-or that encounters an error on only one branch will return TRUE if the other branch is TRUE and an error if the other branch is FALSE.
Alogical-and that encounters an error on only one branch will return an error if the other branch is TRUE and FALSE if the other branch is FALSE.
Alogical-or orlogical-and that encounters errors on both branches will produceeither of the errors.

The logical-and and logical-or truth table for true (T), false (F), and error (E) is as follows:

A	B	A \|\| B	A && B
T	T	T	T
T	F	T	F
F	T	T	F
F	F	F	F
T	E	T	E
E	T	T	E
F	E	E	F
E	F	E	F
E	E	E	E

17.2.1Invocation

SPARQL defines a syntax for invoking functions on a list of arguments. Unless otherwise noted, these are invoked as follows:

Argument expressions are evaluated, producing argument values. The order of argument evaluation is not defined.
Numeric arguments are promoted as necessary to fit the expected types for that function or operator.
The function or operator is invoked on the argument values.

If any of these steps fails, the invocation generates an error. The effects of errors are defined inFilter Evaluation.

There are also "functional forms" which have different evaluation rules to functions as specified by each such form.

17.2.2Effective Boolean Value (EBV)

Effective boolean value is used to calculate the arguments to the logical functionslogical-and,logical-or, andfn:not, as well as evaluate the result of aFILTER expression.

The XQueryEffective Boolean Value rules rely on the definition of XPath'sfn:boolean. The following rules reflect the rules forfn:boolean applied to the argument types present in SPARQL queries:

The EBV of any literal whose type isxsd:boolean ornumeric is false if the lexical form is not valid for that datatype, such as"abc"^^xsd:integer.
If the argument is atyped literal with adatatype ofxsd:boolean, and it has a valid lexical form, the EBV is the value of that argument.
If the argument is a literal with adatatype ofxsd:string, the EBV is false if the operand value has zero length; otherwise the EBV is true.
If the argument is anumeric type or atyped literal with a datatype derived from anumeric type, and it has a valid lexical form, the EBV is false if the operand value is NaN or is numerically equal to zero; otherwise the EBV is true.
All other arguments, including unbound arguments, produce a type error.

An EBV oftrue is represented as atyped literal with a datatype ofxsd:boolean and a lexical value of "true"; an EBV of false is represented as aliteral with a datatype ofxsd:boolean and a lexical value of "false".

17.3Operator Mapping

The SPARQL grammar identifies a set of operators (for instance,&&,*,isIRI) used to construct constraints. The following table associates each of these grammatical productions with the appropriate operands and an operator function defined by eitherXPath and XQuery Functions and Operators 3.1 [XPATH-FUNCTIONS-31] or the SPARQL operators specified insection 17.4. When selecting the operator definition for a given set of parameters, the definition with the most specific parameters applies. For instance, when evaluatingxsd:integer = xsd:signedInt, the definition for= with twonumeric parameters applies, rather than the one with twoRDF terms. The table is arranged so that the upper-most viable candidate is the most specific. Operators invoked without appropriate operands result in a type error.

SPARQL follows XPath's scheme for numeric type promotions and subtype substitution for arguments to numeric operators. TheXPath Operator Mapping rules fornumeric operands (xsd:integer,xsd:decimal,xsd:float,xsd:double, and types derived from anumeric type) apply to SPARQL operators as well (seeXML Path Language (XPath) 3.1 [XPATH-31] for definitions ofnumeric type promotions andsubtype substitution). Some of the operators are associated with nested function expressions, e.g.fn:not(op:numeric-equal(A, B)). Note that per the XPath definitions,fn:not andop:numeric-equal produce an error if their argument is an error.

The collation forfn:compare isdefined by XPath and identified byhttp://www.w3.org/2005/xpath-functions/collation/codepoint. This collation allows for string comparison based on code point values. Codepoint string equivalence can be tested withRDF term equivalence.

SPARQL Unary Operators
Operator	Type(A)	Function	Result type
XQuery Unary Operators
! A	xsd:boolean(EBV)	fn:not(A)	xsd:boolean
+ A	numeric	op:numeric-unary-plus(A)	numeric
- A	numeric	op:numeric-unary-minus(A)	numeric

SPARQL Binary Operators
Operator	Type(A)	Type(B)	Function	Result type
Logical Connectives
A\|\| B	xsd:boolean(EBV)	xsd:boolean(EBV)	logical-or(A, B)	xsd:boolean
A&& B	xsd:boolean(EBV)	xsd:boolean(EBV)	logical-and(A, B)	xsd:boolean
XPath Tests
A= B	numeric	numeric	op:numeric-equal(A, B)	xsd:boolean
A= B	xsd:string	xsd:string	op:numeric-equal(fn:compare(STR(A),STR(B)), 0)	xsd:boolean
A= B	xsd:boolean	xsd:boolean	op:boolean-equal(A, B)	xsd:boolean
A= B	xsd:dateTime	xsd:dateTime	op:dateTime-equal(A, B)	xsd:boolean
A!= B	numeric	numeric	fn:not(op:numeric-equal(A, B))	xsd:boolean
A!= B	xsd:string	xsd:string	fn:not(op:numeric-equal(fn:compare(STR(A),STR(B)), 0))	xsd:boolean
A!= B	xsd:boolean	xsd:boolean	fn:not(op:boolean-equal(A, B))	xsd:boolean
A!= B	xsd:dateTime	xsd:dateTime	fn:not(op:dateTime-equal(A, B))	xsd:boolean
A< B	numeric	numeric	op:numeric-less-than(A, B)	xsd:boolean
A< B	xsd:string	xsd:string	op:numeric-equal(fn:compare(STR(A),STR(B)), -1)	xsd:boolean
A< B	xsd:boolean	xsd:boolean	op:boolean-less-than(A, B)	xsd:boolean
A< B	xsd:dateTime	xsd:dateTime	op:dateTime-less-than(A, B)	xsd:boolean
A> B	numeric	numeric	op:numeric-greater-than(A, B)	xsd:boolean
A> B	xsd:string	xsd:string	op:numeric-equal(fn:compare(STR(A),STR(B)), 1)	xsd:boolean
A> B	xsd:boolean	xsd:boolean	op:boolean-greater-than(A, B)	xsd:boolean
A> B	xsd:dateTime	xsd:dateTime	op:dateTime-greater-than(A, B)	xsd:boolean
A<= B	numeric	numeric	logical-or(op:numeric-less-than(A, B),op:numeric-equal(A, B))	xsd:boolean
A<= B	xsd:string	xsd:string	fn:not(op:numeric-equal(fn:compare(STR(A),STR(B)), 1))	xsd:boolean
A<= B	xsd:boolean	xsd:boolean	fn:not(op:boolean-greater-than(A, B))	xsd:boolean
A<= B	xsd:dateTime	xsd:dateTime	fn:not(op:dateTime-greater-than(A, B))	xsd:boolean
A>= B	numeric	numeric	logical-or(op:numeric-greater-than(A, B),op:numeric-equal(A, B))	xsd:boolean
A>= B	xsd:string	xsd:string	fn:not(op:numeric-equal(fn:compare(STR(A),STR(B)), -1))	xsd:boolean
A>= B	xsd:boolean	xsd:boolean	fn:not(op:boolean-less-than(A, B))	xsd:boolean
A>= B	xsd:dateTime	xsd:dateTime	fn:not(op:dateTime-less-than(A, B))	xsd:boolean
XPath Arithmetic
A* B	numeric	numeric	op:numeric-multiply(A, B)	numeric
A/ B	numeric	numeric	op:numeric-divide(A, B)	numeric; but xsd:decimal if both operands are xsd:integer
A+ B	numeric	numeric	op:numeric-add(A, B)	numeric
A- B	numeric	numeric	op:numeric-subtract(A, B)	numeric
SPARQL Tests
A= B	IRI	IRI	sameTerm(A, B)	xsd:boolean
A= B	Blank Node	Blank Node	sameTerm(A, B)	xsd:boolean
A= B	Triple Term	Triple Term	( A.subject = B.subject )&& ( A.predicate = B.predicate )&& ( A.object = B.object )	xsd:boolean
A!= B	IRI	IRI	fn:not(sameTerm(A, B))	xsd:boolean
A!= B	Blank Node	Blank Node	fn:not(sameTerm(A, B)	xsd:boolean
A!= B	Triple Term	Triple Term	( A.subject != B.subject )\|\| ( A.predicate != B.predicate )\|\| ( A.object != B.object )	xsd:boolean
A= B	RDF term	RDF term	sameValue(A, B)	xsd:boolean
A!= B	RDF term	RDF term	fn:not(sameValue(A, B))	xsd:boolean

xsd:boolean function arguments marked with "(EBV)" are coerced to xsd:boolean by evaluating theeffective boolean value of that argument.

Operators= and!= applied totriple terms apply the operator to each of the components.

17.3.1Operator Extensibility

SPARQL language extensions may provide additional associations between operators and operator functions; this amounts to adding rows to the table above. No additional operator may yield a result that replaces any result other than a type error. The consequence of this rule is that SPARQLFILTERs will produceat least the same intermediate bindings after applying aFILTER as an unextended implementation.

Additional mappings of the '<' operator are expected to control the relative ordering of the operands, specifically, when used in anORDER BY clause.

17.4Function Definitions

This section defines the operators and functions introduced by the SPARQL query language. The examples show the behavior of the operators as invoked by the appropriate grammatical constructs.

17.4.1Functional Forms

17.4.1.1BOUND

xsd:booleanBOUND (variablevar)

Returnstrue ifvar is bound to a value. Returns false otherwise. Variables with the value NaN or INF are considered bound.

Data:

PREFIX foaf:        <http://xmlns.com/foaf/0.1/>PREFIX dc:          <http://purl.org/dc/elements/1.1/>PREFIX xsd:          <http://www.w3.org/2001/XMLSchema#>_:a  foaf:givenName  "Alice"._:b  foaf:givenName  "Bob" ._:b  dc:date         "2005-04-04T04:04:04Z"^^xsd:dateTime .

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX dc:   <http://purl.org/dc/elements/1.1/>PREFIX xsd:   <http://www.w3.org/2001/XMLSchema#>SELECT ?givenNameWHERE {   ?x foaf:givenName  ?givenName .   OPTIONAL { ?x dc:date ?date } .   FILTER ( bound(?date) )}

Query result:

givenName
"Bob"

One may test whether a graph pattern isnot expressed by specifying anOPTIONAL graph pattern that introduces a variable and testing to see whether the variable is notbound This is calledNegation as Failure in logic programming.

This query matches the people with aname butno expresseddate:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX dc:   <http://purl.org/dc/elements/1.1/>SELECT ?nameWHERE {     ?x foaf:givenName  ?name .    OPTIONAL { ?x dc:date ?date } .    FILTER (!bound(?date)) }

Query result:

name
"Alice"

Because Bob'sdc:date was known,"Bob" was not a solution to the query.

17.4.1.2IF

rdfTermIF (expression1,expression2,expression3)

TheIF function form evaluates the first argument, interprets it as aeffective boolean value, then returns the value ofexpression2 if the EBV is true, otherwise it returns the value ofexpression3. Only one ofexpression2 andexpression3 is evaluated. If evaluating the first argument raises an error, then an error is raised for the evaluation of theIF expression.

Examples: Suppose ?x = 2, ?z = 0 and ?y is not bound in some query solution:

`IF(?x = 2, "yes", "no")`	returns "yes"
`IF(bound(?y), "yes", "no")`	returns "no"
`IF(?x=2, "yes", 1/?z)`	returns "yes", the expression`1/?z` is not evaluated
`IF(?x=1, "yes", 1/?z)`	raises an error
`IF("2" > 1, "yes", "no")`	raises an error

17.4.1.3COALESCE

rdfTermCOALESCE(expression, ....)

TheCOALESCE function form returns the RDF term value of the first expression that evaluates without error. In SPARQL, evaluating an unbound variable raises an error.

If none of the expressions evaluate without error, an error is raised.

If there are zero expressions, an error is raised.

Examples: Suppose ?x = 2 and ?y is not bound in some query solution:

`COALESCE(?x, 1/0)`	returns 2, the value of`x`
`COALESCE(1/0, ?x)`	returns 2
`COALESCE(5, ?x)`	returns 5
`COALESCE(?y, 3)`	returns 3
`COALESCE(?y)`	raises an error because`y` is not bound.
`COALESCE()`	raises an error because there are zero arguments.

17.4.1.4NOT EXISTS and EXISTS

There is a filter operatorEXISTS that takes a graph pattern.EXISTS returnstrue/false depending on whether the pattern matches the dataset given the bindings in the current group graph pattern, the dataset and theactive graph at this point in the query evaluation. No additional binding of variables occurs. TheNOT EXISTS form translates intofn:not(EXISTS{...}).

xsd:booleanNOT EXISTS {pattern }

Returnsfalse ifpattern matches. Returns true otherwise.

NOT EXISTS { pattern } is equivalent tofn:not(EXISTS { pattern }).

xsd:boolean EXISTS {pattern }

Returnstrue ifpattern matches. Returns false otherwise.

Variables in thepattern that are bound in the currentsolution mapping take the value that they have from the solution mapping. Variables in the patternpattern that are not bound in the current solution mapping take part in pattern matching.

To facilitate this, we introduce a functionExists that evaluates a SPARQL Algebra expression and returns true or false, depending on whether there are any solutions to the pattern, given the solution mapping being tested by the filter operation.

17.4.1.5logical-or

xsd:booleanlogical-or (xsd:booleanleft,xsd:booleanright)

This function cannot be used directly in expressions. The purpose of this function is to define the semantics of the "||" operator.

The function returns a logicalOR ofleft andright. Note thatlogical-or operates on theeffective boolean value of its arguments.

Note: see section 17.2,Filter Evaluation, for the|| operator's treatment of errors.

17.4.1.6logical-and

xsd:booleanlogical-and (xsd:booleanleft,xsd:booleanright)

This function cannot be used directly in expressions. The purpose of this function is to define the semantics of the "&&" operator.

The function returns a logicalAND ofleft andright. Note thatlogical-and operates on theeffective boolean value of its arguments.

Note: see section 17.2,Filter Evaluation, for the&& operator's treatment of errors.

17.4.1.7IN

boolean  rdfTermIN (expression,...)

TheIN operator tests whether the RDF term on the left-hand side is found in the list of values of the expressions on the right-hand side. The test is done with the "=" operator, which tests for the same value, as determined by theoperator mapping.

A list of zero terms on the right-hand side is legal and evaluates tofalse.

Errors in comparisons cause theIN expression to raise an error if the RDF term being tested is not found elsewhere in the list of terms.

IfIN is used with an expression to produce therdfTerm, then that expression is evaluated only once, before evaluating theIN expression.

TheIN operator is equivalent to the SPARQL expression:

(rdfTerm = value of expression1) || (rdfTerm = value of expression2) || ...

Examples:

`2 IN (1, 2, 3)`	true
`2 IN ()`	false
`2 IN (<http://example/iri>, "str", 2.0)`	true
`2 IN (1/0, 2)`	true
`2 IN (2, 1/0)`	true
`2 IN (3, 1/0)`	raises an error

17.4.1.8NOT IN

boolean  rdfTermNOT IN (expression,...)

TheNOT IN operator tests whether the RDF term on the left-hand side is not found in the values of list of the expressions on the right-hand side. The test is done with the "!=" operator, which tests that two values are not the same value, as determined by theoperator mapping.

A list of zero terms on the right-hand side is legal and evaluates totrue.

IfNOT IN is used with an expression to produce therdfTerm, then that expression is evaluated only once, before evaluating theNOT IN expression.

Errors in comparisons cause theNOT IN expression to raise an error if the RDF term being tested is not found elsewhere in the list of terms.

TheNOT IN operator is equivalent to the SPARQL expression:

(rdfTerm!=valueofexpression1)&&(rdfTerm!=valueofexpression2)&&...

NOT IN (...) is equivalent to!(IN (...)).

Examples:

`2 NOT IN (1, 2, 3)`	false
`2 NOT IN ()`	true
`2 NOT IN (<http://example/iri>, "str", 2.0)`	false
`2 NOT IN (1/0, 2)`	false
`2 NOT IN (2, 1/0)`	false
`2 NOT IN (3, 1/0)`	raises an error

17.4.2Functions on RDF Terms

17.4.2.1sameTerm

xsd:booleansameTerm (RDF termterm1,RDF termterm2)

Returns TRUE ifterm1 andterm2 are the same RDF term as defined inRDF 1.2 Concepts and Abstract Syntax [RDF12-CONCEPTS]; returns FALSE otherwise.

term1 andterm2 are thesame RDF term if one of the following is true:

term1 andterm2 areIRIs that are thesame as IRIs.
term1 andterm2 areliterals that areequal as literal terms.
term1 andterm2 areblank nodes that areequal as blank nodes.
term1 andterm2 aretriple terms that areequal as triple terms; that is, thesubject,predicate, andobject components are pair-wise the same term.

`sameTerm(<http://example/>, <http://example/>)`	true
`sameTerm(<http://example/>, <https://example/>)`	false
`sameTerm("abc", "abc")`	true
`sameTerm("abc"@en, "abc")`	false
`sameTerm("abc"@en, "abc"@EN)`	true
`sameTerm("abc"@en--rtl, "abc"@en)`	false
`sameTerm(2, 2.0)`	false
`sameTerm(2, "2"^^xsd:integer)`	true
`sameTerm(2, "02"^^xsd:integer)`	false

17.4.2.2sameValue

xsd:booleansameValue (RDF termterm1,RDF termterm2)

This function cannot be used directly in expressions. The purpose of this function is to define the semantics of the "=" operator when applied to two RDF terms that do not fall into the concrete cases covered in the operator mapping table in Section17.3Operator Mapping.

The result of this function is determined by going through the following steps.

Ifterm1 andterm2 areequal RDF terms, then return TRUE.
Ifterm1 orterm2 is anIRI or ablank node then return FALSE.
Ifterm1 andterm2 are bothliterals and one or both of these literals has a datatype that is not handled by the SPARQL processor, then produce a type error.
Ifterm1 andterm2 are bothliterals and one or both of these literals are known to beill-typed, then produce a type error.
"NaN"^^xsd:double and"NaN"^^xsd:float are considered to represent the same value. Ifterm1 andterm2 are both "NaN" for either xsd:double or xsd:float, then return TRUE.
Ifterm1 andterm2 are bothliterals and the SPARQL processor can determine that their the values are equal, then return TRUE.
Ifterm1 andterm2 are bothliterals and the SPARQL processor can determine the values can not be equal, then return FALSE.
Ifterm1 andterm2 are bothtriple terms, apply the functionsameValue pair-wise to each of the components. Return TRUE if each component pair returns TRUE; produce a type error if any component pair produces an error; otherwise return FALSE.
Otherwise, return FALSE.

Note

A literal isill-typed if its datatype is handled by the SPARQL processor and its lexical form is not in thelexical space of the datatype.

If the two arguments are literals, the functionsameValue returnstrue orfalse in cases where the SPARQL processor can determine that the values of these literals are equal or are not equal. If the SPARQL processor can not be sure, it returnserror.

Note

TheOperator Mapping for "=" is the functionfn:numeric-equal which is defined to returnfalse when comparing arguments involvingNaN. However,sameTerm("NaN"^^xsd:double, "NaN"^^xsd:double) is true. The functionsameValue definessameValue("NaN"^^xsd:double, "NaN"^^xsd:double) to be true because the arguments are the same element of the value space.

sameValue treats the values of"NaN"^^xsd:double and"NaN"^^xsd:float as being the same.sameValue("NaN"^^xsd:double, "NaN"^^xsd:float) istrue.

Note

For xsd:double and xsd:float,+0,-0 and0 are same value.

Note

An extended implementation may support additional datatypes for literals. An implementation processing a query that tests for equivalence of literals with non-recognized datatypes (and non-identical lexical form and datatype IRI) returns an error, indicating that it is unable to determine whether or not the values of the compared literals are equivalent. For example, an unextended implementation will produce an error when testing either"iiii"^^my:romanNumeral = "iv"^^my:romanNumeral or"iiii"^^my:romanNumeral != "iv"^^my:romanNumeral.

Examples:

sameValue	Results
`sameValue(1e10, "NaN"^^xsd:double)`	false
`sameValue("NaN"^^xsd:double, "NaN"^^xsd:double)`	true
`sameValue("NaN"^^xsd:double, "NaN"^^xsd:float)`	true
`sameValue( <<(:s :p 123)>> , <<(:s :p 123.0)>> )`	true

Note

This function was calledRDFterm-equal up until SPARQL 1.1.

17.4.2.3isIRI

xsd:booleanisIRI (RDF termterm)xsd:booleanisURI (RDF termterm)

Returnstrue ifterm is anIRI. Returnsfalse otherwise.isURI is an alternate spelling for theisIRI operator.

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>_:a  foaf:name       "Alice"._:a  foaf:mbox       <mailto:alice@work.example> ._:b  foaf:name       "Bob" ._:b  foaf:mbox       "bob@work.example" .

This query matches the people with aname and anmbox which is an IRI:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE {    ?x foaf:name  ?name ;       foaf:mbox  ?mbox .    FILTER isIRI(?mbox) }

Query result:

name	mbox
"Alice"	<mailto:alice@work.example>

17.4.2.4isBLANK

xsd:booleanisBLANK (RDF termterm)

Returnstrue ifterm is ablank node. Returnsfalse otherwise.

PREFIX a:          <http://www.w3.org/2000/10/annotation-ns#>PREFIX dc:         <http://purl.org/dc/elements/1.1/>PREFIX foaf:       <http://xmlns.com/foaf/0.1/>_:a   a:annotates   <http://www.w3.org/TR/rdf-sparql-query/> ._:a   dc:creator    "Alice B. Toeclips" ._:b   a:annotates   <http://www.w3.org/TR/rdf-sparql-query/> ._:b   dc:creator    _:c ._:c   foaf:given    "Bob"._:c   foaf:family   "Smith".

This query matches the people with adc:creator which uses predicates from the FOAF vocabulary to express the name.

PREFIX a:      <http://www.w3.org/2000/10/annotation-ns#>PREFIX dc:     <http://purl.org/dc/elements/1.1/>PREFIX foaf:   <http://xmlns.com/foaf/0.1/>SELECT ?given ?familyWHERE {     ?annot  a:annotates  <http://www.w3.org/TR/rdf-sparql-query/> .    ?annot  dc:creator   ?c .    OPTIONAL { ?c  foaf:given   ?given ; foaf:family  ?family } .    FILTER isBLANK(?c)}

Query result:

given	family
"Bob"	"Smith"

In this example, there were two objects ofdc:creator predicates, but only one (_:c) was a blank node.

17.4.2.5isLITERAL

xsd:booleanisLITERAL (RDF termterm)

Returnstrue ifterm is aliteral. Returnsfalse otherwise.

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>                _:a  foaf:name       "Alice"._:a  foaf:mbox       <mailto:alice@work.example> ._:b  foaf:name       "Bob" ._:b  foaf:mbox       "bob@work.example" .

This query is similar to the one in17.4.2.1 except that is matches the people with aname and anmbox which is a literal. This could be used to look for erroneous data (foaf:mbox should only have an IRI as its object).

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE {    ?x foaf:name  ?name ;       foaf:mbox  ?mbox .    FILTER isLiteral(?mbox)}

Query result:

name	mbox
"Bob"	"bob@work.example"

17.4.2.6isNUMERIC

xsd:booleanisNUMERIC (RDF termterm)

Returnstrue ifterm is a numeric value. Returnsfalse otherwise.term is numeric if it has an appropriate datatype (see the sectionOperand Data Types) and has a valid lexical form, making it a valid argument to functions and operators taking numeric arguments.

Examples:

`isNUMERIC(12)`	true
`isNUMERIC("12")`	false
`isNUMERIC("12"^^xsd:nonNegativeInteger)`	true
`isNUMERIC("1200"^^xsd:byte)`	false
`isNUMERIC(<http://example/>)`	false

17.4.2.7STR

xsd:stringSTR (literalliteral)xsd:stringSTR (IRIrsrc)

Returns thelexical form ofliteral (aliteral); returns the codepoint representation ofrsrc (anIRI). This is useful for examining parts of an IRI, for instance, the host-name.

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>_:a  foaf:name       "Alice"._:a  foaf:mbox       <mailto:alice@work.example> ._:b  foaf:name       "Bob" ._:b  foaf:mbox       <mailto:bob@home.example> .

This query selects the set of people who use theirwork.example address in their foaf profile:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE {    ?x foaf:name  ?name ;      foaf:mbox  ?mbox .    FILTER regex(str(?mbox), "@work\\.example$")}

Query result:

name	mbox
"Alice"	<mailto:alice@work.example>

17.4.2.8LANG

xsd:stringLANG (literalltrl)

Returns thelanguage tag ofltrl, if it has one. It returns an empty string ifltrl has nolanguage tag. Note that the RDF data model does not include literals with an emptylanguage tag.

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>_:a  foaf:name       "Robert"@en._:a  foaf:name       "Roberto"@es._:a  foaf:mbox       <mailto:bob@work.example> .

This query finds the Spanishfoaf:name andfoaf:mbox:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?name ?mboxWHERE {    ?x foaf:name  ?name ;       foaf:mbox  ?mbox .    FILTER ( lang(?name) = "es" )}

Query result:

name	mbox
"Roberto"@es	<mailto:bob@work.example>

Function examples:

Expression	Result
`LANG("abc"@en)`	`"en"`
`LANG("abc"@en--ltr)`	`"en"`
`LANG("abc")`	`""`
`LANG(1)`	`""`
`LANG(<http://example/>)`	`error`

17.4.2.9LANGDIR

xsd:stringLANGDIR (literalltrl)

Returns thebase direction ofltrl, if it has one. It returns an empty string ifltrl has nobase direction. Note that the RDF data model does not include literals with an emptybase direction.

Expression	Result
`LANGDIR("abc"@en--ltr)`	`"ltr"`
`LANGDIR("abc"@en)`	`""`
`LANGDIR("abc")`	`""`
`LANGDIR(1)`	`""`
`LANGDIR(<http://example/>)`	`error`

17.4.2.10hasLANG

xsd:stringhasLANG (RDF termterm)

Returnstrue if the RDF term argument is a literal with alanguage tag. Otherwise, the function returnsfalse.

If the argument is a literal, the function is equivalent to testing for the datatype of the literal being eitherrdf:langString orrdf:dirLangString.

Expression	Result
`hasLANG("abc"@en)`	`true`
`hasLANG("abc@"en--ltr)`	`true`
`hasLANG("تصميم المواقع"@ar--rtl)`	`true`
`hasLANG(1)`	`false`
`hasLANG(<http://example/>)`	`false`

17.4.2.11hasLANGDIR

xsd:stringhasLANGDIR (RDF termterm)

Returnstrue if the RDF term argument is a literal with abase direction. Otherwise, the function returnsfalse.

If the argument is a literal, the function is equivalent to testing for the datatype of the literal beingrdf:dirLangString.

Expression	Result
`hasLANGDIR("abc"@en)`	`false`
`hasLANGDIR("abc@"en--ltr)`	`true`
`hasLANGDIR("تصميم المواقع"@ar--rtl)`	`true`
`hasLANGDIR(1)`	`false`
`hasLANGDIR(<http://example/>)`	`false`

17.4.2.12DATATYPE

iriDATATYPE (literalliteral)

Returns thedatatype IRI of the given literal.

Note

Thedatatype IRI of a literal with alanguage tag andnobase direction isrdf:langString.

Thedatatype IRI of a literal with alanguage tag and abase direction isrdf:dirLangString.

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>PREFIX eg:         <http://biometrics.example/ns#>PREFIX xsd:        <http://www.w3.org/2001/XMLSchema#>_:a  foaf:name       "Alice"._:a  eg:shoeSize     "9.5"^^xsd:float ._:b  foaf:name       "Bob"._:b  eg:shoeSize     "42"^^xsd:integer .

This query finds thefoaf:name andfoaf:shoeSize of everyone with a shoeSize that is an integer:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX xsd:  <http://www.w3.org/2001/XMLSchema#>PREFIX eg:   <http://biometrics.example/ns#>SELECT ?name ?shoeSizeWHERE {     ?x foaf:name  ?name ;       eg:shoeSize  ?shoeSize .    FILTER ( datatype(?shoeSize) = xsd:integer )}

Query result:

name	shoeSize
"Bob"	42

17.4.2.13IRI

iriIRI(xsd:string)iriIRI(iri)iriURI(xsd:string)iriURI(iri)

TheIRI function constructs an IRI by resolving the string argument (see [RFC3986] and [RFC3987] or any later RFC that superceeds RFC 3986 or RFC 3987). The IRI is resolved against the base IRI of the query and must result in an absolute IRI.

TheURI function is a synonym forIRI.

If the function is passed an IRI, it returns the IRI unchanged.

Passing any RDF term other than a literal with datatypexsd:string or an IRI is an error.

An implementationMAY normalize the IRI.

Examples:

`IRI("http://example/")`	<http://example/>
`IRI(<http://example/>)`	<http://example/>

17.4.2.14BNODE

blank nodeBNODE()

blank nodeBNODE(xsd:string)

TheBNODE function constructs a blank node that is distinct from all blank nodes in the dataset being queried and distinct from all blank nodes created by calls to this constructor for other query solutions. If the no argument form is used, every call results in a distinct blank node. If the form with anxsd:string literal is used, every call results in distinct blank nodes for differentxsd:string literals, and the same blank node for calls with the samexsd:string literal within expressions for onesolution mapping.

This functionality is compatible with thetreatment of blank nodes in SPARQL CONSTRUCT templates.

17.4.2.15STRDT

literalSTRDT(xsd:string lexicalForm,IRI datatypeIRI)

TheSTRDT function constructs a literal withlexical form anddatatype IRI as specified by the arguments.

Expression	Result
`STRDT("123", xsd:integer)`	"123"^^<http://www.w3.org/2001/XMLSchema#integer>
`STRDT("iiii", <http://example/romanNumeral>)`	"iiii"^^<http://example/romanNumeral>

Note

STRDT should not be called withdatatypeIRI argumentrdf:langString orrdf:dirLangString. To create literals with these IRIs as datatype IRI, functionSTRLANG orSTRLANGDIR should be used.

17.4.2.16STRLANG

literalSTRLANG(xsd:string lexicalForm,xsd:string langTag)

TheSTRLANG function constructs a literal withlexical form andlanguage tag, as specified by the arguments, and a datatype IRI ofrdf:langString.

The argumentlangTagMUST not be an empty string andSHOULD be a a validlanguage tag.

Expression	Result
`STRLANG("chat", "fr")`	"chat"@fr
`STRLANG("abc", "")`	error
`STRLANG(123, "en")`	error

17.4.2.17STRLANGDIR

literalSTRLANGDIR(xsd:string lexicalForm,xsd:string langTag,xsd:string baseDirection)

TheSTRLANGDIR function constructs a literal withlexical form,language tag andbase direction, as specified by the arguments, and a datatype IRI ofrdf:dirLangString.

The argumentlangTagMUST NOT be an empty string andSHOULD be a a validlanguage tag. The argumentbaseDirectionMUST be either"ltr" or"rtl".

Expression	Result
`STRLANGDIR("abc", "en", "ltr")`	`"abc"@en--ltr`
`STRLANGDIR("abc", "en", "LTR")`	error
`STRLANGDIR("قطة", "ar", "rtl")`	"قطة"@ar--rlt
`STRLANGDIR("abc", "en", "")`	error
`STRLANGDIR("abc", "", "ltr")`	error
`STRLANGDIR(123, "", "ltr")`	error
`STRLANGDIR(<x:uri>, "en", "ltr")`	error

17.4.2.18UUID

iriUUID()

Returns a fresh IRI from theA Universally Unique IDentifier (UUID) URN Namespace. Each call ofUUID() returns a different UUID. It must not be the "nil" UUID (all zeroes). The variant and version of the UUID is implementation dependent.

UUID() <urn:uuid:b9302fb5-642e-4d3b-af19-29a8f6d894c9>

17.4.2.19STRUUID

xsd:stringSTRUUID()

Returns a string that is the scheme-specific part of UUID. That is, as a literal with datatypexsd:string, the result of generating a UUID, converting to a literal with datatypexsd:string and removing the initialurn:uuid:.

STRUUID() "73cd4307-8a99-4691-a608-b5bda64fb6c1"

17.4.3Functions on Strings

Certain functions (e.g.,REGEX,STRLEN,CONTAINS) take astring literal as an argument. A string literal is one of

a literal with datatypexsd:string
a literal with datatyperdf:langString and withlanguage tag
a literal with datatyperdf:dirLangString and with both alanguage tag and abase direction

Use of any other RDF term will cause a call to the function to raise an error.

Note

"abc" is asimple literal syntactic shorthand for"abc"^^xsd:string.

The functionsSUBSTR,STRBEFORE,STRAFTER, andREPLACE return astring literal of the same kind as their first argument.

The functionsSTRSTARTS,STRENDS,CONTAINS,STRBEFORE,STRAFTER andCONCAT take two or more arguments. These arguments must beargument compatible; otherwise, invocation of the function raises an error.

Twostring literal arguments areargument compatible if:

The arguments are literals with datatypexsd:string
The arguments are literals with datatyperdf:langString and have the samelanguage tag
The arguments are literals with datatyperdf:dirLangString and have the samelanguage tag and the samebase direction
The first argument is a literal with datatyperdf:langString and the second argument is a literal with datatypexsd:string
The first argument is a literal with datatyperdf:dirLangString, and the second argument has datatypexsd:string

Argument1	Argument2	Compatible?
"abc"	"b"	yes
"abc"@en	"b"	yes
"abc"@en	"b"@en	yes
"abc"@fr	"b"@ja	no
"abc"	"b"@ja	no
"abc"	"b"@en--ltr	no
"abc"@en--ltr	"b"@en--ltr	yes
"abc"@en--ltr	"b"@en	no
"abc"@en--ltr	"z"	yes

17.4.3.1STRLEN

xsd:integerSTRLEN(string literal str)

Thestrlen function corresponds to the XPathfn:string-length function and returns anxsd:integer equal to the length in characters of thelexical form of the literal.

`strlen("chat")`	4
`strlen("chat"@en)`	4
`strlen("chat"@en--ltr)`	4
`strlen("chat"^^xsd:string)`	4

17.4.3.2SUBSTR

string literalSUBSTR(string literal source,xsd:integer startingLoc)string literalSUBSTR(string literal source,xsd:integer startingLoc,xsd:integer length)

Thesubstr function corresponds to the XPathfn:substring function and returns a literal of the same kind (literal with datatypexsd:string, literal with the same language tag, literal with the same language tag and base direction) as thesource input parameter but with alexical form derived from the substring of the lexical form of the source.

The argumentsstartingLoc andlength may be derived types of xsd:integer.

The index of the first character in a string is 1.

`substr("foobar", 4)`	"bar"
`substr("foobar"@en, 4)`	"bar"@en
`substr("foobar"^^xsd:string, 4)`	"bar"^^xsd:string
`substr("foobar", 4, 1)`	"b"
`substr("foobar"@en, 4, 1)`	"b"@en
`substr("foobar"^^xsd:string, 4, 1)`	"b"^^xsd:string

17.4.3.3UCASE

string literalUCASE(string literal str)

TheUCASE function corresponds to the XPathfn:upper-case function. It returns astring literal whose lexical form is the upper case of the lexical form of the argument.

`ucase("foo")`	"FOO"
`ucase("Foo"@en)`	"FOO"@en
`ucase("foo"@en--ltr)`	"FOO"@en--ltr
`ucase("foo"^^xsd:string)`	"FOO"^^xsd:string

17.4.3.4LCASE

string literalLCASE(string literal str)

TheLCASE function corresponds to the XPathfn:lower-case function. It returns a string literal whose lexical form is the lower case of the lexical form of the argument.

`lcase("BAR")`	"bar"
`lcase("Bar"@en)`	"bar"@en
`lcase("BAR"@en--ltr)`	"bar"@en--ltr
`lcase("BAR"^^xsd:string)`	"bar"^^xsd:string

17.4.3.5STRSTARTS

xsd:booleanSTRSTARTS(string literal arg1,string literal arg2)

TheSTRSTARTS function corresponds to the XPathfn:starts-with function. The arguments must beargument compatible, otherwise an error is raised.

For such input pairs, the function returns true if the lexical form ofarg1 starts with the lexical form ofarg2, otherwise it returns false.

`strStarts("foobar", "foo")`	true
`strStarts("foobar", "abc")`	false
`strStarts("foobar"@en, "foo"@en)`	true
`strStarts("foobar"^^xsd:string, "foo"^^xsd:string)`	true
`strStarts("foobar"^^xsd:string, "foo")`	true
`strStarts("foobar", "foo"^^xsd:string)`	true
`strStarts("foobar"@en, "foo")`	true
`strStarts("foobar"@en, "foo"^^xsd:string)`	true
`strStarts("foobar", "foo"@en)`	error

17.4.3.6STRENDS

xsd:booleanSTRENDS(string literal arg1,string literal arg2)

TheSTRENDS function corresponds to the XPathfn:ends-with function. The arguments must beargument compatible, otherwise an error is raised.

For such input pairs, the function returns true if the lexical form ofarg1 ends with the lexical form ofarg2, otherwise it returns false.

`strEnds("foobar", "bar")`	true
`strEnds("foobar", "abc")`	false
`strEnds("foobar"@en, "bar"@en)`	true
`strEnds("foobar"^^xsd:string, "bar"^^xsd:string)`	true
`strEnds("foobar"^^xsd:string, "bar")`	true
`strEnds("foobar", "bar"^^xsd:string)`	true
`strEnds("foobar"@en, "bar")`	true
`strEnds("foobar"@en, "bar"^^xsd:string)`	true
`strEnds("foobar"@en, "bar"@en)`	error

17.4.3.7CONTAINS

xsd:booleanCONTAINS(string literal arg1,string literal arg2)

TheCONTAINS function corresponds to the XPathfn:contains. The arguments must beargument compatible, otherwise an error is raised.

`contains("foobar", "bar")`	true
`contains("foobar"@en, "foo"@en)`	true
`contains("foobar"^^xsd:string, "bar"^^xsd:string)`	true
`contains("foobar"^^xsd:string, "foo")`	true
`contains("foobar", "bar"^^xsd:string)`	true
`contains("foobar"@en, "foo")`	true
`contains("foobar"@en, "bar"^^xsd:string)`	true
`contains("foobar", "bar"@en)`	error

17.4.3.8STRBEFORE

literalSTRBEFORE(string literal arg1,string literal arg2)

TheSTRBEFORE function corresponds to the XPathfn:substring-before function. The arguments must beargument compatible, otherwise an error is raised.

For compatible arguments, if the lexical part of the second argument occurs as a substring of the lexical part of the first argument, the function returns a literal of the same kind as the first argumentarg1 (literal with datatypexsd:string, literal with the same language tag). The lexical form of the result is the substring of the lexical form ofarg1 that precedes the first occurrence of the lexical form ofarg2. If the lexical form ofarg2 is the empty string, this is considered to be a match and the lexical form of the result is the empty string.

If there is no such occurrence, an empty literal with datatypexsd:string is returned.

`strBefore("abc","b")`	`"a"`
`strBefore("abc"@en,"bc")`	`"a"@en`
`strBefore("abc"@en,"b"@cy)`	error
`strBefore("abc"^^xsd:string,"")`	`""^^xsd:string`
`strBefore("abc","xyz")`	`""`
`strBefore("abc"@en, "z"@en)`	`""`
`strBefore("abc"@en, "z")`	`""`
`strBefore("abc"@en, ""@en)`	`""@en`
`strBefore("abc"@en, "")`	`""@en`

17.4.3.9STRAFTER

literalSTRAFTER(string literal arg1,string literal arg2)

TheSTRAFTER function corresponds to the XPathfn:substring-after function. The arguments must beargument compatible, otherwise an error is raised.

For compatible arguments, if the lexical part of the second argument occurs as a substring of the lexical part of the first argument, the function returns a literal of the same kind as the first argumentarg1 (literal with datatypexsd:string, literal with the same language tag). The lexical form of the result is the substring of the lexical form ofarg1 that follows the first occurrence of the lexical form ofarg2. If the lexical form ofarg2 is the empty string, this is considered to be a match and the lexical form of the result is the lexical form ofarg1.

If there is no such occurrence, an empty literal with datatypexsd:string is returned.

`strAfter("abc","b")`	`"c"
`strAfter("abc"@en,"ab")`	`"c"@en`
`strAfter("abc"@en,"b"@cy)`	error
`strAfter("abc"^^xsd:string,"")`	`"abc"^^xsd:string`
`strAfter("abc","xyz")`	`""`
`strAfter("abc"@en, "z"@en)`	`""`
`strAfter("abc"@en, "z")`	`""`
`strAfter("abc"@en, ""@en)`	`"abc"@en`
`strAfter("abc"@en, "")`	`"abc"@en`

17.4.3.10CONCAT

string literalCONCAT(string literal, ...,string literal)

TheCONCAT function takes zero or more arguments. The arguments must be pair-wiseargument compatible, otherwise an error is raised.

If zero arguments are given, the result is an empty string of datatypexsd:string.

If one argument is given, the result is that argument value.

If two or more arguments are given, the function returns astring literal such that thelexical form of the resulting string literal is obtained by concatenating the lexical forms of the arguments of the function using thefn:concat function. If all input literals are literals with the samelanguage tag and samebase direction then the returned string literal is a literal with that language tag and base direction. If all input literals are literals with the samelanguage tag, but not all the samebase direction, the returned literal is a literal with that language tag and no base direction.

`concat("foo", "bar")`	"foobar"
`concat("foo"@en, "bar"@en)`	"foobar"@en
`concat("foo", "bar")`	"foobar"
`concat("foo"@en, "bar")`	"foobar"
`concat("foo"@en, "bar"@es)`	"foobar"
`concat("abc")`	"abc"
`concat("abc"@en)`	"abc"@en
`concat()`	""

17.4.3.11langMATCHES

xsd:booleanlangMatches (xsd:stringlanguage-tag,xsd:stringlanguage-range)

Returnstrue if thelanguage tag (first argument) matcheslanguage-range (second argument). Otherwise, the function returnsfalse. Matching is performed according to the basic filtering scheme defined in [RFC4647] section 3.3.1.language-range is a basic language range perMatching of Language Tags [RFC4647] section 2.1. Alanguage-range of "*" matches any non-emptylanguage-tag string. Otherwise, the function returnsfalse.

PREFIX dc:       <http://purl.org/dc/elements/1.1/>_:a  dc:title         "That Seventies Show"@en ._:a  dc:title         "Cette Série des Années Soixante-dix"@fr ._:a  dc:title         "Cette Série des Années Septante"@fr-BE ._:b  dc:title         "Il Buono, il Bruto, il Cattivo" .

This query useslangMatches andlang to find the French titles for the show known in English as "That Seventies Show":

PREFIX dc: <http://purl.org/dc/elements/1.1/>SELECT ?titleWHERE {    ?x dc:title  "That Seventies Show"@en ;       dc:title  ?title .    FILTER langMatches( lang(?title), "FR" )}

Query result:

title
"Cette Série des Années Soixante-dix"@fr
"Cette Série des Années Septante"@fr-BE

The idiomlangMatches( lang( ?v ), "*" ) will not match literals without a language tag aslang( ?v ) will return an empty string, so

PREFIX dc: <http://purl.org/dc/elements/1.1/>SELECT ?titleWHERE {    ?x dc:title  ?title .    FILTER langMatches( lang(?title), "*" )}

will report all of the titles with a language tag:

title
"That Seventies Show"@en
"Cette Série des Années Soixante-dix"@fr
"Cette Série des Années Septante"@fr-BE

17.4.3.12REGEX

xsd:booleanREGEX (string literaltext,xsd:stringpattern)xsd:booleanREGEX (string literaltext,xsd:stringpattern,xsd:stringflags)

Invokes the XPathfn:matches function to matchtext against a regular expressionpattern. The regular expression language is defined in XQuery 1.0 and XPath 2.0 Functions and Operators section7.6.1 Regular Expression Syntax [XPATH-FUNCTIONS-31].

PREFIX foaf:       <http://xmlns.com/foaf/0.1/>_:a  foaf:name       "Alice"._:b  foaf:name       "Bob" .

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?nameWHERE {     ?x foaf:name  ?name    FILTER regex(?name, "^ali", "i")}

Query result:

name
"Alice"

17.4.3.13REPLACE

string literalREPLACE (string literal arg,xsd:string pattern,xsd:string replacement )string literalREPLACE (string literal arg,xsd:string pattern,xsd:string replacement,xsd:string flags)

TheREPLACE function corresponds to the XPathfn:replace function. It replaces each non-overlapping occurrence of the regular expressionpattern with the replacement string. Regular expession matching may involve modifier flags. SeeREGEX.

replace("abcd", "b", "Z")	"aZcd"
replace("abab", "B", "Z","i")	"aZaZ"
replace("abab", "B.", "Z","i")	"aZb"

17.4.3.14ENCODE_FOR_URI

xsd:stringENCODE_FOR_URI(string literal ltrl)

TheENCODE_FOR_URI function corresponds to the XPathfn:encode-for-uri function. It returns a literal with datatypexsd:string with the lexical form obtained from the lexical form of its input after translating reserved characters according to thefn:encode-for-uri function.

`encode_for_uri("Los Angeles")`	`"Los%20Angeles"`
`encode_for_uri("Los Angeles"@en)`	`"Los%20Angeles"`
`encode_for_uri("Los Angeles"^^xsd:string)`	`"Los%20Angeles"`

17.4.4Functions on Numerics

17.4.4.1ABS

numericABS (numericterm)

Returns the absolute value ofarg. An error is raised ifarg is not a numeric value.

This function is the same asfn:numeric-abs for terms with a datatype fromXDM.

`ABS(1)`	`1`
`ABS(-1.5)`	`1.5`

17.4.4.2ROUND

numericROUND (numericterm)

Returns the number with no fractional part that is closest to the argument. If there are two such numbers, then the one that is closest to positive infinity is returned. An error is raised ifarg is not a numeric value.

This function is the same asfn:numeric-round for terms with a datatype fromXDM.

`ROUND(2.4999)`	`2.0`
`ROUND(2.5)`	`3.0`
`ROUND(-2.5)`	`-2.0`

17.4.4.3CEIL

numericCEIL (numericterm)

Returns the smallest (closest to negative infinity) number with no fractional part that is not less than the value ofarg. An error is raised ifarg is not a numeric value.

This function is the same asfn:numeric-ceil for terms with a datatype fromXDM.

`CEIL(10.5)`	`11.0`
`CEIL(-10.5)`	`-10.0`

17.4.4.4FLOOR

numericFLOOR (numericterm)

Returns the largest (closest to positive infinity) number with no fractional part that is not greater than the value ofarg. An error is raised ifarg is not a numeric value.

This function is the same asfn:numeric-floor for terms with a datatype fromXDM.

`FLOOR(10.5)`	`10.0`
`FLOOR(-10.5)`	`-11.0`

17.4.4.5RAND

xsd:doubleRAND ( )

Returns a pseudo-random number between 0 (inclusive) and 1.0e0 (exclusive). Different numbers can be produced every time this function is invoked. Numbers should be produced with approximately equal probability.

rand() "0.31221030831984886"^^xsd:double

17.4.5Functions on Dates and Times

17.4.5.1NOW

xsd:dateTimeNOW ()

Returns an XSD dateTime value for the current query execution. All calls to this function in any one query execution must return the same value. The exact moment returned is not specified.

NOW() "2011-01-10T14:45:13.815-05:00"^^xsd:dateTime

17.4.5.2YEAR

xsd:integerYEAR (xsd:dateTimearg)

Returns the year part ofarg as an integer.

This function corresponds tofn:year-from-dateTime.

YEAR("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime) 2011

17.4.5.3MONTH

xsd:integerMONTH (xsd:dateTimearg)

Returns the month part ofarg as an integer.

This function corresponds tofn:month-from-dateTime.

MONTH("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime) 1

17.4.5.4DAY

xsd:integerDAY (xsd:dateTimearg)

Returns the day part ofarg as an integer.

This function corresponds tofn:day-from-dateTime.

day("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime) 10

17.4.5.5HOURS

xsd:integerHOURS (xsd:dateTimearg)

Returns the hours part ofarg as an integer. The value is as given in the lexical form of the XSD dateTime.

This function corresponds tofn:hours-from-dateTime.

HOURS("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime) 14

17.4.5.6MINUTES

xsd:integerMINUTES (xsd:dateTimearg)

Returns the minutes part of the lexical form ofarg. The value is as given in the lexical form of the XSD dateTime.

This function corresponds tofn:minutes-from-dateTime.

MINUTES("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime) 45

17.4.5.7SECONDS

xsd:decimalSECONDS (xsd:dateTimearg)

Returns the seconds part of the lexical form ofarg.

This function corresponds tofn:seconds-from-dateTime.

SECONDS("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime) 13.815

17.4.5.8TIMEZONE

xsd:dayTimeDurationTIMEZONE (xsd:dateTimearg)

Returns the timezone part ofarg as an xsd:dayTimeDuration. Raises an error if there is no timezone.

This function corresponds tofn:timezone-from-dateTime except for the treatment of literals with no timezone.

`TIMEZONE("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime)`	`"-PT5H"^^xsd:dayTimeDuration`
`TIMEZONE("2011-01-10T14:45:13.815Z"^^xsd:dateTime)`	`"PT0S"^^xsd:dayTimeDuration`
`TIMEZONE("2011-01-10T14:45:13.815"^^xsd:dateTime)`	error

17.4.5.9TZ

xsd:stringTZ (xsd:dateTimearg)

Returns the timezone part ofarg as a literal with datatypexsd:string. Returns the empty string if there is no timezone.

`TZ("2011-01-10T14:45:13.815-05:00"^^xsd:dateTime)`	`"-05:00"`
`TZ("2011-01-10T14:45:13.815Z"^^xsd:dateTime)`	`"Z"`
`TZ("2011-01-10T14:45:13.815"^^xsd:dateTime)`	`""`

17.4.6Functions on Triple Terms

17.4.6.1TRIPLE

triple termTRIPLE (RDF termsubj,RDF termpred,RDF termobj)

<<( subj pred obj)>>

If the 3-tuple (subj,pred,obj) is anRDF triple (that is,subj is anIRI orblank node;pred is anIRI; andobj is anIRI,triple term,blank node orliteral) the function returns a triple term with these three elements. Otherwise, the function raises an error.

As a shorthand notation, theTRIPLE function can also be written in the form of atriple term expression using<<( and)>>. There is a syntax limitation to this shorthand form: the three elements of the triple term expression can only be variables and directly written RDF terms, not arbitrary expressions. In contrast, the function form,TRIPLE, can be used with arbitrary expressions.

VERSION "1.2"PREFIX : <http://example/>PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>SELECT ?s ?date {    ?s ?p ?o .    BIND( <<( ?s ?p ?o )>> AS ?tt )    :myreifier rdf:reifies ?tt .    :myreifier :tripleAdded ?date .}

VERSION "1.2"PREFIX : <http://example/>PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>SELECT ?s ?date {    ?s ?p ?o .    BIND( TRIPLE(?s, ?p, ?o) AS ?tt )    :myreifier rdf:reifies ?tt .    :myreifier :tripleAdded ?date .}

17.4.6.2SUBJECT

RDF termSUBJECT (triple termtriple-term)

If the argument is atriple term, the function returns thesubject of the triple term. If the argument is not atriple term, an error is raised.

17.4.6.3PREDICATE

RDF termPREDICATE (triple termtriple-term)

If the argument is atriple term, the function returns thepredicate of the triple term. If the argument is not atriple term, an error is raised.

17.4.6.4OBJECT

RDF termOBJECT (triple termtriple-term)

If the argument is atriple term, the function returns theobject of the triple term. If the argument is not atriple term, an error is raised.

17.4.6.5isTRIPLE

xsd:booleanisTRIPLE (RDF termterm)

If the argument is atriple term, the function returns true. If the argument is any other kind ofRDF term, the function returns false.

17.4.7Hash Functions

17.4.7.1MD5

xsd:stringMD5 (xsd:stringarg)

Returns the MD5 checksum, as a hex digit string, calculated on the lexical form of thexsd:string. Hex digitsSHOULD be in lower case.

MD5("abc") "900150983cd24fb0d6963f7d28e17f72"

17.4.7.2SHA1

xsd:stringSHA1 (xsd:stringarg)

Returns the SHA1 checksum, as a hex digit string, calculated on the lexical form of thexsd:string. Hex digitsSHOULD be in lower case.

SHA1("abc") "a9993e364706816aba3e25717850c26c9cd0d89d"

17.4.7.3SHA256

xsd:stringSHA256 (xsd:stringarg)

Returns the SHA256 checksum, as a hex digit string, calculated on the lexical form of thexsd:string. Hex digitsSHOULD be in lower case.

SHA256("abc") "ba7816bf8f01cfea414140de5dae2223b00361a396177a9cb410ff61f20015ad"

17.4.7.4SHA384

xsd:stringSHA384 (xsd:stringarg)

Returns the SHA384 checksum, as a hex digit string, calculated on the lexical form of thexsd:string. Hex digitsSHOULD be in lower case.

SHA384("abc") "cb00753f45a35e8bb5a03d699ac65007272c32ab0eded1631a8b605a43ff5bed8086072ba1e7cc2358baeca134c825a7"

17.4.7.5SHA512

xsd:stringSHA512 (xsd:stringarg)

Returns the SHA512 checksum, as a hex digit string, calculated on the lexical form of thexsd:string. Hex digitsSHOULD be in lower case.

SHA512("abc") "ddaf35a193617abacc417349ae20413112e6fa4e89a97ea20a9eeee64b55d39a2192992a274fc1a836ba3c23a3feebbd454d4423643ce80e2a9ac94fa54ca49f"

17.5XPath Constructor Functions

SPARQL imports a subset of the XPath constructor functions defined inXPath and XQuery Functions and Operators 3.1 [XPATH-FUNCTIONS-31] in section19.1 Casting from primitive types to primitive types. SPARQL constructors include all of the XPath constructors for theSPARQL operand datatypes plus theadditional datatypes imposed by the RDF data model. Casting in SPARQL is performed by calling a constructor function for the target type on an operand of the source type.

XPath defines only the casts from one XML Schema datatype to another. The remaining cast is defined as follows:

Casting anIRI to anxsd:string produces aliteral with a lexical value of the codepoints comprising the IRI, and a datatype ofxsd:string.

The table below summarizes the casting operations that are always allowed (Y), never allowed (N) and dependent on the lexical value (M). For example, a casting operation from anxsd:string (the first row) to anxsd:float (the second column) is dependent on the lexical value (M).

bool =xsd:boolean
dbl =xsd:double
flt =xsd:float
dec =xsd:decimal
int =xsd:integer
dT =xsd:dateTime
str =xsd:string
IRI =IRI

From \ To	str	flt	dbl	dec	int	dT	bool
str	Y	M	M	M	M	M	M
flt	Y	Y	Y	M	M	N	Y
dbl	Y	Y	Y	M	M	N	Y
dec	Y	Y	Y	Y	Y	N	Y
int	Y	Y	Y	Y	Y	N	Y
dT	Y	N	N	N	N	Y	N
bool	Y	Y	Y	Y	Y	N	Y
IRI	Y	N	N	N	N	N	N

17.6Extensible Value Testing

It should be noted that any function or operator that is specified to return an error under some conditions is a valid extension point. That is, an implementation may return a non-error value in these error cases, and still be conformant with this recommendation.

APrimaryExpression grammar rule can be a call to an extension function named by an IRI. An extension function takes some number of RDF terms as arguments and returns an RDF term. The semantics of these functions are identified by the IRI that identifies the function.

SPARQL queries using extension functions are likely to have limited interoperability.

As an example, consider a function calledfunc:even:

xsd:booleanfunc:even (numericvalue)

This function would be invoked in a FILTER as such:

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX func: <http://example.org/functions#>SELECT ?name ?idWHERE {     ?x foaf:name  ?name ;       func:empId   ?id .    FILTER (func:even(?id))}

For a second example, consider a functionaGeo:distance that calculates the distance between two points, which is used here to find the places near Grenoble:

xsd:doubleaGeo:distance (numericx1,numericy1,numericx2,numericy2)

PREFIX aGeo: <http://example.org/geo#>SELECT ?neighborWHERE {    ?a aGeo:placeName "Grenoble" .    ?a aGeo:locationX ?axLoc .    ?a aGeo:locationY ?ayLoc .    ?b aGeo:placeName ?neighbor .    ?b aGeo:locationX ?bxLoc .    ?b aGeo:locationY ?byLoc .    FILTER ( aGeo:distance(?axLoc, ?ayLoc, ?bxLoc, ?byLoc) < 10 ) .}

An extension function might be used to test some application datatype not supported by the core SPARQL specification, it might be a transformation between datatype formats, for example into an XSD dateTime RDF term from another date format.

18.Definition of SPARQL

This section defines the correct behavior for evaluation of graph patterns and solution modifiers, given a query string and an RDF dataset. It does not imply a SPARQL implementation must use the process defined here.

The outcome of executing a SPARQL query is defined by a series of steps, starting from the SPARQL query as a string, turning that string into an abstract syntax form, then turning the abstract syntax into a SPARQL abstract query comprising operators from the SPARQL algebra. This abstract query is then evaluated on an RDF dataset.

18.1Initial Definitions

18.1.1RDF Dataset

The concept of anRDF Dataset is defined in [RDF12-CONCEPTS].

For the following definitions, we capture each RDF dataset as a set:

{ G, (<u₁>, G₁), (<u₂>, G₂), ... (<u_n>, G_n) } where G and each G_i are graphs, and each <u_i> is an IRI or blank node. Each <u_i> is distinct.

G is called the default graph. (<u_i>, G_i) are called named graphs.

Definition:Active Graph

Theactive graph is the graph from the dataset used for basic graph pattern matching.

18.1.2Query Variables

Definition:Query Variable

We assume a countably infinite setV that is disjoint from the set of allRDF terms. Every member of this setV is aquery variable.

18.1.3Triple Patterns

Definition:Triple Pattern

Atriple pattern is a 3-tuple (s,p,o) where:

s is anRDF term or avariable,
p is anIRI or avariable, and
o is anRDF term or avariable.

This definition of Triple Pattern includes literal subjects. This has been noted by RDF-core.

"[The RDF core Working Group] noted that it is aware of no reason why literals should            not be subjects anda future WG witha less restrictive charter may            extend the syntaxesto allow literals as the subjects of statements."

Because RDF graphs may not contain literal subjects, any SPARQL triple pattern with a literal as subject will fail to match on any RDF graph.

18.1.4Basic Graph Patterns

Definition:Basic Graph Pattern

ABasic Graph Pattern is a set ofTriple Patterns.

The empty graph pattern is a basic graph pattern which is the empty set.

18.1.5Property Path Patterns

Definition:Property Path

A Property Path is a sequence of triples, t_i in sequence ST, with n = length(ST)-1, such that, for i=0 to n, the object of t_i is the same term as the subject of t_i+1.

We call the subject of t₀ the start of the path.

We call the object of t_n the end of the path.

A Property Path is a path in graph G if each t_i is a triple of G.

A property path does not span multiple graphs in a dataset.

Definition:Property Path Expression

A property path expression is an expression using the property path forms described above.

Definition:Property Path Pattern

A property path pattern is a 3-tuple (s,p,o) where:

s is anRDF term or avariable,
p is aproperty path expression, and
o is anRDF term or avariable.

A Property Path Pattern is a generalization of aTriple Pattern to include a property path expression in the predicate position.

18.1.6Solution Mapping

A solution mapping is a mapping from a set of variables to a set of RDF terms. We use the term 'solution' where it is clear.

Definition:Solution Mapping

Asolution mapping,μ, is a partial functionμ :V →T, whereV is the set of allvariables andT is the set of allRDF terms.

The domain ofμ, denoted by dom(μ), is the subset ofV for whichμ is defined.

Definition:Solution Sequence

Asolution sequence is a list of solutions, possibly unordered.

Write expr(μ) for the value of the expression expr, using the terms for variables given by μ. Evaluation may result in an error.

18.1.7Solution Sequence Modifiers

Definition:Solution Sequence Modifier

Asolution sequence modifier is one of:

Order By modifier: put the solutions in order
Projection modifier: choose certain variables
Distinct modifier: ensure solutions in the sequence are unique
Reduced modifier: permit any non-distinct solutions to be eliminated
Offset modifier: control where the solutions start from in the overall sequence of solutions
Limit modifier: restrict the number of solutions

18.1.8SPARQL Query

Definition:SPARQL Query

ASPARQL Abstract Query is a tuple (E, DS, QF) where:

E is aSPARQL algebra expression
DS is anRDF Dataset [RDF12-CONCEPTS]
QF is aquery form

Definition:Query Level

A query level is a graph pattern, a set of group and aggregation, and a set of solution modifiers.

A query is a tree of "query levels", where eachsubquery forms one query level in the tree.

18.2Algebraic Syntax

To define the evaluation semantics of a SPARQL query, the abstract syntax tree of the SPARQL query string (as defined by theSPARQL grammar) is first translated into a syntax that resembles theSPARQL algebra. This section defines the expressions that can be formed in this algebraic syntax, and the translation of SPARQL query strings into this algebraic syntax is then defined in Section 18.3Translation to the SPARQL Algebra.

Analgebraic query expression is defined recursively as follows:

Abasic graph pattern is an algebraic query expression.
A multiset ofsolution mappings is an algebraic query expression.
A sequence ofsolution mappings is an algebraic query expression.
Issue 231: Keep both sequences and multisets of solution mappings as algebraic expressions?
Do we really need both of the previous two points?
Path(x,ppe,y) is an algebraic query expression ifppe is analgebraic property path expression,x is anRDF term or avariable, andy is anRDF term or avariable.
Union(A₁,A₂) is an algebraic query expression ifA₁ andA₂ are algebraic query expressions.
Join(A₁,A₂) is an algebraic query expression ifA₁ andA₂ are algebraic query expressions.
Minus(A₁,A₂) is an algebraic query expression ifA₁ andA₂ are algebraic query expressions.
LeftJoin(A₁,A₂,expr) is an algebraic query expression ifA₁ andA₂ are algebraic query expressions andexpr is anexpression.
Filter(expr,A) is an algebraic query expression ifexpr is anexpression andA is an algebraic query expression.
Extend(A,var,expr) is an algebraic query expression ifA is an algebraic query expression,var is avariable, andexpr is anexpression.
Graph(x,A) is an algebraic query expression ifx is anIRI or avariable andA is an algebraic query expression.
ToMultiset(A) is an algebraic query expression ifA is an algebraic query expression.
ToList(A) is an algebraic query expression ifA is an algebraic query expression.
Group(exprlist,A) is an algebraic query expression ifexprlist is a nonempty sequence ofexpressions andA is an algebraic query expression.
Aggregation(exprlist,func,scalarvals,grp) is an algebraic query expression ifexprlist is a nonempty sequence ofexpressions or the asterisk character (*),func is aset function,scalarvals is a partial function (which may be the empty function, i.e., with an empty domain), andgrp is an algebraic query expression of the formGroup(exprlist',A) whereexprlist' is a sequence ofexpressions andA is an algebraic query expression.
Issue 230: 'Group' and 'Aggregation' should not be operators of the algebraic syntax
Group andAggregation should not be independent operators but, instead, should be integrated into the definition ofAggregateJoin.
AggregateJoin(A₁, ...,A_n) is an algebraic query expression ifA₁, ...,A_n is a non-empty sequence of algebraic query expressions (i.e.,n ≥ 1) such that every expressionA_i in this sequence is of the formAggregation(exprlist_i,func_i,scalarvals_i,grp_i) that is captured bythe previous point.
OrderBy(A,condition) is an algebraic query expression ifA is an algebraic query expression andcondition is an ordering condition.
Issue 229: Notion of "ordering condition" undefined
The term "ordering condition" should link to some definition of this notion.
Project(A,PV) is an algebraic query expression ifA is an algebraic query expression andPV is a set ofvariables.
Distinct(A) is an algebraic query expression ifA is an algebraic query expression.
Reduced(A) is an algebraic query expression ifA is an algebraic query expression.
Slice(A,start,length) is an algebraic query expression ifA is an algebraic query expression andstart andlength are integers.

The notion of analgebraic property path expression, as used in the previous definition, is defined recursively as follows:

link(iri) is an algebraic property path expression ifiri is anIRI.
NPS(I) is an algebraic property path expression ifI is a nonempty set ofIRIs. NPS is an acronym fornegated property set.
seq(ppe₁,ppe₂) is an algebraic property path expression ifppe₁ andppe₂ are algebraic property path expressions.
alt(ppe₁,ppe₂) is an algebraic property path expression ifppe₁ andppe₂ are algebraic property path expressions.
inv(ppe) is an algebraic property path expression ifppe is an algebraic property path expression.
ZeroOrOnePath(ppe) is an algebraic property path expression ifppe is an algebraic property path expression.
ZeroOrMorePath(ppe) is an algebraic property path expression ifppe is an algebraic property path expression.
OneOrMorePath(ppe) is an algebraic property path expression ifppe is an algebraic property path expression.

18.3Translation to the SPARQL Algebra

This section defines the process of converting graph patterns and solution modifiers in a SPARQL query string into a SPARQL algebra expression. The process described converts one level of query nesting, as formed by subqueries using the nestedSELECT syntax and is applied recursively on subqueries. Each level consists of graph pattern matching and filtering, followed by the application of solution modifiers.

The SPARQL query string is parsed and the abbreviations for IRIs and triple patterns given insection 4 are applied. At this point the abstract syntax tree is composed of:

Patterns	Modifiers	Query Forms	Other
RDF terms	DISTINCT	SELECT	VALUES
Property path expression	REDUCED	CONSTRUCT	SERVICE
Property path patterns	Projection	DESCRIBE
Groups	ORDER BY	ASK
OPTIONAL	LIMIT
UNION	OFFSET
GRAPH	Select expressions
BIND
GROUP BY
HAVING
MINUS
FILTER

The result of converting such an abstract syntax tree is a SPARQL query that uses the following symbols in the SPARQL algebra:

Graph Pattern	Solution Modifiers	Property Path
BGP	ToList	PredicatePath
Join	OrderBy	InversePath
LeftJoin	Project	SequencePath
Filter	Distinct	AlernativePath
Union	Reduced	ZeroOrMorePath
Graph	Slice	OneOrMorePath
Extend	ToMultiSet	ZeroOrOnePath
Minus		NegatedPropertySet
Group
Aggregation
AggregateJoin

Slice is the combination of OFFSET and LIMIT.

ToList is used where conversion from the results of graph pattern matching to sequences occurs.

ToMultiSet is used where conversion from a solution sequence to a multiset occurs.

18.3.1Variable Scope

We define a variable to bein-scope if there is a way for a variable to be in the domain of a solution mapping at that point in the execution of the SPARQL algebra for the query. The definition below provides a way of determing this from the abstract syntax of a query.

Note that a subquery with a projection can hide variables; use of a variable inFILTER, or inMINUS does not cause a variable to be in-scope outside of those forms.

LetP,P1,P2 be graph patterns andE,E1,...En be expressions. A variablev is in-scope if:

Syntax Form	In-scope variables
Basic Graph Pattern (BGP)	`v` occurs in the BGP
Path	`v` occurs in the path
Group`{ P1 P2 ... }`	`v` is in-scope if it is in-scope in one or more of P1, P2, ...
`GRAPH term { P }`	`v` is`term` or`v` is in-scope in P
`{ P1 } UNION { P2 }`	`v` is in-scope in P1 or in-scope in P2
`OPTIONAL {P}`	`v` is in-scope in P
`SERVICE term {P}`	`v` is`term` or`v` is in-scope in P
`BIND (expr AS v)`	`v` is in-scope
`SELECT .. v .. { P }`	`v` is in-scope
`SELECT ... (expr AS v)`	`v` is in-scope
`GROUP BY (expr AS v)`	`v` is in-scope
`SELECT * { P }`	`v` is in-scope in`P`
`VALUES v { values }`	`v` is in-scope
`VALUES varlist { values }`	`v` is in-scope if`v` is in`varlist`

The variablev must not be in-scope at the point of the(expr AS v) form. The scoping for(expr AS v) applies immediately inSELECT expressions.

InBIND (expr AS v) requires that the variablev is not in-scope from the preceeding elements in the group graph pattern in which it is used.

InSELECT, the variablev must not be in-scope in the graph pattern of theSELECT clause, nor used in another select expression earlier in the clause.

18.3.2Converting Graph Patterns

This section describes the process for translating a SPARQL graph pattern into a SPARQL algebra expression. This process is applied to the group graph pattern (the unit between{...} delimiters) forming theWHERE clause of a query, and recursively to each syntactic element within the group graph pattern. The result of the translation is a SPARQL algebra expression.

In summary, the steps are applied as follows:

Expand syntax forms for IRIs, literals and triple patterns.
Translate property path expressions
Convert some property path patterns to triples
Collect theFILTERs in the group
Translate Basic Graph Patterns
Translate the remaining graph patterns in the group
Add in Filters
Simplify the algebra expression

We write

translate(graph pattern)

for the algorthm described here to translate graph patterns.

The working group notes that in SPARQL 1.0, the point at which the simplification step is applied leads to ambiguous transformation of queries involving a doubly nested filter and pattern in an optional:

OPTIONAL { { ...FILTER ( ... ?x ... ) } }..

This is illustrated by two non-normative test cases:

Simplification applied after all transformations or not at all.
Simplification applied during transformation.

Applying the simpification step after all the translation of graph patterns is the preferred reading.

18.3.2.1Expand Syntax Forms

Expand abbreviations for IRIs and triple patterns given insection 4.

18.3.2.2Collect`FILTER` Elements

FILTER expressions apply to the whole group graph pattern in which they appear. The algebra operators to perform filtering are added to the group after translation of each group element. We collect the filters together here and remove them from group, thenapply them to the whole translated group graph pattern.

In this step, we also translate graph patterns withinFILTER expressionsEXISTS andNOT EXISTS.

Let FS := empty setFor each form FILTER(expr) in the group graph pattern    In expr, replace NOT EXISTS{P} with fn:not(exists(translate(P)))     In expr, replace EXISTS{P} with exists(translate(P))    FS := FS ∪ {expr}    End

The set of filter expressionsFS isused later.

18.3.2.3Translate Property Path Expressions

The following table gives the translation ofproperty path expressions in SPARQL query strings intoalgebraic property path expressions. This applies to all elements of aproperty path expression recursively.

Thenext step after this one translates certain forms to triple patterns, and these are converted later to basic graph patterns by adjacency (without intervening group pattern delimiters{ and}) or other syntax forms. Overall, SPARQL syntax property paths of just an IRI become triple patterns and these are aggregated into basic graph patterns.

Notes:

The order of forms IRI and ^IRI in a negated property set (NPS) is not relevant.

Syntax Form (path)	Algebraic Form (path)
`iri`	`link(iri)`
`^path`	`inv(path)`
`!(:iri₁\|...\|:iri_n)`	`NPS({:iri₁ ... :iri_n})`
`!(^:iri₁\|...\|^:iri_n)`	`inv(NPS({:iri₁ ... :iri_n}) )`
`!(:iri₁\|...\|:iri_i\|^:iri_i+1\|...\|^:iri_m)`	`alt(NPS({:iri₁ ...:iri_i}), inv(NPS({:iri_i+1, ..., :iri_m})) )`
`path1 / path2`	`seq(path1, path2)`
`path1 \| path2`	`alt(path1, path2)`
`path*`	`ZeroOrMorePath(path)`
`path+`	`OneOrMorePath(path)`
`path?`	`ZeroOrOnePath(path)`

18.3.2.4Translate Property Path Patterns

The previous step translatedproperty path expressions. This step translatesproperty path patterns, which are a subject end point, a property path expression, and an object end point. This step assumes that the property path expression of the property path pattern is already given in the form of analgebraic property path expression. The result of this step may be triple patterns andalgebraic query expressions of the formPath(...).

Notes:

x andy areRDF terms orvariables.
var is a freshvariable.
ppe,ppe₁, andppe₂ arealgebraic property path expressions.
These are only applied to property path patterns, not within property path expressions.
Translations earlier in the table are applied in preference to the last translation.
The final translation simply wraps any remaining property path expression to use a common formPath(...).

Algebraic Form (path)	Translation
`x`link(`iri`)`y`	`xiriy`
`x`inv(`iri`)`y`	`yirix`
`x`seq(`ppe₁`,`ppe₂`)`y`	`xppe₁var` .`varppe₂y`
`xppey`	Path(`x`,`ppe`,`y`)

Issue 226: Translation of property path patterns incomplete

I assume that the translation rules in this table are meant to be applied recursively, but the section doesn't say anything about that. In particular, for the "xseq(ppe₁,ppe₂)y" case:ppe₁ andppe₂ in the resulting translation may still be arbitrary algebraic property path expressions and, thus, the translation should be applied again for each of the two resulting property path patterns (i.e., for "xppe₁var" and for "varppe₂y").

Examples of the whole path translation process (?_V is a fresh variable):

?s :p/:q ?o

?s :p ?_V .
?_V :q ?o

?s :p* ?o

Path(?s,ZeroOrMorePath(link(:p)), ?o)

:list rdf:rest*/rdf:first ?member

Path(:list,ZeroOrMorePath(link(rdf:rest)), ?_V) .
?_V rdf:first ?member

18.3.2.5Translate Basic Graph Patterns

After translating property paths, any adjacent triple patterns are collected together to form a basic graph patternBGP(triples).

18.3.2.6Translate Graph Patterns

Next, we translate each remaining graph pattern form, recursively applying the translation process.

If the form isGroupOrUnionGraphPattern

Let A := undefined          For each element G in the GroupOrUnionGraphPattern    If A is undefined        A := Translate(G)    Else        A := Union(A, Translate(G))    EndThe result is A

If the form isGraphGraphPattern

If the form is GRAPH IRI GroupGraphPattern    The result is Graph(IRI, Translate(GroupGraphPattern))

If the form is GRAPH Var GroupGraphPattern    The result is Graph(Var, Translate(GroupGraphPattern))

If the form isGroupGraphPattern:

Let FS := the empty setLet G := the empty pattern, a basic graph pattern which is the empty set.For each element E in the sequence of elements in the GroupGraphPattern    If E is of the form OPTIONAL{P}         Let A := Translate(P)        If A is of the form Filter(F, A2)            G := LeftJoin(G, A2, F)        Else             G := LeftJoin(G, A, true)            End        End    If E is of the form MINUS{P}        G := Minus(G, Translate(P))        End    If E is of the form BIND(expr AS var)        G := Extend(G, var, expr)        End    If E is any other form         Let A := Translate(E)        G := Join(G, A)        End   End   The result is G.

If the form isInlineData

The result is a multiset of solution mappings 'data'.

data is formed by forming a solution mapping from the variable in the corresponding position in list of variables (or single variable), omitting a binding if theDataBlockValue is the wordUNDEF.

If the form isSubSelect

The result is ToMultiset(Translate(SubSelect))

18.3.2.7Filters of Group

After the group has been translated, the filter expressions are added so they wil apply to the whole of the rest of the group:

If FS is not empty    Let G := output of preceding step    Let X := Conjunction of expressions in FS    G := Filter(X, G)End

18.3.2.8Simplification step

Some groups of one graph pattern becomejoin(Z, A), where Z is the empty basic graph pattern (which is the empty set). These can be replaced by A. The empty graph pattern Z is the identity for join:

Replace join(Z, A) by AReplace join(A, Z) by A

18.3.3Examples of Mapped Graph Patterns

The second form of a rewrite example is the first with empty group joins removed by the simplification step.

Example: group with a basic graph pattern consisting of a single triple pattern:

{ ?s ?p ?o }

Join(Z, BGP(?s ?p ?o) )

BGP(?s ?p ?o)

Example: group with a basic graph pattern consisting of two triple patterns:

{ ?s :p1 ?v1 ; :p2 ?v2 }

BGP( ?s :p1 ?v1 . ?s :p2 ?v2 )

Example: group consisting of a union of two basic graph patterns:

{ { ?s :p1 ?v1 } UNION {?s :p2 ?v2 } }

Union(Join(Z, BGP(?s :p1 ?v1)),
Join(Z, BGP(?s :p2 ?v2)) )

Union( BGP(?s :p1 ?v1) , BGP(?s :p2 ?v2) )

Example: group consisting of a union of a union and a basic graph pattern:

{ { ?s :p1 ?v1 } UNION {?s :p2 ?v2 } UNION {?s :p3 ?v3 } }

Union(
    Union( Join(Z, BGP(?s :p1 ?v1)),
           Join(Z, BGP(?s :p2 ?v2))) ,
    Join(Z, BGP(?s :p3 ?v3)) )

Union(
    Union( BGP(?s :p1 ?v1) ,
           BGP(?s :p2 ?v2),
    BGP(?s :p3 ?v3))

Example: group consisting of a basic graph pattern and an optional graph pattern:

{ ?s :p1 ?v1OPTIONAL {?s :p2 ?v2 } }

LeftJoin(
    Join(Z, BGP(?s :p1 ?v1)),
    Join(Z, BGP(?s :p2 ?v2)),
    true)

LeftJoin(BGP(?s :p1 ?v1), BGP(?s :p2 ?v2), true)

Example: group consisting of a basic graph pattern and two optional graph patterns:

{ ?s :p1 ?v1OPTIONAL {?s :p2 ?v2 }OPTIONAL { ?s :p3 ?v3 } }

LeftJoin(
    LeftJoin(
        BGP(?s :p1 ?v1),
        BGP(?s :p2 ?v2),
        true) ,
    BGP(?s :p3 ?v3),
    true)

Example: group consisting of a basic graph pattern and an optional graph pattern with a filter:

{ ?s :p1 ?v1OPTIONAL {?s :p2 ?v2 FILTER(?v1<3) } }

LeftJoin(
     Join(Z, BGP(?s :p1 ?v1)),
     Join(Z, BGP(?s :p2 ?v2)),
     (?v1<3) )

LeftJoin(
    BGP(?s :p1 ?v1) ,
    BGP(?s :p2 ?v2) ,
   (?v1<3) )

Example: group consisting of a union graph pattern and an optional graph pattern:

{ {?s :p1 ?v1} UNION {?s :p2 ?v2}OPTIONAL {?s :p3 ?v3} }

LeftJoin(
Union(BGP(?s :p1 ?v1),
BGP(?s :p2 ?v2)) ,
BGP(?s :p3 ?v3) ,
true )

Example: group consisting of a basic graph pattern, a filter and an optional graph pattern:

{ ?s :p1 ?v1 FILTER (?v1 < 3 )OPTIONAL {?s :p2 ?v2} }

Filter( ?v1 < 3 ,
LeftJoin( BGP(?s :p1 ?v1), BGP(?s :p2 ?v2), true) ,
)

Example: Pattern involving BIND:

{ ?s :p ?v . BIND (2*?v AS ?v2) ?s :p1 ?v2 }

Join(
Extend( BGP(?s :p ?v), ?v2, 2*?v) ,
BGP(?s :p1 ?v2) )

Example: Pattern involving BIND, with asimplification step:

{ ?s :p ?v . {} BIND (2*?v AS ?v2) }

Extend(
   Join(
     Join( {}, BGP(?s :p ?v)),
     {}),
   ?v2, 2*?v
)

Extend(
BGP(?s :p ?v) ,
?v2, 2*?v
)

Example: Pattern involving MINUS:

{ ?s :p ?v . MINUS {?s :p1 ?v2 } }

Minus(
BGP(?s :p ?v)
BGP(?s :p1 ?v2)
)

Example: Pattern involving a subquery:

{ ?s :p ?o . {SELECT DISTINCT ?o {?o ?p ?z} } }

Join(
   BGP(?s :p ?o) ,
   ToMultiSet(
     Distinct(
       Project( ToList(BGP(?o ?p ?z)), {?o} )
     )
   )
)

18.3.4Converting Groups, Aggregates, HAVING, final VALUES clause and SELECT Expressions

In this step, we process clauses on the query level in the following order:

Grouping
Aggregates
HAVING
VALUES
Select expressions

18.3.4.1Grouping and Aggregation

Step: GROUP BY

If theGROUP BY keyword is used, or there is implicit grouping due to the use of aggregates in the projection, then grouping is performed by theGroup function. In this case, before grouping, the solution set is converted into a solution sequence by applying theToList function. Next, theGroup function divides this solution sequence into groups of one or more solutions, with the same overall cardinality. In case of implicit grouping, a fixed constant (1) is used to group all solutions into a single group.

Step: Aggregates

The aggregation step is applied as a transformation on the query level, replacing aggregate expressions in the query level with Aggregation() algebraic expressions.

The transformation for query levels that use any aggregates is given below:

Let A := the empty sequenceLet Q := the query level being evaluatedLet P := the algebra translation of the GroupGraphPattern of the query levelLet E := [], a list of pairs of the form (variable, expression)If Q contains GROUP BY exprlist   Let Grp := Group(exprlist, ToList(P))Else If Q contains an aggregate in SELECT, HAVING, ORDER BY   Let Grp := Group((1), ToList(P))Else   skip the rest of the aggregate step   EndGlobal i := 1   # Initially 1 for each query processedFor each (X AS Var) in SELECT, each HAVING(X), and each ORDER BY X in Q  For each unaggregated variable V in X      Replace V with Sample(V)      End  For each aggregate R(args ; scalarvals) now in X      # note: scalarvals may be omitted; if so, it's equivalent to the empty function      Ai := Aggregation(args, R, scalarvals, Grp)      Replace R(...) with aggi in Q      i := i + 1      End  EndFor each variable V appearing outside of an aggregate   Ai := Aggregation(V, Sample, {}, Grp)   E := E append (V, aggi)   i := i + 1   EndA := Ai, ..., Ai-1P := AggregateJoin(A)

18.3.4.2HAVING

The HAVING expression is evaluated using the same rules as FILTER(). Note that, due to the logic position in which the HAVING clause is evaluated, expressions projected by the SELECT clause are not visible to the HAVING clause.

Let Q := the query level being evaluatedLet P := the algebra translation of the query level so farFor each HAVING(E) in Q    P := Filter(E, P)    End

18.3.4.3VALUES

If the query has a trailing VALUES clause:

Let P := the algebra translation of the query level so farP := Join(P, ToMultiSet(data))  where data is a solution sequence derived from the VALUES clause

The translatation of the data is the same as forinline data.

18.3.4.4SELECT Expressions

Step: Select expressions

We have two forms of the abstract syntax to consider:

SELECT selItem ... { pattern }SELECT * { pattern }

Let X := algebra from earlier stepsLet VS := list of all variables visible in the pattern,           so restricted by sub-SELECT projected variables and GROUP BY variables.           Not visible: only in filter, exists/not exists, masked by a subselect,                         non-projected GROUP variables, only in the right hand side of MINUSLet PV := {}, a set of variable namesNote, E is a list of pairs of the form (variable, expression), defined in section 18.2.4.  If "SELECT *"    PV := VSIf  "SELECT selItem ..."    For each selItem        If selItem is a variable            PV := PV ∪ { variable }        End        If selItem is (expr AS variable)            variable must not appear in VS nor in PV; if it does then generate a syntax error and stop            PV := PV ∪ { variable }            E := E append (variable, expr)         End    EndFor each pair (var, expr) in E    X := Extend(X, var, expr)    End  Result is X  The set PV is used later for projection.

The syntax error arises for use of a variable as the named target of AS (e.g. ... AS ?x) when the variable is used inside the WHERE clause of the SELECT or if already used as the target of AS in this SELECT expression.

18.3.5Converting Solution Modifiers

Solution modifiers apply to the processing of a SPARQL query after pattern matching.

Since the solution modifiers operate on sequences of solution mappings, the query result produced up to this point is first turned from a multiset of solution mappings into such a sequence. While there is no implied ordering to this sequence, and duplicates need not be adjacent, the sequence is identical to the multiset in terms of the elements that it contains, and their multiplicities. To apply this conversion from a multiset into a sequence, the algorithm for capturing the solution modifiers in the algebra expression begins with the following step, wherePattern is the algebra expression produced by the algorithm in the previous section.

Let M := ToList(Pattern)

Now, the solution modifiers are applied in the following order:

Order by
Projection
Distinct
Reduced
Offset
Limit

18.3.5.1ORDER BY

If the query string has an ORDER BY clause

M := OrderBy(M, list of order comparators)

18.3.5.2Projection

The set of projection variables,PV, was calculated in theprocessing of SELECT expressions.

M := Project(M, PV)

where vars is the set of variables mentioned in the SELECT clause or all named variables that arein-scope in the query if SELECT * used.

18.3.5.3DISTINCT

If the query contains DISTINCT,

M := Distinct(M)

18.3.5.4REDUCED

If the query contains REDUCED,

M := Reduced(M)

18.3.5.5OFFSET and LIMIT

If the query contains "OFFSET start" or "LIMIT length"

M := Slice(M, start, length)
start defaults to 0
length defaults to (size(M)-start).

18.3.5.6Final Algebra Expression

The overall abstract query is M.

18.4Basic Graph Patterns

When matching graph patterns, the possible solutions form amultiset, also known as abag. A multiset is an unordered collection of elements in which each element may appear more than once. It is described by a set of elements and a function giving the multiplicity of each of these elements (i.e., the number of times the element is contained in the multiset).

Writeμ for solution mappings.

Writeμ₀ for the mapping such thatdom(μ₀) is the empty set.

WriteΩ₀ for the multiset consisting of exactly the empty mappingμ₀, with multiplicity 1. This is the join identity.

Writeμ(x) for the solution mapping variablex to RDF termt : { (x,t) }.

Definition:Compatible Mappings

Two solution mappings μ₁ and μ₂ are compatible if, for every variable v in dom(μ₁) and in dom(μ₂), μ₁(v) = μ₂(v).

Here, μ₁(v) = μ₂(v) means that μ₁(v) and μ₂(v) are the same RDF term.

If μ₁ and μ₂ are compatible then μ₁ ∪ μ₂ is also a mapping. Write merge(μ₁, μ₂) for μ₁ ∪ μ₂

Definition:Multiplicity

Given a multiset Ω of solution mappings and a solution mapping μ, we writemultiplicity(μ | Ω) to denote the number of timesμ appears inΩ.

Similarly, given a solution sequence Ψ and a solution mapping μ, we writemultiplicity(μ | Ψ) to denote the number of timesμ appears inΨ.

18.4.1SPARQL Basic Graph Pattern Matching

A basic graph pattern is matched against the active graph for that part of the query. Basic graph patterns can be instantiated by replacing both variables and blank nodes by terms, giving two notions of instance. Blank nodes are replaced using anRDF instance mapping, σ, from blank nodes to RDF terms; variables are replaced by a solution mapping from query variables to RDF terms.

Definition:Pattern Instance Mapping

APattern Instance Mapping, P, is the combination of an RDF instance mapping, σ, and solution mapping, μ. P(x) = μ(σ(x))

For a BGP 'x', P(x) denotes the result of replacing blank nodes b in x for which σ is defined with σ(b) and all variables v in x for which μ is defined with μ(v).

Any pattern instance mapping defines a unique solution mapping and a unique RDF instance mapping obtained by restricting it to query variables and blank nodes respectively.

Definition: Basic Graph Pattern Matching

Let BGP be a basic graph pattern and let G be an RDF graph.

μ is asolution for BGP from G when there is a pattern instance mapping P such that P(BGP) is a subgraph of G and μ is the restriction of P to the query variables in BGP.

multiplicity( μ | Ω ) = number of distinct RDF instance mappings, σ, such that P = μ(σ) is a pattern instance mapping and P(BGP) is a subgraph of G.

If a basic graph pattern is the empty set, then the solution is Ω₀.

18.4.2Treatment of Blank Nodes

This definition allows the solution mapping to bind a variable in a basic graph pattern, BGP, to a blank node in G. Since SPARQL treats blank node identifiers in a results format document (SPARQL Query Results XML Format (Second Edition),SPARQL 1.1 Query Results JSON Format andSPARQL 1.1 Query Results CSV and TSV Formats) as scoped to the document, they cannot be understood as identifying nodes in the active graph of the dataset. If DS is the dataset of a query, pattern solutions are therefore understood to be not from the active graph of DS itself, but from an RDF graph, called thescoping graph, which is graph-equivalent to the active graph of DS but shares no blank nodes with DS or with BGP. The same scoping graph is used for all solutions to a single query. The scoping graph is purely a theoretical construct; in practice, the effect is obtained simply by the document scope conventions for blank node identifiers.

Since RDF blank nodes allow infinitely many redundant solutions for many patterns, there can be infinitely many pattern solutions (obtained by replacing blank nodes by different blank nodes). It is necessary, therefore, to somehow delimit the solutions for a basic graph pattern. SPARQL uses the subgraph match criterion to determine the solutions of a basic graph pattern. There is one solution for each distinct pattern instance mapping from the basic graph pattern to a subset of the active graph.

This is optimized for ease of computation rather than redundancy elimination. It allows query results to contain redundancies even when the active graph of the dataset islean, and it allows logically equivalent datasets to yield different query results.

18.5Property Path Patterns

This section defines the evaluation ofproperty path patterns. A property path pattern is a subject endpoint (an RDF term or a variable), a property path express and an object endpoint. Thetranslation of property path expressions converts some forms to other SPARQL expressions, such as converting property paths of length one to triple patterns, which in turn are combined into basic graph patterns. This leaves property path operators ZeroOrOnePath, ZeroOrMorePath, OneOrMorePath and NegatedPropertySets and also path expressions contained within these operators.

All remaining property path expressions are present in the algebra in the formPath(X, path, Y) for endpoints X and Y. For example: syntax(:p/:q)* is a ZeroOrMorePath expression involving a sequence property path becoming the algebra expessionZeroOrMorePath(seq(link(:p), link(:q))).

Notation

Write

eval(Path(X, PP,Y))

for the evaluation of the property path patterns. This produces a multiset of solution mappings μ, each solution mapping having a binding for variables used (each of X and Y can be a variable). Some operators only produce a set of solution mappings.

Write

Var(x1, x2, ..., xn) = { xi | i in 1...n and xi is a variable}

for the variables inx₁, x₂, ..., x_n.

Write

`x:term`	when`x` is an RDF term
`x:var`	when`x` is a variable
`x:path`	when`x` is a path expression

All evaluation is carried out by matching theactive graph at that point in the overall query evaluation. We omit explicitly including the active graph in each definition for clarity.

Definition:Evaluation of Predicate Property Path

Let Path(X, link(iri), Y) be an predicate inverse property path pattern, using some IRI iri.

eval(Path(X,link(iri), Y)) = evaluationof basic graph pattern {X iri Y}

If both X and Y are variables, this is the same as:

eval(Path(X:var,link(iri),Y:var)) = { (X, xn) (Y, yn) | xn and yn areRDF terms andtriple (xn iri yn) isin the active graph }

If X is a variable and Y an RDF term:

eval(Path(X:var,link(iri),Y:term)) = { (X, xn) | xn is anRDF term andtriple (xn iri Y) isin the active graph}

If X is an RDF term and Y is a variable:

eval(Path(X:term,link(iri),Y:var)) = { (Y, yn) | yn is anRDF term andtriple (X iri yn) isin the active graph}

If both X and Y are RDF terms:

eval(Path(X:term,link(iri),Y:term))     = { μ0 }iftriple (X iri Y) isin the active graph    = { { } } = Ω0eval(Path(X:term,link(iri),Y:term)) = { }iftriple (X iri Y) is notin the active graph

Informally, evaluating a Predicate Property Path is the same as executing a subquerySELECT * {X P Y } at that point in the query evaluation.

Definition:Evaluation of Inverse Property Path

Let P be a property path expression, then:

eval(Path(X,inv(P), Y)) =eval(Path(Y, P, X))

Definition:Evaluation of Sequence Property Path

Let P and Q be property path expressions. Let V be a fresh variable.

A =Join(eval(Path(X, P, V)),eval(Path(V, Q, Y)) )

eval(Path(X,seq(P,Q), Y)) =Project(A,Var(X,Y))

Informally, this is the same as:

SELECT * {XP _:a . _:a Q Y }

using the fact that a blank node_:a acts like a variable (under simple entailment) except it does not appear in the results fromSELECT *.

Definition:Evaluation of Alternative Property Path

Let P and Q be property path expressions.

eval(Path(X,alt(P,Q), Y)) =Union(eval(Path(X, P, Y)),eval(Path(X, Q, Y)))

Informally, this is the same as:

SELECT * { {XPY } UNION {XQY } }

Definition:Node set of a graph

The node set of a graph G, nodes(G), is:

nodes(G) = { n | n is an RDF term that is used as a subject or object of a triple of G}

Definition:Evaluation of ZeroOrOnePath

eval(Path(X:term,ZeroOrOnePath(P),Y:var)) =     { (Y, yn) | yn = X or {(Y, yn)}ineval(Path(X,P,Y)) }

eval(Path(X:var,ZeroOrOnePath(P),Y:term)) =    { (X, xn) | xn = Y or {(X, xn)}ineval(Path(X,P,Y)) }

eval(Path(X:term,ZeroOrOnePath(P),Y:term)) =     { {} }if X = Y oreval(Path(X,P,Y)) is not empty    { } othewise

eval(Path(X:var,ZeroOrOnePath(P),Y:var)) =     { (X, xn) (Y, yn) |either (yninnodes(G) and xn = yn) or {(X,xn), (Y,yn)}ineval(Path(X,P,Y)) }

We define an auxillary function, ALP, used in the definitions of ZeroOrMorePath and OneOrMorePath. Note that the algorithm given here serves to specify the feature. An implementation is free to implement evaluation by any method that produces the same results for the query overall. The ZeroOrMorePath and OneOrMorePath forms return matches based on distinct nodes connected by the path.

The matching algorithm is based on following all paths, and detecting when a graph node (subject or object), has been already visited on the path.

Informally, this algorithm attempts to extend the multiset of results by one application ofpath at each step, noting which nodes it has visited for this particular path. If a node has been visited for the path under consideration, it is not a candidate for another step.

Definition:Function ALP

Let eval(x:term, path) be the evaluation of'path', starting at RDF term x,  and returning a multiset of RDF terms reached  by repeated matches of path.ALP(x:term, path) =       Let V = empty setALP(x:term, path, V)      return is V  # V is the set of nodes visitedALP(x:term, path, V:set of RDF terms) =      if ( x in V ) return       add x to V      X =eval(x,path)       For n:term in XALP(n, path, V)          End

Definition: Evaluation of

ZeroOrMorePath

eval(Path(X:term,ZeroOrMorePath(path),vy:var)) =    { { (vy, n) } | ninALP(X, path) }eval(Path(vx:var,ZeroOrMorePath(path),vy:var)) =    { { (vx, t), (vy, n) } |  tinnodes(G), (vy, n)ineval(Path(t,ZeroOrMorePath(path), vy)) }eval(Path(vx:var,ZeroOrMorePath(path),y:term)) =eval(Path(y:term,ZeroOrMorePath(inv(path)),vx:var))eval(Path(x:term,ZeroOrMorePath(path),y:term)) =     { { } }if { (vy:var,y) }ineval(Path(x,ZeroOrMorePath(path) vy)    { } otherwise

Definition: Evaluation of

OneOrMorePath

eval(Path(X, OneOrMorePath(path), Y))

# For OneOrMorePath, we take one step of the path then start# recording nodes for results.eval(Path(x:term,OneOrMorePath(path),vy:var))=LetX=eval(x,path)LetV=theemptymultisetForninXALP(n,path,V)EndresultisVeval(Path(vx:var,OneOrMorePath(path),vy:var))=     { {(vx,t),(vy,n) }|tinnodes(G),(vy,n)ineval(Path(t,OneOrMorePath(path),vy)) }eval(Path(vx:var,OneOrMorePath(path),y:term))=eval(Path(y:term,OneOrMorePath(inv(path)),vx))eval(Path(x:term,OneOrMorePath(path),y:term))=    { { } }if {(vy:var,y) }ineval(Path(x,OneOrMorePath(path),vy))    { }otherwise

Definition:Evaluation of NegatedPropertySet

Write μ' as the extension of a solution mapping:μ'(μ,x) = μ(x)   if x is a variableμ'(μ,t) = t      if t is a RDF term

Let x and y be variables or RDF terms, and S a set of IRIs:   eval(Path(x, NPS(S), y)) = { μ | ∃ triple(μ'(μ,x), p, μ'(μ,y)) in G, such that the IRI of p ∉ S }

18.6SPARQL Algebra

For each remaining symbol in a SPARQL abstract query, we define an operator for evaluation. The SPARQL algebra operators of the same name are used to evaluate SPARQL abstract query nodes as described in the section "Evaluation Semantics". Evaluation of basic graph patterns and property path patterns has been described above.

Definition:Filter

LetΩ be a multiset of solution mappings andexpr be an expression. We define:

Filter(expr,Ω) = {μ |μ inΩ andexpr(μ) is an expression that has an effective boolean value of true }

multiplicity(μ |Filter(expr,Ω) ) =multiplicity(μ |Ω )

Note that evaluating anexists(pattern) expression uses the dataset and active graph, D(G). See theevaluation of filter.

Definition:Join

LetΩ₁ andΩ₂ be multisets of solution mappings. We define:

Join(Ω₁,Ω₂) = { merge(μ₁,μ₂) |μ₁ inΩ₁ andμ₂ inΩ₂, andμ₁ andμ₂ arecompatible }

multiplicity(μ |Join(Ω₁,Ω₂) ) =
for each merge(μ₁,μ₂),μ₁ inΩ₁ andμ₂ inΩ₂ such thatμ = merge(μ₁,μ₂),
sum over (μ₁,μ₂),multiplicity(μ₁ |Ω₁ ) *multiplicity(μ₂ |Ω₂ )

It is possible that a solution mappingμ in a Join can arise in different solution mappings,μ₁ andμ₂ in the multisets being joined. The multiplicity ofμ is the sum of the multiplicities from all possibilities.

Definition:Diff

LetΩ₁ andΩ₂ be multisets of solution mappings andexpr be an expression. We define:

Diff(Ω₁,Ω₂,expr) = {μ |μ inΩ₁ such that ∀μ' inΩ₂, eitherμ andμ' are notcompatible orμ andμ' arecompatible andexpr(merge(μ,μ')) does not have an effective boolean value of true }

multiplicity(μ |Diff(Ω₁,Ω₂,expr) ) =multiplicity(μ |Ω₁ )

The evaluation ofexpr(merge(μ,μ')) does not have an effective boolean value of true if it evaluates to false or if it raises an error.

Diff is used internally for the definition ofLeftJoin.

Definition:LeftJoin

LetΩ₁ andΩ₂ be multisets of solution mappings andexpr be an expression. We define:

LeftJoin(Ω₁,Ω₂,expr) =Filter(expr,Join(Ω₁,Ω₂)) ∪Diff(Ω₁,Ω₂,expr)

multiplicity(μ |LeftJoin(Ω₁,Ω₂,expr) ) =multiplicity(μ |Filter(expr,Join(Ω₁,Ω₂)) ) +multiplicity(μ |Diff(Ω₁,Ω₂,expr) )

Definition:Union

LetΩ₁ andΩ₂ be multisets of solution mappings. We define:

Union(Ω₁,Ω₂) = {μ |μ inΩ₁ orμ inΩ₂ }

multiplicity(μ |Union(Ω₁,Ω₂) ) =multiplicity(μ |Ω₁ ) +multiplicity(μ |Ω₂ )

Definition:Minus

LetΩ₁ andΩ₂ be multisets of solution mappings. We define:

Minus(Ω₁,Ω₂) = {μ |μ inΩ₁ . ∀μ' inΩ₂, eitherμ andμ' are notcompatible or dom(μ) and dom(μ') are disjoint }

multiplicity(μ |Minus(Ω₁,Ω₂) ) =multiplicity(μ |Ω₁ )

The additional restriction on dom(μ) and dom(μ') is added because otherwise if there is a solution mapping inΩ₂ that has no variables in common with the solution mappings ofΩ₁, thenMinus(Ω₁,Ω₂) would be empty, regardless of the rest ofΩ₂. The empty solution mapping is compatible with every other solution mapping soP MINUS {} would otherwise be empty for any patternP.

Definition:Extend

Letμ be a solution mapping,Ω a multiset of solution mappings,var a variable andexpr be anexpression, then we define:

Extend(μ,var,expr) =μ ∪ { (var,value) |var not in dom(μ) andvalue =expr(μ) }

Extend(μ,var,expr) =μ ifvar not in dom(μ) and expr(μ) is an error

Extend is undefined ifvar in dom(μ).

Extend(Ω,var,expr) = {Extend(μ,var,expr) |μ inΩ }

Write [x |C ] for a sequence of elements whereC is a condition onx.

Definition:ToList

LetΩ be a multiset of solution mappings. We define:

ToList(Ω) = a sequence of mappingsμ inΩ in any order, withmultiplicity(μ |Ω ) occurrences ofμ

multiplicity(μ |ToList(Ω) ) =multiplicity(μ |Ω )

Definition:OrderBy

LetΨ be a sequence of solution mappings. We define:

OrderBy(Ψ, condition) = [μ |μ inΨ and the sequence satisfies the ordering condition]

multiplicity(μ |OrderBy(Ψ, condition) ) =multiplicity(μ |Ψ )

Definition:Project

LetΨ be a sequence of solution mappings andPV a set of variables.

For mappingμ, write Proj(μ,PV) to be the restriction ofμ to variables inPV.

Project(Ψ,PV) = [ Proj(μ,PV) |μ inΨ ]

multiplicity(μ |Project(Ψ,PV) ) = sum(multiplicity(μ' |Ψ ) |μ' inΨ such thatμ' = Proj(μ,PV) )

The order ofProject(Ψ,PV) must preserve any ordering given byOrderBy.

Definition:Distinct

LetΨ be a sequence of solution mappings. We define:

Distinct(Ψ) = [μ |μ inΨ ]

multiplicity(μ |Distinct(Ψ) ) = 1 for everyμ ∈Distinct(Ψ)

multiplicity(μ |Distinct(Ψ) ) = 0 for everyμ ∉Distinct(Ψ)

The order ofDistinct(Ψ) must preserve any ordering given byOrderBy.

Definition:Reduced

LetΨ be a sequence of solution mappings. We define:

Reduced(Ψ) = [μ |μ inΨ ]

multiplicity(μ |Reduced(Ψ) ) is between 1 andmultiplicity(μ |Ψ ) for everyμ ∈Reduced(Ψ)

multiplicity(μ |Reduced(Ψ) ) = 0 for everyμ ∉Reduced(Ψ)

The order ofReduced(Ψ) must preserve any ordering given byOrderBy.

TheReduced solution sequence modifier does not guarantee a defined multiplicity.

Definition:Slice

LetΨ be a sequence of solution mappings. We define:

Slice(Ψ,start,length)[i] =Ψ[start+i] fori = 0 to (length-1)

Definition:ToMultiSet

LetΨ be a solution sequence. We define:

ToMultiSet(Ψ) = {μ |μ inΨ }

multiplicity(μ |ToMultiSet(Ψ) ) =multiplicity(μ |Ψ )

Definition:Exists

exists(pattern) is a function that returns true if the patternevaluates to a non-empty solution sequence, given the current solution mapping and active graph at the time of evaluation; otherwise it returns false.

18.6.1Aggregate Algebra

Group is a function which groups a solution sequence into multiple solutions, based on some attribute of the solutions.

Definition: Group

Group evaluates a list of expressions against a solution sequenceΨ, producing a partial function from keys to solution sequences.

Group(exprlist,Ψ) = {ListEval(exprlist,μ) → [μ' |μ' inΨ such thatListEval(exprlist,μ') andListEval(exprlist,μ) are the same ] |μ inΨ },

where two listsL andL' (as produced by theListEval function) are considered the same iff they have the same number of elements and, for every position k within the two lists, either of the following two conditions is true:

the element at thek-th position ofL is an RDF term; the element at thek-th position ofL' is also an RDF term; and these two RDF terms are thesame term
the element at thek-th position ofL is an error, and the element at thek-th position ofL' is also an error

Definition:ListEval

ListEval((expr₁, ...,expr_n),μ) returns a list (e₁, ...,e_n), wheree_i =expr_i(μ) or error.

ListEval retains errors resulting from the evaluation of the list elements.

Note that, although the result ofListEval may contain errors, and errors may be used to group, solutions containing error values are removed at the end of evaluating the group and any aggregation functions.

Note also that the result ofListEval((unbound),μ) is the list (error), as the evaluation of an unbound expression is an error.

Aggregation, a function which calculates a scalar value as an output of the aggregate expression. It is used in the SELECT clause, the HAVING evaluation process, and in ORDER BY (where required).Aggregation calculates aggregated values over groups of solutions, using set functions.

Definition: Aggregation

Letexprlist be a list of expressions or*;func, a set function;scalarvals, a partial function (possibly with an empty domain) passed from the aggregate in the query; and {key₁→Ψ₁, ...,key_m→Ψ_m }, a partial function from keys to solution sequences as produced by the grouping step.

Aggregation applies the set functionfunc to the given set and produces a single value for each key and a group of solutions for that key.

Aggregation(exprlist,func,scalarvals, {key₁→Ψ₁, ...,key_m→Ψ_m } )
= { (key,F(Ψ)) |key →Ψ in {key₁→Ψ₁, ...,key_m→Ψ_m } }

where
  M(Ψ) = [ListEval(exprlist,μ) |μ inΨ ]
  F(Ψ) =func(M(Ψ),scalarvals), for non-DISTINCT
  F(Ψ) =func(Dedup(M(Ψ)),scalarvals), forDISTINCT

with Dedup(M(Ψ)) being an order-preserving, duplicate-free version of the sequence M(Ψ); that is, Dedup(M(Ψ)) is a sequence of lists that has the following four properties (where each such list in this sequence may contain RDF terms and errors, as it is produced by theListEval function).

For every list L in M(Ψ) there exists a list L' in Dedup(M(Ψ)) such thatL andL' are the same, where two listsL andL' from M(Ψ) are considered the same as specified in the definition of theGroup operator.
For every list L in Dedup(M(Ψ)) there exists a list L' in M(Ψ) such thatL andL' are the same.
Dedup(M(Ψ)) is free of duplicates. That is, the list at thei-th position in Dedup(M(Ψ)) is not the same list as the list at thej-th position in Dedup(M(Ψ)) for every two natural numbersi andj such thati ≠j.
For any two listsL₁ andL₂ in Dedup(M(Ψ)), the relative order of their first occurrences in M(Ψ) is preserved in Dedup(M(Ψ)). That is, ifi₁ < i₂, thenj₁ < j₂, where
- i₁ is the smallest natural number such thatL₁ is at thei₁-th position in M(Ψ),
- i₂ is the smallest natural number such thatL₂ is at thei₂-th position in M(Ψ),
- j₁ is the position ofL₁ in Dedup(M(Ψ)), and
- j₂ is the position ofL₂ in Dedup(M(Ψ)).

Special Case: whenCOUNT is used with the expression*, then F(Ψ) is the cardinality of the group solution sequence, i.e., F(Ψ) = Card(Ψ), or F(Ψ) = Card(Distinct(Ψ)) if theDISTINCT keyword is present.

scalarvals are used to pass values to the underlying set function, bypassing the mechanics of the grouping. For example, the aggregate expressionGROUP_CONCAT(?x ; separator="|") has ascalarvals argument of { "separator" → "|" }.

All aggregates may have theDISTINCT keyword as the first token in their argument list. If this keyword is present, then first argument tofunc is Dedup(M(Ψ)).

Example

Given a solution sequenceΨ with the following values:

solution	?x	?y	?z
`μ₁`	1	2	3
`μ₂`	1	3	4
`μ₃`	2	5	6

And the query expression SELECT (ex:agg(?y, ?z) AS ?agg) WHERE { ?x ?y ?z } GROUP BY ?x.

We produceG =Group((?x),Ψ) = { (1) → [μ₁,μ₂], (2) → [μ₃] }

And soAggregation((?y, ?z), ex:agg, {},G) =
{ ((1), eg:agg([(2, 3), (3, 4)], {})), ((2), eg:agg([(5, 6)], {})) }.

Definition: AggregateJoin

LetS₁, ...,S_n be a list of sets, where each setS_i contains key to (aggregated) value maps as produced byAggregation.

LetK = {key |key in dom(S_j) for some 1 ≤j ≤n } be the set of keys, then
AggregateJoin(S₁, ...,S_n) = {agg₁→val₁, ...,agg_n→val_n |key inK andkey→val_i inS_i for each 1 ≤i ≤n }

18.6.1.1Set Functions

The set functions which underlie SPARQL aggregates all have a common signature: SetFunc(S), or SetFunc(S,scalarvals) whereS is a sequence of lists, andscalarvals is one or more scalar values that are passed to the set function indirectly via the ( ... ; key=value ) syntax for aggregates in the SPARQL grammar. The only use of this that is supported by the built-in aggregates in SPARQL Query 1.1 isGROUP_CONCAT, as inGROUP_CONCAT(?x ; separator=", ").

Note that the name "Set Function" is somewhat historical — the arguments to set functions are in fact sequences. The name is retained due to the commonality with SQL Set Functions, which operate over multisets.

The set functions defined in this document areCount,Sum,Min,Max,Avg,GroupConcat, andSample — corresponding to the aggregatesCOUNT,SUM,MIN,MAX,AVG,GROUP_CONCAT, andSAMPLE. Definitions may be found in the following sections. Systems may choose to expand this set using local extensions, using the same notation as for functions and casts. Note that, unless the ; separator is used this requires the parser to know whether some IRI refers to a function, cast, or aggregate before it can determine if there are any errors in a query where aggregates are used.

The definitions of the set functions in the following sections are based on two functions,Flatten andCard, which are defined as follows.

Flatten is a function which is used to collapse a sequence of lists into a single list. For example, [(1, 2), (3, 4)] becomes (1, 2, 3, 4).

Definition:Flatten

LetS be a sequence of lists, i.e.,S = [L₁,L₂, ...,L_m] where, for everyi ∈ {1, ...,m},L_i is a list.

Flatten(S) is the list (x |L inS andx inL ).

Card is a function that returns the cardinality of a sequence or a list of elements (which may be solution mappings or other types of elements, depending on the context).

Definition:Card

Given a sequence or a listL,Card(L) is the cardinality ofL.

18.6.1.2Count

Count is a SPARQL set function which counts the number of times a given expression has a bound, non-error value within the aggregate group.

Definition:Count

xsd:integerCount(sequenceS)

Count(S) =Card(L'),

whereL' is the listL =Flatten(S) with all error elements removed.

18.6.1.3Sum

Sum is a SPARQL set function that returns the numeric value obtained by summing the values within the aggregate group. Type promotion happens as per the op:numeric-add function, applied transitively, (see definition below) so the value of SUM(?x), in an aggregate group where ?x has values 1 (integer), 2.0e0 (float), and 3.0 (decimal) will be 6.0 (float).

Definition:Sum

numericSum(sequenceS)

Sum(S) =SumList(L),

whereL =Flatten(S) andSumList(L) is defined recursively as follows.

IfCard(L) = 0, thenSumList(L) = "0"^^xsd:integer.
IfCard(L) = 1, thenSumList(L) =op:numeric-add(L₁, 0).
IfCard(L) > 1, thenSumList(L) =op:numeric-add(L₁,SumList(L_2..n)).

Note thatL₁ is the first element inL, andL_2..n isL without its first element.

In this way,Sum( [(1), (2), (3)] ) =SumList( (1, 2, 3) ) = op:numeric-add(1, op:numeric-add(2, op:numeric-add(3, 0))).

18.6.1.4Avg

TheAvg set function calculates the average value for an expression over a group. It is defined in terms of Sum and Count.

Definition:Avg

numericAvg(sequenceS)

IfCount(S) = 0, thenAvg(S) = "0"^^xsd:integer.

IfCount(S) > 0, thenAvg(S) =Sum(S) /Count(S).

For example,Avg([(1), (2), (3)]) =Sum([(1), (2), (3)])/Count([(1), (2), (3)]) = 6/3 = 2.

18.6.1.5Min

Min is a SPARQL set function that returns the minimum value from a group respectively.

It makes use of the SPARQL ORDER BY ordering definition, to allow ordering over arbitrarily typed expressions.

Definition:Min

termMin(sequenceS)

Min(S) =MinList(L),

whereL is the list of values obtained byFlatten(S) and then ordered as per theORDER BY ASC clause, andMinList(L) is defined as follows.

IfCard(L) = 0, thenMinList(L) = error.
IfCard(L) > 0, thenMinList(L) =L₁, whereL₁ is the first element inL.

18.6.1.6Max

Max is a SPARQL set function that returns the maximum value from a group respectively.

It makes use of the SPARQL ORDER BY ordering definition, to allow ordering over arbitrarily typed expressions.

Definition:Max

termMax(sequenceS)

Max(S) =MaxList(L),

whereL is the list of values obtained byFlatten(S) and then ordered as per theORDER BY DESC clause, andMaxList(L) is defined as follows.

IfCard(L) = 0, thenMaxList(L) = error.
IfCard(L) > 0, thenMaxList(L) =L₁, whereL₁ is the first element inL.

18.6.1.7GroupConcat

GroupConcat is a set function which performs a string concatenation across the values of an expression with a group. The order of the strings is not specified. The separator character used in the concatenation may be given with the scalar argument SEPARATOR.

Definition:GroupConcat

literalGroupConcat(sequenceS, functionscalarvals)

If thescalarvals argument is absent fromGROUP_CONCAT, thenscalarvals is taken to be the empty function.

Letsep be a string that is defined as follows.

Ifscalarvals is defined for the argument "separator", thensep =scalarvals("separator").
Ifscalarvals is undefined for the argument "separator", thensep is the "space" character (i.e., unicode codepoint U+0020).

GroupConcat(S,scalarvals) =GCList(L,sep),

whereL =Flatten(S) andGCList(L,sep) is defined recursively as follows.

IfCard(L) = 0, thenGCList(L,sep) = "".
IfCard(L) = 1, thenGCList(L,sep) =CONCAT("",L₁).
IfCard(L) > 1, thenGCList(L,sep) =CONCAT(L₁,sep,GCList(L_2..n,sep)).

Note thatL₁ is the first element inL, andL_2..n isL without its first element.

For example,GroupConcat([("a"), ("b"), ("c")], {"separator" → "."}) =GCList( ("a", "b", "c"), "." ) = "a.b.c".

18.6.1.8Sample

Sample is a set function which returns an arbitrary value from the sequence passed to it.

Definition:Sample

RDFTermSample(sequenceS)

IfCard(S) = 0, thenSample(S) = error.

IfCard(S) > 0, thenSample(S) =v, wherev inFlatten(S).

For example, givenSample([("a"), ("b"), ("c")]), "a", "b", and "c" are all valid return values. Note that theSample function is not required to be deterministic for a given input. The only restriction is that the output value must be present in the input sequence.

18.6.2Evaluation Semantics

We defineeval(D(G),A) as the evaluation of analgebraic query expressionA with respect to adatasetD havingactive graphG. The active graph is initially the default graph ofD. Further symbols used in the following definitions are:

P,P₁,P₂ : graph patterns
L : a solution sequence
F : an expression

Issue 225: eval function undefined for multisets and sequences of solution mappings

The definitions in this section do not cover the cases in whichA is a sequence or a multiset of solution mappings.

Definition:Evaluation of a Basic Graph Pattern

eval(D(G),BGP ) = multiset of solution mappings

See sectionBasic Graph Patterns

Definition:Evaluation of a Property Path Pattern

eval(D(G),Path(X,path,Y) ) = multiset of solution mappings

See sectionProperty Path Patterns

Definition:Evaluation of Filter

eval(D(G),Filter(F,P) ) =Filter(F,eval(D(G),P),D(G) )

'substitute' is a filter function in support of the evaluation ofEXISTS andNOT EXISTS forms which were translated toexists.

Definition:Substitute

Letμ be a solution mapping.

substitute(pattern,μ) = the pattern formed by replacing every occurrence of a variablev inpattern byμ(v) for eachv in dom(μ)

Definition:Evaluation of Exists

Letμ be the current solution mapping for a filter andP a graph pattern:

The value exists(P), givenD(G) is true if and only ifeval(D(G), substitute(P,μ)) is a non-empty sequence.

Definition:Evaluation of Join

eval(D(G),Join(P₁,P₂) ) =Join(eval(D(G),P₁),eval(D(G),P₂) )

Definition:Evaluation of LeftJoin

eval(D(G),LeftJoin(P₁,P₂,F) ) =LeftJoin(eval(D(G),P₁),eval(D(G),P₂),F )

Definition:Evaluation of Union

eval(D(G),Union(P₁,P₂) ) =Union(eval(D(G),P₁),eval(D(G),P₂) )

Definition:Evaluation of Graph

For everyx that is anIRI or avariable,eval(D(G),Graph(x,P) ) is defined as follows:

Ifx is an IRI that is agraph name inD,
eval(D(G),Graph(x,P) ) =eval(D(G_x),P ),
whereG_x is the RDF graph of the named graph with namex inD.
Ifx is an IRI that is not agraph name inD,
eval(D(G),Graph(x,P) ) is the empty multiset.

Ifx is a variable,

eval(D(G),Graph(x,P) ) =Ω,

whereΩ is the multiset of solution mappings produced by the following algorithm:

Ω := the empty multisetfor eachgraph namegn inD (recall that a graph name may be an IRI or a blank node)G' := the RDF graph of the named graph with namegn inDΩ' :=eval(D(G'),P )Ω :=Union(Ω,Join(Ω',μ) ), whereμ = {x →gn}the result isΩ

Definition: Evaluation of Group

eval(D(G),Group(exprlist,P) ) =Group(exprlist,eval(D(G),P) )

Definition: Evaluation of Aggregation

eval(D(G),Aggregation(exprlist,func,scalarvals,Grp) ) =Aggregation(exprlist,func,scalarvals,eval(D(G),Grp) )

Definition: Evaluation of AggregateJoin

eval(D(G),AggregateJoin(A₁, ...,A_n) ) =AggregateJoin(eval(D(G),A₁), ...,eval(D(G),A_n) )

Note that ifeval(D(G),A_i) is an error, it is ignored.

Definition:Evaluation ofExtend

eval(D(G),Extend(P,var,expr) ) =Extend(eval(D(G),P),var,expr )

Definition:Evaluation of ToList

eval(D(G),ToList(P) ) =ToList(eval(D(G),P) )

Definition:Evaluation of Distinct

eval(D(G),Distinct(L) ) =Distinct(eval(D(G),L) )

Definition:Evaluation of Reduced

eval(D(G),Reduced(L) ) =Reduced(eval(D(G),L) )

Definition:Evaluation of Project

eval(D(G),Project(L,vars) ) =Project(eval(D(G),L),vars )

Definition:Evaluation of OrderBy

eval(D(G),OrderBy(L,condition) ) =OrderBy(eval(D(G),L),condition )

Definition:Evaluation of ToMultiSet

eval(D(G),ToMultiset(L) ) =ToMultiSet(eval(D(G),L) )

Definition:Evaluation of Slice

eval(D(G),Slice(L,start,length) ) =Slice(eval(D(G),L),start,length )

18.6.3Extending SPARQL Basic Graph Matching

The overall SPARQL design can be used for queries which assume a more elaborate form of entailment than simple entailment, by re-writing the matching conditions for basic graph patterns. Since it is an open research problem to state such conditions in a single general form which applies to all forms of entailment and optimally eliminates needless or inappropriate redundancy, this document only gives necessary conditions which any such solution should satisfy. These will need to be extended to full definitions for each particular case.

Basic graph patterns stand in the same relation to triple patterns that RDF graphs do to RDF triples, and much of the same terminology can be applied to them. In particular, two basic graph patterns are said to beequivalent if there is a bijection M between the terms of the triple patterns that maps blank nodes to blank nodes and maps variables, literals and IRIs to themselves, such that a triple ( s, p, o ) is in the first pattern if and only if the triple ( M(s), M(p), M(o) ) is in the second. This definition extends that for RDF graph equivalence to basic graph patterns by preserving variable names across equivalent patterns.

Anentailment regime specifies

a subset of RDF graphs calledwell-formed for the regime
anentailment relation between subsets of well-formed graphs and well-formed graphs.

Detailed definitions for querying various entailment regimes can be found inSPARQL 1.1 Entailment Regimes.

Some entailment regimes can categorize some RDF graphs as inconsistent. For example, the RDF graph:

_:x rdf:type xsd:string ._:x rdf:type xsd:decimal .

is D-inconsistent when D contains the XSD datatypes. The effect of a query on an inconsistent graph is not covered by this specification, but must be specified by the particular SPARQL extension.

An entailment regime E must provide conditions on basic graph pattern evaluation such that for any basic graph pattern BGP, any RDF graph G, and any evaluation that satisfies the conditions, the resulting multiset of solutions is uniquely determined up to RDF graph equivalence. We denote the multiset of solutions from evaluating BGP over G using E with Eval-E(G, BGP).
An entailment regime must further satisfy the following conditions:

For any E-consistent active graph AG, the entailment regime E uniquely specifies ascoping graph SG that is E-equivalent to AG.
A set of well-formed graphs for E is specified such that, for any basic graph pattern BGP, scoping graph SG, and solution mapping μ in Eval-E(SG, BGP), the graph μ(BGP) is well-formed for E.
For any basic graph pattern BGP and scoping graph SG, if μ₁, ..., μ_n in Eval-E(SG, BGP) and BGP₁, ..., BGP_n are basic graph patterns all equivalent to BGP but not sharing any blank nodes with each other or with SG, then
SG E-entails (SG union μ₁(BGP₁) union ... union μ_n(BGP_n))
These conditions do not fully determine the set of possible answers, since RDF allows unlimited amounts of redundancy. In addition, therefore, the following must hold.
Entailment regimes should provide conditions to prevent trivial infinite solution multisets as appropriate to the regime.

18.6.3.1Notes

(a) SG will often be graph equivalent to AG, but restricting this to E-equivalence allows some forms of normalization, for example elimination of semantic redundancies, to be applied to the source documents before querying.

(b) The construction in condition 3 ensures that any blank nodes introduced by the solution mapping are used in a way which is internally consistent with the way that blank nodes occur in SG. This ensures that blank node identifiers occur in more than one answer in an answer set only when the blank nodes so identified are indeed identical in SG. If the extension does not allow bindings to blank nodes, then this condition can be simplified to the condition:

SG E-entails μ(BGP) for each solution mapping μ.

(c) These conditions do not impose the SPARQL requirement that SG shares no blank nodes with AG or BGP. In particular, it allows SG to actually be AG. This allows query protocols in which blank node identifiers retain their meaning between the query and the source document, or across multiple queries. Such protocols are not supported by the current SPARQL protocol specification, however.

(d) Since conditions 1 to 3 are only necessary conditions on answers, condition 4 allows cases where the set of legal answers can be restricted in various ways.

(e) None of these conditions refer explicitly to instance mappings on blank nodes in BGP. For some entailment regimes, the existential interpretation of blank nodes cannot be fully captured by the existence of a single instance mapping. These conditions allow such regimes to give blank nodes in query patterns a 'fully existential' reading.

It is straightforward to show that SPARQL satisfies these conditions for the case where E is simple entailment, given that the SPARQL condition on SG is that it is graph-equivalent to AG but shares no blank nodes with AG or BGP (which satisfies the first condition). The only condition which is nontrivial is (3).

For every solution mapping μ_i, there is, by definition of basic graph pattern matching, an RDF instance mapping σ_i such that P_i(BGP_i) is a subgraph of SG where P_i is the pattern instance mapping composed of μ_i and σ_i. Since BGP_i and SG have no blank nodes in common, the ranges of σ_i and μ_i contain no blank nodes from BGP_i; therefore, the solution mapping μ_i and the RDF instance mapping σ_i of P_i commute, so P_i(BGP_i) = σ_i(μ_i(BGP_i)). So

P₁(BGP₁) union ... union P_n(BGP_n)
= σ₁(μ₁(BGP₁)) union ... union σ_n(μ_n(BGP_n))
= [ σ₁ + ... + σ_n]( μ₁(BGP₁) union ... union μ_n(BGP_n) )

since the domains of the σ_i RDF instance mappings are all mutually exclusive. Since they are also exclusive from SG,

SG union [ σ₁ + ... + σ_n]( μ₁(BGP₁) union ... union μ_n(BGP_n) )
= [ σ₁ + ... + σ_n](SG union μ₁(BGP₁) union ... union μ_n(BGP_n) )

i.e.

SG union μ₁(BGP₁) union ... union μ_n(BGP_n)

has an instance which is a subgraph of SG, so is simply entailed by SG by theRDF interpolation lemma [RDF12-SEMANTICS].

19.SPARQL Grammar

The SPARQL grammar covers both SPARQL Query andSPARQL 1.1 Update.

19.1SPARQL String

ASPARQL string is anRDF string that conforms to the grammar given in this section.

Note

AnRDF string is a sequence ofUnicode code points which areUnicode scalar values. Unicode scalar values do not include thesurrogate code points.

ASPARQL query string is aSPARQL string that conforms to the grammar starting at theQueryUnit production.

ASPARQL update string is aSPARQL string that conforms to the grammar starting at theUpdateUnit production.

For compatibility with future versions of Unicode, the characters in this string may include Unicode codepoints that are unassigned as of the date of this publication (seeUnicode Identifiers and Syntax [UAX31] section 4 Pattern Syntax). For productions with excluded character classes (for example[^<>'{}|^`]), the characters are excluded from the range#x0 - #x10FFFF.

19.2Codepoint Escape Sequences

ASPARQL string is processed for codepoint escape sequences before parsing by the grammar defined in EBNF below. The codepoint escape sequences for a SPARQL query string are:

Escape	Unicode code point
'\u'HEX HEX HEX HEX	A Unicode code point in the range U+0 to U+FFFF inclusive corresponding to the encoded hexadecimal value, excluding U+D800 to U+DFFF, thesurrogate code points.
'\U'HEX HEX HEX HEX HEX HEX HEX HEX	A Unicode code point in the range U+0 to U+10FFFF inclusive corresponding to the encoded hexadecimal value, excluding U+D800 to U+DFFF, thesurrogate code points.

whereHEX is a hexadecimal character

HEX ::= [0-9] | [A-F] | [a-f]

Examples:

<ab\u00E9xy>        # Codepoint 00E9 is Latin small e with acute - é\u03B1:a            # Codepoint x03B1 is Greek small alpha - αa\u003Ab            # a:b -- codepoint x3A is colon

Codepoint escape sequences can appear anywhere in the query string. They are processed before parsing based on the grammar rules and so may be replaced by codepoints with significance in the grammar, such as ":" marking a prefixed name.

These escape sequences are not included in the grammar below. Only escape sequences for characters that would be legal at that point in the grammar may be given. For example, the variable "?x\u0020y" is not legal (\u0020 is a space and is not permitted in a variable name).

19.3White Space

White space (productionWS) is used to separate two terminals which would otherwise be (mis-)recognized as one terminal. Rule names below in capitals indicate where white space is significant; these form a possible choice of terminals for constructing a SPARQL parser. White space is significant in strings. Otherwise, white space is ignored between tokens.

For example:

?a<?b&&?c>?d

is the token sequence variable '?a', an IRI '<?b&&?c>', and variable '?d', not a expression involving the operator '&&' connecting two expression using '<' (less than) and '>' (greater than).

19.4Comments

Comments in SPARQL queries take the form of '#', outside an IRI or string, and continue to the end of line (marked by characters0x0D or0x0A) or end of file if there is no end of line after the comment marker. Comments are treated as white space.

19.5IRI References

Text matched by theIRIREF production andPrefixedName (after prefix expansion) production, after escape processing, must conform to the generic syntax of IRI references in section 2.2 of RFC 3987 "ABNF for IRI References and IRIs" [RFC3987]. For example, theIRIREF<abc#def> may occur in a SPARQL query string, but theIRIREF<abc##def> must not.

Base IRIs declared with theBASE keyword must be absolute IRIs. A prefix declared with thePREFIX keyword may not be re-declared in the same query. See section 4.1.1,Syntax of IRI Terms, for a description ofBASE andPREFIX.

19.6Blank Nodes and Blank Node Identifiers

Blank nodes can not be used in:

in aSPARQL update request.

Blank node identifiers are scoped to theSPARQL string in which they occur. Different uses of the same blank node identifier in a request string refer to the same blank node. Fresh blank nodes are generated for each request; blank nodes can not be referenced by identifier across requests.

The same blank node identifier can not be used in:

two separate basic graph patterns in a SPARQL Query
twoWHERE clauses within a single SPARQL update request
twoINSERT DATA operations within a single SPARQL update request

Note that the same blank node identifier can occur in differentQuadPattern clauses in aSPARQL 1.1 Update request.

19.7Escape sequences in strings

In addition to thecodepoint escape sequences, the following escape sequences apply to anystring production (e.g.STRING_LITERAL1,STRING_LITERAL2,STRING_LITERAL_LONG1,STRING_LITERAL_LONG2):

Escape	Unicode code point
'\t'	U+0009 (tab)
'\n'	U+000A (line feed)
'\r'	U+000D (carriage return)
'\b'	U+0008 (backspace)
'\f'	U+000C (form feed)
'\"'	U+0022 (quotation mark, double quote mark)
"\'"	U+0027 (apostrophe-quote, single quote mark)
'\\'	U+005C (backslash)

Examples:

"abc\n""xy\rz"'xy\tz'

19.8Grammar

The EBNF notation used in the grammar is defined in Extensible Markup Language (XML) 1.1 [XML11] section 6Notation.

Notes:

Keywords are matched in a case-insensitive manner with the exception of the keyword 'a' which, in line with Turtle and N3, is used in place of the IRIrdf:type (in full,http://www.w3.org/1999/02/22-rdf-syntax-ns#type).
Escape sequences are case sensitive.
When tokenizing the input and choosing grammar rules, the longest match is chosen.
The SPARQL grammar is LL(1) when the rules with uppercased names are used as terminals.
There are two entry points into the grammar:QueryUnit for the SPARQL query language andUpdateUnit for the SPARQL update language.
In signed numbers, no white space is allowed between the sign and the number. TheAdditiveExpression grammar rule allows for this by covering the two cases of an expression followed by a signed number. These produce an addition or subtraction of the unsigned number as appropriate.
The tokensINSERT DATA,DELETE DATA andDELETE WHERE allow any amount of white space between the words. The single space version is used in the grammar for clarity.
TheQuadData andQuadPattern rules both use ruleQuads. The ruleQuadData, used inINSERTDATA andDELETE DATA, must not allow variables in the quad patterns.
Blank node syntax is not allowed inDELETE WHERE, theDeleteClause forDELETE, nor inDELETE DATA.
Rules for limiting the use of blank node identifiers are given insection 19.6.
The number of variables in the variable list ofVALUES block must be the same as the number of each list of associated values in theDataBlock.
Variables introduced byAS in aSELECT clause must not already bein-scope.
The variable assigned in aBIND clause must not be already in-use within the immediately precedingTriplesBlock within aGroupGraphPattern.
Aggregate functions can be one of thebuilt-in keywords for aggregates or a custom aggregate, which is syntactically afunction call. Aggregate functions may only be used inSELECT,HAVING andORDER BY clauses.
The expression argument of an aggregate function can not contain an aggregate function.
Only custom aggregate functions use theDISTINCT keyword in afunction call.
Areifier orannotation syntax is only permitted after a triple when the property position is a simple path (an IRI, the keyworda, or a variable), and not for other path expressions.

`[1]`	`QueryUnit`	::=	`Query`
`[2]`	`Query`	::=	`Prologue (SelectQuery \|ConstructQuery \|DescribeQuery \|AskQuery ) ValuesClause`
`[3]`	`UpdateUnit`	::=	`Update`
`[4]`	`Prologue`	::=	`(BaseDecl \|PrefixDecl \|VersionDecl )*`
`[5]`	`BaseDecl`	::=	`'BASE'IRIREF`
`[6]`	`PrefixDecl`	::=	`'PREFIX'PNAME_NSIRIREF`
`[7]`	`VersionDecl`	::=	`'VERSION'VersionSpecifier`
`[8]`	`VersionSpecifier`	::=	`STRING_LITERAL1 \|STRING_LITERAL2`
`[9]`	`SelectQuery`	::=	`SelectClauseDatasetClause*WhereClauseSolutionModifier`
`[10]`	`SubSelect`	::=	`SelectClauseWhereClauseSolutionModifierValuesClause`
`[11]`	`SelectClause`	::=	`'SELECT' ('DISTINCT' \|'REDUCED' )? ( (Var \| ('('Expression'AS'Var')' ) )+ \|'*' )`
`[12]`	`ConstructQuery`	::=	`'CONSTRUCT' (ConstructTemplateDatasetClauseWhereClauseSolutionModifier \|DatasetClause'WHERE''{'TriplesTemplate?'}'SolutionModifier )`
`[13]`	`DescribeQuery`	::=	`'DESCRIBE' (VarOrIri+ \|'' )DatasetClauseWhereClause?SolutionModifier`
`[14]`	`AskQuery`	::=	`'ASK'DatasetClause*WhereClauseSolutionModifier`
`[15]`	`DatasetClause`	::=	`'FROM' (DefaultGraphClause \|NamedGraphClause )`
`[16]`	`DefaultGraphClause`	::=	`SourceSelector`
`[17]`	`NamedGraphClause`	::=	`'NAMED'SourceSelector`
`[18]`	`SourceSelector`	::=	`iri`
`[19]`	`WhereClause`	::=	`'WHERE'?GroupGraphPattern`
`[20]`	`SolutionModifier`	::=	`GroupClause?HavingClause?OrderClause?LimitOffsetClauses?`
`[21]`	`GroupClause`	::=	`'GROUP''BY'GroupCondition+`
`[22]`	`GroupCondition`	::=	`BuiltInCall \|FunctionCall \|'('Expression ('AS'Var )?')' \|Var`
`[23]`	`HavingClause`	::=	`'HAVING'HavingCondition+`
`[24]`	`HavingCondition`	::=	`Constraint`
`[25]`	`OrderClause`	::=	`'ORDER''BY'OrderCondition+`
`[26]`	`OrderCondition`	::=	`( ('ASC' \|'DESC' )BrackettedExpression ) \| (Constraint \|Var )`
`[27]`	`LimitOffsetClauses`	::=	`LimitClauseOffsetClause? \|OffsetClauseLimitClause?`
`[28]`	`LimitClause`	::=	`'LIMIT'INTEGER`
`[29]`	`OffsetClause`	::=	`'OFFSET'INTEGER`
`[30]`	`ValuesClause`	::=	`('VALUES'DataBlock )?`
`[31]`	`Update`	::=	`Prologue (Update1 (';'Update )? )?`
`[32]`	`Update1`	::=	`Load \|Clear \|Drop \|Add \|Move \|Copy \|Create \|DeleteWhere \|Modify \|InsertData \|DeleteData`
`[33]`	`Load`	::=	`'LOAD''SILENT'?iri ('INTO'GraphRef )?`
`[34]`	`Clear`	::=	`'CLEAR''SILENT'?GraphRefAll`
`[35]`	`Drop`	::=	`'DROP''SILENT'?GraphRefAll`
`[36]`	`Create`	::=	`'CREATE''SILENT'?GraphRef`
`[37]`	`Add`	::=	`'ADD''SILENT'?GraphOrDefault'TO'GraphOrDefault`
`[38]`	`Move`	::=	`'MOVE''SILENT'?GraphOrDefault'TO'GraphOrDefault`
`[39]`	`Copy`	::=	`'COPY''SILENT'?GraphOrDefault'TO'GraphOrDefault`
`[40]`	`InsertData`	::=	`'INSERT DATA'QuadData`
`[41]`	`DeleteData`	::=	`'DELETE DATA'QuadData`
`[42]`	`DeleteWhere`	::=	`'DELETE WHERE'QuadPattern`
`[43]`	`Modify`	::=	`('WITH'iri )? (DeleteClauseInsertClause? \|InsertClause )UsingClause*'WHERE'GroupGraphPattern`
`[44]`	`DeleteClause`	::=	`'DELETE'QuadPattern`
`[45]`	`InsertClause`	::=	`'INSERT'QuadPattern`
`[46]`	`UsingClause`	::=	`'USING' (iri \|'NAMED'iri )`
`[47]`	`GraphOrDefault`	::=	`'DEFAULT' \|'GRAPH'?iri`
`[48]`	`GraphRef`	::=	`'GRAPH'iri`
`[49]`	`GraphRefAll`	::=	`GraphRef \|'DEFAULT' \|'NAMED' \|'ALL'`
`[50]`	`QuadPattern`	::=	`'{'Quads'}'`
`[51]`	`QuadData`	::=	`'{'Quads'}'`
`[52]`	`Quads`	::=	`TriplesTemplate? (QuadsNotTriples'.'?TriplesTemplate? )*`
`[53]`	`QuadsNotTriples`	::=	`'GRAPH'VarOrIri'{'TriplesTemplate?'}'`
`[54]`	`TriplesTemplate`	::=	`TriplesSameSubject ('.'TriplesTemplate? )?`
`[55]`	`GroupGraphPattern`	::=	`'{' (SubSelect \|GroupGraphPatternSub )'}'`
`[56]`	`GroupGraphPatternSub`	::=	`TriplesBlock? (GraphPatternNotTriples'.'?TriplesBlock? )*`
`[57]`	`TriplesBlock`	::=	`TriplesSameSubjectPath ('.'TriplesBlock? )?`
`[58]`	`ReifiedTripleBlock`	::=	`ReifiedTriplePropertyList`
`[59]`	`ReifiedTripleBlockPath`	::=	`ReifiedTriplePropertyListPath`
`[60]`	`GraphPatternNotTriples`	::=	`GroupOrUnionGraphPattern \|OptionalGraphPattern \|MinusGraphPattern \|GraphGraphPattern \|ServiceGraphPattern \|Filter \|Bind \|InlineData`
`[61]`	`OptionalGraphPattern`	::=	`'OPTIONAL'GroupGraphPattern`
`[62]`	`GraphGraphPattern`	::=	`'GRAPH'VarOrIriGroupGraphPattern`
`[63]`	`ServiceGraphPattern`	::=	`'SERVICE''SILENT'?VarOrIriGroupGraphPattern`
`[64]`	`Bind`	::=	`'BIND''('Expression'AS'Var')'`
`[65]`	`InlineData`	::=	`'VALUES'DataBlock`
`[66]`	`DataBlock`	::=	`InlineDataOneVar \|InlineDataFull`
`[67]`	`InlineDataOneVar`	::=	`Var'{'DataBlockValue*'}'`
`[68]`	`InlineDataFull`	::=	`(NIL \|'('Var')' )'{' ('('DataBlockValue')' \|NIL )*'}'`
`[69]`	`DataBlockValue`	::=	`iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|'UNDEF' \|TripleTermData`
`[70]`	`Reifier`	::=	`'~'VarOrReifierId?`
`[71]`	`VarOrReifierId`	::=	`Var \|iri \|BlankNode`
`[72]`	`MinusGraphPattern`	::=	`'MINUS'GroupGraphPattern`
`[73]`	`GroupOrUnionGraphPattern`	::=	`GroupGraphPattern ('UNION'GroupGraphPattern )*`
`[74]`	`Filter`	::=	`'FILTER'Constraint`
`[75]`	`Constraint`	::=	`BrackettedExpression \|BuiltInCall \|FunctionCall`
`[76]`	`FunctionCall`	::=	`iriArgList`
`[77]`	`ArgList`	::=	`NIL \|'(''DISTINCT'?Expression (','Expression )*')'`
`[78]`	`ExpressionList`	::=	`NIL \|'('Expression (','Expression )*')'`
`[79]`	`ConstructTemplate`	::=	`'{'ConstructTriples?'}'`
`[80]`	`ConstructTriples`	::=	`TriplesSameSubject ('.'ConstructTriples? )?`
`[81]`	`TriplesSameSubject`	::=	`VarOrTermPropertyListNotEmpty \|TriplesNodePropertyList \|ReifiedTripleBlock`
`[82]`	`PropertyList`	::=	`PropertyListNotEmpty?`
`[83]`	`PropertyListNotEmpty`	::=	`VerbObjectList (';' (VerbObjectList )? )*`
`[84]`	`Verb`	::=	`VarOrIri \|'a'`
`[85]`	`ObjectList`	::=	`Object (','Object )*`
`[86]`	`Object`	::=	`GraphNodeAnnotation`
`[87]`	`TriplesSameSubjectPath`	::=	`VarOrTermPropertyListPathNotEmpty \|TriplesNodePathPropertyListPath \|ReifiedTripleBlockPath`
`[88]`	`PropertyListPath`	::=	`PropertyListPathNotEmpty?`
`[89]`	`PropertyListPathNotEmpty`	::=	`(VerbPath \|VerbSimple )ObjectListPath (';' ( (VerbPath \|VerbSimple )ObjectListPath )? )*`
`[90]`	`VerbPath`	::=	`Path`
`[91]`	`VerbSimple`	::=	`Var`
`[92]`	`ObjectListPath`	::=	`ObjectPath (','ObjectPath )*`
`[93]`	`ObjectPath`	::=	`GraphNodePathAnnotationPath`
`[94]`	`Path`	::=	`PathAlternative`
`[95]`	`PathAlternative`	::=	`PathSequence ('\|'PathSequence )*`
`[96]`	`PathSequence`	::=	`PathEltOrInverse ('/'PathEltOrInverse )*`
`[97]`	`PathElt`	::=	`PathPrimaryPathMod?`
`[98]`	`PathEltOrInverse`	::=	`PathElt \|'^'PathElt`
`[99]`	`PathMod`	::=	`'?' \|'*' \|'+'`
`[100]`	`PathPrimary`	::=	`iri \|'a' \|'!'PathNegatedPropertySet \|'('Path')'`
`[101]`	`PathNegatedPropertySet`	::=	`PathOneInPropertySet \|'(' (PathOneInPropertySet ('\|'PathOneInPropertySet )* )?')'`
`[102]`	`PathOneInPropertySet`	::=	`iri \|'a' \|'^' (iri \|'a' )`
`[103]`	`TriplesNode`	::=	`Collection \|BlankNodePropertyList`
`[104]`	`BlankNodePropertyList`	::=	`'['PropertyListNotEmpty']'`
`[105]`	`TriplesNodePath`	::=	`CollectionPath \|BlankNodePropertyListPath`
`[106]`	`BlankNodePropertyListPath`	::=	`'['PropertyListPathNotEmpty']'`
`[107]`	`Collection`	::=	`'('GraphNode+')'`
`[108]`	`CollectionPath`	::=	`'('GraphNodePath+')'`
`[109]`	`AnnotationPath`	::=	`(Reifier \|AnnotationBlockPath )*`
`[110]`	`AnnotationBlockPath`	::=	`'{\|'PropertyListPathNotEmpty'\|}'`
`[111]`	`Annotation`	::=	`(Reifier \|AnnotationBlock )*`
`[112]`	`AnnotationBlock`	::=	`'{\|'PropertyListNotEmpty'\|}'`
`[113]`	`GraphNode`	::=	`VarOrTerm \|TriplesNode \|ReifiedTriple`
`[114]`	`GraphNodePath`	::=	`VarOrTerm \|TriplesNodePath \|ReifiedTriple`
`[115]`	`VarOrTerm`	::=	`Var \|iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|BlankNode \|NIL \|TripleTerm`
`[116]`	`ReifiedTriple`	::=	`'<<'ReifiedTripleSubjectVerbReifiedTripleObjectReifier?'>>'`
`[117]`	`ReifiedTripleSubject`	::=	`Var \|iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|BlankNode \|ReifiedTriple \|TripleTerm`
`[118]`	`ReifiedTripleObject`	::=	`Var \|iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|BlankNode \|ReifiedTriple \|TripleTerm`
`[119]`	`TripleTerm`	::=	`'<<('TripleTermSubjectVerbTripleTermObject')>>'`
`[120]`	`TripleTermSubject`	::=	`Var \|iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|BlankNode \|TripleTerm`
`[121]`	`TripleTermObject`	::=	`Var \|iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|BlankNode \|TripleTerm`
`[122]`	`TripleTermData`	::=	`'<<('TripleTermDataSubject (iri \|'a' )TripleTermDataObject')>>'`
`[123]`	`TripleTermDataSubject`	::=	`iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|TripleTermData`
`[124]`	`TripleTermDataObject`	::=	`iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|TripleTermData`
`[125]`	`VarOrIri`	::=	`Var \|iri`
`[126]`	`Var`	::=	`VAR1 \|VAR2`
`[127]`	`Expression`	::=	`ConditionalOrExpression`
`[128]`	`ConditionalOrExpression`	::=	`ConditionalAndExpression ('\|\|'ConditionalAndExpression )*`
`[129]`	`ConditionalAndExpression`	::=	`ValueLogical ('&&'ValueLogical )*`
`[130]`	`ValueLogical`	::=	`RelationalExpression`
`[131]`	`RelationalExpression`	::=	`NumericExpression ('='NumericExpression \|'!='NumericExpression \|'<'NumericExpression \|'>'NumericExpression \|'<='NumericExpression \|'>='NumericExpression \|'IN'ExpressionList \|'NOT''IN'ExpressionList )?`
`[132]`	`NumericExpression`	::=	`AdditiveExpression`
`[133]`	`AdditiveExpression`	::=	`MultiplicativeExpression ('+'MultiplicativeExpression \|'-'MultiplicativeExpression \| (NumericLiteralPositive \|NumericLiteralNegative ) ( (''UnaryExpression ) \| ('/'UnaryExpression ) ) )*`
`[134]`	`MultiplicativeExpression`	::=	`UnaryExpression (''UnaryExpression \|'/'UnaryExpression )`
`[135]`	`UnaryExpression`	::=	`'!'PrimaryExpression \|'+'PrimaryExpression \|'-'PrimaryExpression \|PrimaryExpression`
`[136]`	`PrimaryExpression`	::=	`BrackettedExpression \|BuiltInCall \|iriOrFunction \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|Var \|ExprTripleTerm`
`[137]`	`ExprTripleTerm`	::=	`'<<('ExprTripleTermSubjectVerbExprTripleTermObject')>>'`
`[138]`	`ExprTripleTermSubject`	::=	`iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|Var \|ExprTripleTerm`
`[139]`	`ExprTripleTermObject`	::=	`iri \|RDFLiteral \|NumericLiteral \|BooleanLiteral \|Var \|ExprTripleTerm`
`[140]`	`BrackettedExpression`	::=	`'('Expression')'`
`[141]`	`BuiltInCall`	::=	Aggregate \|'STR''('Expression')' \|'LANG''('Expression')' \|'LANGMATCHES''('Expression','Expression')' \|'LANGDIR''('Expression')' \|'DATATYPE''('Expression')' \|'BOUND''('Var')' \|'IRI''('Expression')' \|'URI''('Expression')' \|'BNODE' ('('Expression')' \|NIL ) \|'RAND'NIL \|'ABS''('Expression')' \|'CEIL''('Expression')' \|'FLOOR''('Expression')' \|'ROUND''('Expression')' \|'CONCAT'ExpressionList \|SubstringExpression \|'STRLEN''('Expression')' \|StrReplaceExpression \|'UCASE''('Expression')' \|'LCASE''('Expression')' \|'ENCODE_FOR_URI''('Expression')' \|'CONTAINS''('Expression','Expression')' \|'STRSTARTS''('Expression','Expression')' \|'STRENDS''('Expression','Expression')' \|'STRBEFORE''('Expression','Expression')' \|'STRAFTER''('Expression','Expression')' \|'YEAR''('Expression')' \|'MONTH''('Expression')' \|'DAY''('Expression')' \|'HOURS''('Expression')' \|'MINUTES''('Expression')' \|'SECONDS''('Expression')' \|'TIMEZONE''('Expression')' \|'TZ''('Expression')' \|'NOW'NIL \|'UUID'NIL \|'STRUUID'NIL \|'MD5''('Expression')' \|'SHA1''('Expression')' \|'SHA256''('Expression')' \|'SHA384''('Expression')' \|'SHA512''('Expression')' \|'COALESCE'ExpressionList \|'IF''('Expression','Expression','Expression')' \|'STRLANG''('Expression','Expression')' \|'STRLANGDIR''('Expression','Expression','Expression')' \|'STRDT''('Expression','Expression')' \|'sameTerm''('Expression','Expression')' \|'isIRI''('Expression')' \|'isURI''('Expression')' \|'isBLANK''('Expression')' \|'isLITERAL''('Expression')' \|'isNUMERIC''('Expression')' \|'hasLANG''('Expression')' \|'hasLANGDIR''('Expression')' \|RegexExpression \|ExistsFunc \|NotExistsFunc \|'isTRIPLE''('Expression')' \|'TRIPLE''('Expression','Expression','Expression')' \|'SUBJECT''('Expression')' \|'PREDICATE''('Expression')' \|'OBJECT''('Expression')'
`[142]`	`RegexExpression`	::=	`'REGEX''('Expression','Expression (','Expression )?')'`
`[143]`	`SubstringExpression`	::=	`'SUBSTR''('Expression','Expression (','Expression )?')'`
`[144]`	`StrReplaceExpression`	::=	`'REPLACE''('Expression','Expression','Expression (','Expression )?')'`
`[145]`	`ExistsFunc`	::=	`'EXISTS'GroupGraphPattern`
`[146]`	`NotExistsFunc`	::=	`'NOT''EXISTS'GroupGraphPattern`
`[147]`	`Aggregate`	::=	`'COUNT''(''DISTINCT'? ('*' \|Expression )')' \|'SUM''(''DISTINCT'?Expression')' \|'MIN''(''DISTINCT'?Expression')' \|'MAX''(''DISTINCT'?Expression')' \|'AVG''(''DISTINCT'?Expression')' \|'SAMPLE''(''DISTINCT'?Expression')' \|'GROUP_CONCAT''(''DISTINCT'?Expression (';''SEPARATOR''='String )?')'`
`[148]`	`iriOrFunction`	::=	`iriArgList?`
`[149]`	`RDFLiteral`	::=	`String (LANG_DIR \|'^^'iri )?`
`[150]`	`NumericLiteral`	::=	`NumericLiteralUnsigned \|NumericLiteralPositive \|NumericLiteralNegative`
`[151]`	`NumericLiteralUnsigned`	::=	`INTEGER \|DECIMAL \|DOUBLE`
`[152]`	`NumericLiteralPositive`	::=	`INTEGER_POSITIVE \|DECIMAL_POSITIVE \|DOUBLE_POSITIVE`
`[153]`	`NumericLiteralNegative`	::=	`INTEGER_NEGATIVE \|DECIMAL_NEGATIVE \|DOUBLE_NEGATIVE`
`[154]`	`BooleanLiteral`	::=	`'true' \|'false'`
`[155]`	`String`	::=	`STRING_LITERAL1 \|STRING_LITERAL2 \|STRING_LITERAL_LONG1 \|STRING_LITERAL_LONG2`
`[156]`	`iri`	::=	`IRIREF \|PrefixedName`
`[157]`	`PrefixedName`	::=	`PNAME_LN \|PNAME_NS`
`[158]`	`BlankNode`	::=	`BLANK_NODE_LABEL \|ANON`

Productions for terminals:

`[159]`	`IRIREF`	::=	'<' ([^<>"{}\|^`\]-[#x00-#x20])* '>'
`[160]`	`PNAME_NS`	::=	`PN_PREFIX? ':'`
`[161]`	`PNAME_LN`	::=	`PNAME_NSPN_LOCAL`
`[162]`	`BLANK_NODE_LABEL`	::=	`'_:' (PN_CHARS_U \| [0-9] ) ((PN_CHARS\|'.')*PN_CHARS)?`
`[163]`	`VAR1`	::=	`'?'VARNAME`
`[164]`	`VAR2`	::=	`'$'VARNAME`
`[165]`	`LANG_DIR`	::=	`'@' [a-zA-Z]+ ('-' [a-zA-Z0-9]+)* ('--' [a-zA-Z]+)?`
`[166]`	`INTEGER`	::=	`[0-9]+`
`[167]`	`DECIMAL`	::=	`[0-9]* '.' [0-9]+`
`[168]`	`DOUBLE`	::=	`( ([0-9]+ ('.'[0-9]*)? ) \| ( '.' ([0-9])+ ) )EXPONENT`
`[169]`	`EXPONENT`	::=	`[eE] [+-]? [0-9]+`
`[170]`	`INTEGER_POSITIVE`	::=	`'+'INTEGER`
`[171]`	`DECIMAL_POSITIVE`	::=	`'+'DECIMAL`
`[172]`	`DOUBLE_POSITIVE`	::=	`'+'DOUBLE`
`[173]`	`INTEGER_NEGATIVE`	::=	`'-'INTEGER`
`[174]`	`DECIMAL_NEGATIVE`	::=	`'-'DECIMAL`
`[175]`	`DOUBLE_NEGATIVE`	::=	`'-'DOUBLE`
`[176]`	`STRING_LITERAL1`	::=	`"'" ( ([^#x27#x5C#xA#xD]) \|ECHAR )* "'"`
`[177]`	`STRING_LITERAL2`	::=	`'"' ( ([^#x22#x5C#xA#xD]) \|ECHAR )* '"'`
`[178]`	`STRING_LITERAL_LONG1`	::=	`"'''" ( ( "'" \| "''" )? ( [^'\] \|ECHAR ) )* "'''"`
`[179]`	`STRING_LITERAL_LONG2`	::=	`'"""' ( ( '"' \| '""' )? ( [^"\] \|ECHAR ) )* '"""'`
`[180]`	`ECHAR`	::=	`'\' [tbnrf\"']`
`[181]`	`NIL`	::=	`'('WS* ')'`
`[182]`	`WS`	::=	`#x20 \| #x9 \| #xD \| #xA`
`[183]`	`ANON`	::=	`'['WS* ']'`
`[184]`	`PN_CHARS_BASE`	::=	`[A-Z] \| [a-z] \| [#x00C0-#x00D6] \| [#x00D8-#x00F6] \| [#x00F8-#x02FF] \| [#x0370-#x037D] \| [#x037F-#x1FFF] \| [#x200C-#x200D] \| [#x2070-#x218F] \| [#x2C00-#x2FEF] \| [#x3001-#xD7FF] \| [#xF900-#xFDCF] \| [#xFDF0-#xFFFD] \| [#x10000-#xEFFFF]`
`[185]`	`PN_CHARS_U`	::=	`PN_CHARS_BASE \| '_'`
`[186]`	`VARNAME`	::=	`(PN_CHARS_U \| [0-9] ) (PN_CHARS_U \| [0-9] \| #x00B7 \| [#x0300-#x036F] \| [#x203F-#x2040] )*`
`[187]`	`PN_CHARS`	::=	`PN_CHARS_U \| '-' \| [0-9] \| #x00B7 \| [#x0300-#x036F] \| [#x203F-#x2040]`
`[188]`	`PN_PREFIX`	::=	`PN_CHARS_BASE ((PN_CHARS\|'.')*PN_CHARS)?`
`[189]`	`PN_LOCAL`	::=	`(PN_CHARS_U \| ':' \| [0-9] \|PLX ) ((PN_CHARS \| '.' \| ':' \|PLX)* (PN_CHARS \| ':' \|PLX) )?`
`[190]`	`PLX`	::=	`PERCENT \|PN_LOCAL_ESC`
`[191]`	`PERCENT`	::=	`'%'HEXHEX`
`[192]`	`HEX`	::=	`[0-9] \| [A-F] \| [a-f]`
`[193]`	`PN_LOCAL_ESC`	::=	`'\' ( '_' \| '~' \| '.' \| '-' \| '!' \| '$' \| '&' \| "'" \| '(' \| ')' \| '*' \| '+' \| ',' \| ';' \| '=' \| '/' \| '?' \| '#' \| '@' \| '%' )`

A text version of this grammar is availablehere.

20.Conformance

As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.

The key wordsMAY,MUST,MUST NOT,OPTIONAL, andSHOULD in this document are to be interpreted as described inBCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.

20.1Conformance

See Section19 SPARQL Grammar regarding conformance ofSPARQL query strings, and section16 Query Forms for conformance of query results. See section22. Internet Media Type for conformance to the application/sparql-query media type.

This specification is intended for use in conjunction with theSPARQL 1.1 Protocol [SPARQL11-PROTOCOL], theSPARQL Query Results XML Format (Second Edition) [RDF-SPARQL-XMLRES], theSPARQL 1.1 Query Results JSON Format [SPARQL11-RESULTS-JSON] and theSPARQL 1.1 Query Results CSV and TSV Formats [SPARQL11-RESULTS-CSV-TSV]. See those specifications for their conformance criteria.

Note that the SPARQL protocol describes a means for conveying SPARQL queries to an SPARQL query processing service and returning the query results to the entity that requested them.

21.Internet Media Type, File Extension and Macintosh File Type

The Internet Media Type (formerly known as MIME Type) for the SPARQL Query Language is "application/sparql-query".

It is recommended that sparql query files have the extension ".rq" (lowercase) on all platforms.

It is recommended that sparql query files stored on Macintosh HFS file systems be given a file type of "TEXT".

Type name:: application
Subtype name:: sparql-query
Required parameters:: None
Optional parameters:: None
Encoding considerations:: The syntax of the SPARQL Query Language is expressed over code points in Unicode [UNICODE]. The encoding is always UTF-8 [RFC3629].; Unicode code points may also be expressed using an \uXXXX (U+0 to U+FFFF) or \UXXXXXXXX syntax (for U+10000 onwards) where X is a hexadecimal digit [0-9A-F]
Security considerations:: See SPARQL Query appendix C,Security Considerations as well asUTF-8, a transformation format of ISO 10646 [RFC3629] section 7, Security Considerations.
Interoperability considerations:: There are no known interoperability issues.
Published specification:: This specification.
Applications which use this media type:: No known applications currently use this media type.
Additional information:
Magic number(s):: A SPARQL query may have the string 'PREFIX' (case independent) near the beginning of the document.
File extension(s):: ".rq"
Base URI:: The SPARQL 'BASE <IRIref>' term can change the current base URI for relative IRIrefs in the query language that are used sequentially later in the document.
Macintosh file type code(s):: "TEXT"
Person & email address to contact for further information:: public-rdf-dawg-comments@w3.org
Intended usage:: COMMON
Restrictions on usage:: None
Author/Change controller:: The SPARQL 1.2 specification is a work product of the World Wide Web Consortium's RDF-star Working Group. TheW3C has change control over these specifications.

A.Changes between SPARQL 1.1 Query Language and SPARQL 1.2 Query Language

This section is non-normative.

Normative changes:
- Update grammar for triple terms, reifiers, reified triples, annotation syntax, and triple term functions in19.8Grammar
- Add functions related totriple terms to17.4.6Functions on Triple Terms:TRIPLE,isTRIPLE,SUBJECT,PREDICATE,OBJECT
- Update grammar for literalbase direction syntax
- Update grammar for VERSION declaration and anew section to describe its usage
- Add functions related tolanguage tag andbase direction:LANGDIR,hasLANG, hasLANGDIR, andSTRLANGDIR
- Define parser input as being anRDF string. Exclude Unicode surrogates from Unicode escape sequences.
- Remove concepts of plain and simple literals, in favor of explicit mentions ofxsd:string
- Migrate XML Schema references to 1.1
- Update references to XPath from 2.0 to 3.1
Editorial changes:
- Give an actual function signature to17.4.2.2sameValue
- Improve wording of blank nodes in16.2.1Templates with Blank Nodes
- Improve display on mobile
- MovesameValue (wasRDFterm-equal) andsameTerm to17.4.2Functions on RDF Terms
- Add note on deduplication of triples produced by CONSTRUCT to16.2CONSTRUCT
- Remove historical notes on rdf:langString datatype from17.4.2.12DATATYPE
- Remove inconsistencies between the definitions of the set functions
- Introduce a function called multiplicity to replace card[Ω](μ) in18.4Basic Graph Patterns
- Update to Media Type language instead of MIME Type language
- Clarify that aggregation returns a single partial function in18.3.4.1Grouping and Aggregation
- Update Filter Evaluation language to reference more functional forms in17.2Filter Evaluation
- Use PREFIX instead of @prefix
- Clarify definition of the Sum set function in18.6.1.3Sum
- Improve definition of Group operator in18.6.1Aggregate Algebra
- Move definitions of Flatten and Card to18.6.1.1Set Functions
- Improve definitions in18.1Initial Definitions
- Fix algorithm for translation SELECT expressions18.3.4.4SELECT Expressions
- Clarify the use of ToList in algebra expressions in18.3Translation to the SPARQL Algebra
Errata:
- editorial-query-1: Missing right parenthesis inEvaluation of Graph definition
- editorial-query-2: Missing space inJoin definition
- editorial-query-3: Incorrect link for DELETE DATA in19.6Blank Nodes and Blank Node Identifiers
- clarification-query-1: Fix explanation of IN and NOT IN in17.4.1.7IN and17.4.1.8NOT IN
- clarification-query-2: Remove unneeded reference to the semantics above in17.3.1Operator Extensibility
- clarification-query-3: Rephrase equality definition in17.4.2.2sameValue
- errata-query-1: Let V be an empty set instead of empty multiset inFunction ALP definition
- errata-query-2: Fix grammar of PropertyListPathNotEmpty in19.SPARQL Grammar
- errata-query-4: Fix CONCAT definition for zero and one argument in17.4.3.10CONCAT
- errata-query-5: Mention illegal nesting of aggregates in19.8Grammar
- errata-query-7 anderrata-query-7a: Remove incorrect full exampleLeftJoin definition
- errata-query-9: Fix examples in18.3.3Examples of Mapped Graph Patterns
- errata-query-11: Rename group variable in18.3.4.1Grouping and Aggregation
- errata-query-12: Clarify definition of Diff in18.6SPARQL Algebra
- errata-query-13: Fix definition of Project cardinality in18.6SPARQL Algebra
- errata-query-18: Fix table in18.3.2.4Translate Property Path Patterns
- errata-query-19: Fix translation in18.3.2.6Translate Graph Patterns
- errata-query-23: Fix inconsistenties betweenMIN andMAX

B.Privacy Considerations

TODO

C.Security Considerations

SPARQL queries using FROM, FROM NAMED, or GRAPH may cause the specified URI to be dereferenced. This may cause additional use of network, disk or CPU resources along with associated secondary issues such as denial of service. The security issues ofUniform Resource Identifier (URI): Generic Syntax [RFC3986] Section 7 should be considered. In addition, the contents offile: URIs can in some cases be accessed, processed and returned as results, providing unintended access to local resources.

SPARQL requests may cause additional requests to be issued from the SPARQL endpoint, such as FROM NAMED. The endpoint is potentially within an organisations firewall or DMZ, and so such queries may be a source of indirection attacks.

The SPARQL language permits extensions, which will have their own security implications.

Multiple IRIs may have the same appearance. Characters in different scripts may look similar (a Cyrillic "о" may appear similar to a Latin "o"). A character followed by combining characters may have the same visual representation as another character (LATIN SMALL LETTER E followed by COMBINING ACUTE ACCENT has the same visual representation as LATIN SMALL LETTER E WITH ACUTE). Users of SPARQL must take care to construct queries with IRIs that match the IRIs in the data. Further information about matching of similar characters can be found inUnicode Security Considerations [UTR36] andInternationalized Resource Identifiers (IRIs) [RFC3987] Section 8.

D.Internationalization Considerations

TODO

E.Index

E.1Terms defined by this specification

E.2Terms defined by reference

[CURIE] defines the following:
- CURIE
[I18N-GLOSSARY] defines the following:
- base direction
- surrogate code points
- Unicode code points
- Unicode scalar values
[RDF12-CONCEPTS] defines the following:
- base direction
- blank node
- blank node identifier
- datatype IRI
- equal as blank nodes
- equal as literal terms
- equal as triple terms
- graph name
- ill-typed
- IRI
- IRI
- IRIs
- language tag
- lexical form
- lexical space
- literal
- object type
- part of SPARQL and RDF concrete serializations
- predicate
- RDF Dataset
- RDF string
- RDF term
- RDF terms
- RDF triple
- same as IRIs
- same RDF term
- simple literal
- subject
- triple term
[RDF12-SEMANTICS] defines the following:
- lean
- RDF collections
- RDF instance mapping
- RDF interpolation lemma
- RDF merge
[RDF12-XML] defines the following:
- RDF/XML
[SPARQL11-PROTOCOL] defines the following:
- specified in a SPARQL protocol request
[SPARQL11-QUERY] defines the following:
- solution mapping
[SPARQL11-UPDATE] defines the following:
- SPARQL update request
[VCARD-RDF] defines the following:
- vCard vocabulary
[WEBARCH] defines the following:
- further details
[XML-NAMES11] defines the following:
- XML local names
[XML11] defines the following:
- Notation
[XMLSCHEMA11-2] defines the following:
- atomic datatype
- XML Schema Regular Expressions
- xsd:boolean
- xsd:byte
- xsd:dateTime
- xsd:decimal
- xsd:double
- xsd:float
- xsd:int
- xsd:integer
- xsd:long
- xsd:negativeInteger
- xsd:nonNegativeInteger
- xsd:nonPositiveInteger
- xsd:positiveInteger
- xsd:short
- xsd:string
- xsd:unsignedByte
- xsd:unsignedInt
- xsd:unsignedLong
- xsd:unsignedShort
[XPATH-31] defines the following:
- numeric type promotions
- string value
- subtype substitution
- typed value
- XPath Operator Mapping
[XPATH-DATAMODEL-31] defines the following:
- XDM
[XPATH-FUNCTIONS-31] defines the following:
- 19.1 Casting from primitive types to primitive types
- defined by XPath
- defined by XQuery and XPath Functions and Operators
- fn:boolean
- fn:compare
- fn:concat
- fn:contains
- fn:day-from-dateTime
- fn:encode-for-uri
- fn:ends-with
- fn:hours-from-dateTime
- fn:lower-case
- fn:matches
- fn:minutes-from-dateTime
- fn:month-from-dateTime
- fn:not
- fn:numeric-abs
- fn:numeric-ceil
- fn:numeric-floor
- fn:numeric-round
- fn:replace
- fn:seconds-from-dateTime
- fn:starts-with
- fn:string-length
- fn:substring
- fn:substring-after
- fn:substring-before
- fn:timezone-from-dateTime
- fn:upper-case
- fn:year-from-dateTime
- op:boolean-equal
- op:boolean-greater-than
- op:boolean-less-than
- op:dateTime-equal
- op:dateTime-greater-than
- op:dateTime-less-than
- op:numeric-add
- op:numeric-divide
- op:numeric-equal
- op:numeric-greater-than
- op:numeric-less-than
- op:numeric-multiply
- op:numeric-subtract
- op:numeric-unary-minus
- op:numeric-unary-plus
[XQUERY-31] defines the following:
- 2.3.1, Kinds of Errors
- Effective Boolean Value
- Expression Processing
- Operator Mapping

F.References

F.1Normative references

[BCP47]: Tags for Identifying Languages. A. Phillips, Ed.; M. Davis, Ed. IETF. September 2009. Best Current Practice. URL:https://www.rfc-editor.org/rfc/rfc5646
[CBD]: CBD - Concise Bounded Description. Patrick Stickler, Nokia. W3C. 3 June 2005. W3C Member Submission. URL:https://www.w3.org/Submission/CBD/
[CURIE]: CURIE Syntax 1.0. Mark Birbeck; Shane McCarron. W3C. 16 December 2010. W3C Working Group Note. URL:https://www.w3.org/TR/curie/
[I18N-GLOSSARY]: Internationalization Glossary. Richard Ishida; Addison Phillips. W3C. 17 October 2024. W3C Working Group Note. URL:https://www.w3.org/TR/i18n-glossary/
[RDF-DAWG-UC]: RDF Data Access Use Cases and Requirements. Kendall Clark. W3C. 25 March 2005. W3C Working Draft. URL:https://www.w3.org/TR/rdf-dawg-uc/
[RDF-SPARQL-XMLRES]: SPARQL Query Results XML Format (Second Edition). Dave Beckett; Jeen Broekstra. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/rdf-sparql-XMLres/
[RDF12-CONCEPTS]: RDF 1.2 Concepts and Abstract Syntax. Olaf Hartig; Pierre-Antoine Champin; Gregg Kellogg; Andy Seaborne. W3C. 3 July 2025. W3C Working Draft. URL:https://www.w3.org/TR/rdf12-concepts/
[RDF12-SEMANTICS]: RDF 1.2 Semantics. Peter Patel-Schneider; Dörthe Arndt; Enrico Franconi. W3C. 27 June 2025. W3C Working Draft. URL:https://www.w3.org/TR/rdf12-semantics/
[RDF12-XML]: RDF 1.2 XML Syntax. Gregg Kellogg. W3C. 26 June 2025. W3C Working Draft. URL:https://www.w3.org/TR/rdf12-xml/
[RFC2119]: Key words for use in RFCs to Indicate Requirement Levels. S. Bradner. IETF. March 1997. Best Current Practice. URL:https://www.rfc-editor.org/rfc/rfc2119
[RFC3629]: UTF-8, a transformation format of ISO 10646. F. Yergeau. IETF. November 2003. Internet Standard. URL:https://www.rfc-editor.org/rfc/rfc3629
[RFC3986]: Uniform Resource Identifier (URI): Generic Syntax. T. Berners-Lee; R. Fielding; L. Masinter. IETF. January 2005. Internet Standard. URL:https://www.rfc-editor.org/rfc/rfc3986
[RFC3987]: Internationalized Resource Identifiers (IRIs). M. Duerst; M. Suignard. IETF. January 2005. Proposed Standard. URL:https://www.rfc-editor.org/rfc/rfc3987
[RFC4122]: A Universally Unique IDentifier (UUID) URN Namespace. P. Leach; M. Mealling; R. Salz. IETF. July 2005. Proposed Standard. URL:https://www.rfc-editor.org/rfc/rfc4122
[RFC4647]: Matching of Language Tags. A. Phillips, Ed.; M. Davis, Ed. IETF. September 2006. Best Current Practice. URL:https://www.rfc-editor.org/rfc/rfc4647
[RFC8174]: Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words. B. Leiba. IETF. May 2017. Best Current Practice. URL:https://www.rfc-editor.org/rfc/rfc8174
[SPARQL-FEATURES]: SPARQL New Features and Rationale. Kjetil Kjernsmo; Alexandre Passant. W3C. 2 July 2009. W3C Working Draft. URL:https://www.w3.org/TR/sparql-features/
[SPARQL11-ENTAILMENT]: SPARQL 1.1 Entailment Regimes. Birte Glimm; Chimezie Ogbuji. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/sparql11-entailment/
[SPARQL11-FEDERATED-QUERY]: SPARQL 1.1 Federated Query. Eric Prud'hommeaux; Carlos Buil Aranda. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/sparql11-federated-query/
[SPARQL11-PROTOCOL]: SPARQL 1.1 Protocol. Lee Feigenbaum; Gregory Williams; Kendall Clark; Elias Torres. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/sparql11-protocol/
[SPARQL11-QUERY]: SPARQL 1.1 Query Language. Steven Harris; Andy Seaborne. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/sparql11-query/
[SPARQL11-RESULTS-CSV-TSV]: SPARQL 1.1 Query Results CSV and TSV Formats. Andy Seaborne. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/sparql11-results-csv-tsv/
[SPARQL11-RESULTS-JSON]: SPARQL 1.1 Query Results JSON Format. Andy Seaborne. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/sparql11-results-json/
[SPARQL11-UPDATE]: SPARQL 1.1 Update. Paula Gearon; Alexandre Passant; Axel Polleres. W3C. 21 March 2013. W3C Recommendation. URL:https://www.w3.org/TR/sparql11-update/
[TURTLE]: RDF 1.1 Turtle. Eric Prud'hommeaux; Gavin Carothers. W3C. 25 February 2014. W3C Recommendation. URL:https://www.w3.org/TR/turtle/
[UAX31]: Unicode Identifiers and Syntax. Mark Davis; Robin Leroy. Unicode Consortium. 2 September 2024. Unicode Standard Annex #31. URL:https://www.unicode.org/reports/tr31/tr31-41.html
[UNICODE]: The Unicode Standard. Unicode Consortium. URL:https://www.unicode.org/versions/latest/
[UTR36]: Unicode Security Considerations. Mark Davis; Michel Suignard. Unicode Consortium. 19 September 2014. Unicode Technical Report #36. URL:https://www.unicode.org/reports/tr36/tr36-15.html
[vcard-rdf]: vCard Ontology - for describing People and Organizations. Renato Iannella; James McKinney. W3C. 22 May 2014. W3C Working Group Note. URL:https://www.w3.org/TR/vcard-rdf/
[WEBARCH]: Architecture of the World Wide Web, Volume One. Ian Jacobs; Norman Walsh. W3C. 15 December 2004. W3C Recommendation. URL:https://www.w3.org/TR/webarch/
[XML-NAMES11]: Namespaces in XML 1.1 (Second Edition). Tim Bray; Dave Hollander; Andrew Layman; Richard Tobin et al. W3C. 16 August 2006. W3C Recommendation. URL:https://www.w3.org/TR/xml-names11/
[XML11]: Extensible Markup Language (XML) 1.1 (Second Edition). Tim Bray; Jean Paoli; Michael Sperberg-McQueen; Eve Maler; François Yergeau; John Cowan et al. W3C. 16 August 2006. W3C Recommendation. URL:https://www.w3.org/TR/xml11/
[XMLSCHEMA11-2]: W3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes. David Peterson; Sandy Gao; Ashok Malhotra; Michael Sperberg-McQueen; Henry Thompson; Paul V. Biron et al. W3C. 5 April 2012. W3C Recommendation. URL:https://www.w3.org/TR/xmlschema11-2/
[XPATH-31]: XML Path Language (XPath) 3.1. Jonathan Robie; Michael Dyck; Josh Spiegel. W3C. 21 March 2017. W3C Recommendation. URL:https://www.w3.org/TR/xpath-31/
[XPATH-DATAMODEL-31]: XQuery and XPath Data Model 3.1. Norman Walsh; John Snelson; Andrew Coleman. W3C. 21 March 2017. W3C Recommendation. URL:https://www.w3.org/TR/xpath-datamodel-31/
[XPATH-FUNCTIONS-31]: XPath and XQuery Functions and Operators 3.1. Michael Kay. W3C. 21 March 2017. W3C Recommendation. URL:https://www.w3.org/TR/xpath-functions-31/
[XQUERY-31]: XQuery 3.1: An XML Query Language. Jonathan Robie; Michael Dyck; Josh Spiegel. W3C. 21 March 2017. W3C Recommendation. URL:https://www.w3.org/TR/xquery-31/

F.2Informative references

[SPARQL12-CONCEPTS]: SPARQL 1.2 Concepts. The W3C RDF & SPARQL Working Group. W3C. W3C Working Draft. URL:https://w3c.github.io/sparql-concepts/spec/
[SPARQL12-ENTAILMENT]: SPARQL 1.2 Entailment Regimes. Peter Patel-Schneider. W3C. 19 December 2024. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-entailment/
[SPARQL12-FEDERATED-QUERY]: SPARQL 1.2 Federated Query. Ruben Taelman; Gregory Williams. W3C. 5 May 2025. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-federated-query/
[SPARQL12-GRAPH-STORE-PROTOCOL]: SPARQL 1.2 Graph Store Protocol. Andy Seaborne; Thomas Pellissier Tanon. W3C. 19 December 2024. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-graph-store-protocol/
[SPARQL12-NEW]: What’s New in SPARQL 1.2. The W3C RDF & SPARQL Working Group. W3C. W3C Working Draft. URL:https://w3c.github.io/sparql-new/spec/
[SPARQL12-PROTOCOL]: SPARQL 1.2 Protocol. Andy Seaborne; Ruben Taelman; Gregory Williams; Thomas Pellissier Tanon. W3C. 24 June 2025. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-protocol/
[SPARQL12-RESULTS-CSV-TSV]: SPARQL 1.2 Query Results CSV and TSV Formats. Ruben Taelman; Gregory Williams; Thomas Pellissier Tanon. W3C. 19 December 2024. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-results-csv-tsv/
[SPARQL12-RESULTS-JSON]: SPARQL 1.2 Query Results JSON Format. Andy Seaborne; Ruben Taelman; Gregory Williams; Thomas Pellissier Tanon. W3C. 8 May 2025. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-results-json/
[SPARQL12-RESULTS-XML]: SPARQL 1.2 Query Results XML Format. Ruben Taelman; Dominik Tomaszuk; Thomas Pellissier Tanon. W3C. 27 December 2024. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-results-xml/
[SPARQL12-SERVICE-DESCRIPTION]: SPARQL 1.2 Service Description. Ruben Taelman; Gregory Williams. W3C. 19 December 2024. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-service-description/
[SPARQL12-UPDATE]: SPARQL 1.2 Update. Ruben Taelman; Andy Seaborne; Thomas Pellissier Tanon. W3C. 6 February 2025. W3C Working Draft. URL:https://www.w3.org/TR/sparql12-update/