Part of the book series:Lecture Notes in Computer Science ((LNISA,volume 5318))
Included in the following conference series:
2860Accesses
Abstract
Efficient RDF data management is one of the cornerstones in realizing the Semantic Web vision. In the past, different RDF storage strategies have been proposed, ranging from simple triple stores to more advanced techniques like clustering or vertical partitioning on the predicates. We present an experimental comparison of existing storage strategies on top of the SP2Bench SPARQL performance benchmark suite and put the results into context by comparing them to a purely relational model of the benchmark scenario. We observe that (1) in terms of performance and scalability, a simple triple store built on top of a column-store DBMS is competitive to the vertically partitioned approach when choosing a physical (predicate, subject, object) sort order, (2) in our scenario with real-world queries, none of the approaches scales to documents containing tens of millions of RDF triples, and (3) none of the approaches can compete with a purely relational model. We conclude that future research is necessary to further bring forward RDF data management.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
W3C: Resource Description Framework (RDF),http://www.w3.org/RDF/
W3C: SPARQL Query Language,http://www.w3.org/TR/rdf-sparql-query/
Bizer, C., Cyganiak, R.: D2R Server – Publishing the DBLP Bibliography Database (2007),http://www4.wiwiss.fu-berlin.de/dblp/
Tauberer, J.: U.S. Census RDF Data,http://www.rdfabout.com/demo/census/
Alexaki, S., Christophides, V., Karvounarakis, G., Plexousakis, D.: On Storing Voluminous RDF Descriptions: The case of Web Portal Catalogs. In: WebDB (2001)
Broekstra, J., Kampman, A., van Harmelen, F.: Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, pp. 54–68. Springer, Heidelberg (2002)
Bonstrom, V., Hinze, A., Schweppe, H.: Storing RDF as a Graph. In: Web Congress, pp. 27–36 (2003)
Theoharis, Y., Christophides, V., Karvounarakis, G.: Benchmarking RDF Representations of RDF/S Stores. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 685–701. Springer, Heidelberg (2005)
Chong, E.I., Das, S., Eadon, G., Srinivasan, J.: An Efficient SQL-based RDF Querying Scheme. In: VLDB, pp. 1216–1227 (2005)
Wilkinson, K.: Jena Property Table Implementation. In: International Workshop on Scalable Semantic Web Knowledge Base, pp. 35–46 (2006)
Abadi, D.J., Marcus, A., Madden, S., Hollenbach, K.J.: Scalable Semantic Web Data Management Using Vertical Partitioning. In: VLDB, pp. 411–422 (2007)
Abadi, D.J., Marcus, A., Madden, S., Hollenbach, K.J.: Using the Barton libraries dataset as an RDF benchmark. Technical report, MIT-CSAIL-TR-2007-036, MIT (2007)
Schmidt, M., Hornung, T., Lausen, G., Pinkel, C.: SP2Bench: A SPARQL Performance Benchmark. Technical report, arXiv:0806.4627v1 cs.DB (2008)
Ley, M.: DBLP Database,http://www.informatik.uni-trier.de/~ley/db/
openRDF.org: Home of Sesame,http://www.openrdf.org/documentation.jsp
Sidirourgos, L., Goncalves, R., Kersten, M., Nes, N., Manegold, S.: Column-store Support for RDF Data Management: not all swans are white. In: VLDB (2008)
Bizer, C., Schultz, A.: The Berlin SPARQL Benchmark,http://www4.wiwiss.fu-berlin.de/bizer/BerlinSPARQLBenchmark/
Stonebraker, M., et al.: C-store: a Column-oriented DBMS. In: VLDB, pp. 553–564 (2005)
CWI Amsterdam: MonetDB,http://monetdb.cwi.nl/
Chebotko, A., Lu, S., Yamil, H.M., Fotouhi, F.: Semantics Preserving SPARQL-to-SQL Query Translation for Optional Graph Patterns. Technical report, TR-DB-052006-CLJF (2006)
Cyganiac, R.: A Relational Algebra for SPARQL. Technical report, HP Bristol
Harris, S.: SPARQL Query Processing with Conventional Relational Database Systems. In: SSWS (2005)
SourceForge: Jena2,http://jena.sourceforge.net/DB/index.html
Harris, S., Gibbins, N.: 3store: Efficient Bulk RDF Storage. In: PSSS (2003)
Author information
Authors and Affiliations
Freiburg University, Georges-Köhler-Allee 51, 79106, Freiburg, Germany
Michael Schmidt, Thomas Hornung, Norbert Küchlin & Georg Lausen
MTC Infomedia OHG, Kaiserstr. 26, 66121, Saarbrücken, Germany
Christoph Pinkel
- Michael Schmidt
You can also search for this author inPubMed Google Scholar
- Thomas Hornung
You can also search for this author inPubMed Google Scholar
- Norbert Küchlin
You can also search for this author inPubMed Google Scholar
- Georg Lausen
You can also search for this author inPubMed Google Scholar
- Christoph Pinkel
You can also search for this author inPubMed Google Scholar
Editor information
Editors and Affiliations
Department of Computer Science and Engineering, Wright State University, Colonel Glenn Way 3640, 454350001, Dayton, USA
Amit Sheth
Institut für Informatik, Universität Koblenz-Landau, Universitätsstr. 1, 56016, Koblenz, Germany
Steffen Staab
BBN Technologies, 48103, Ann Arbor, USA
Mike Dean
DoCoMo Communications Laboratories Europe GmbH, 80687, Munich, Germany
Massimo Paolucci
Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Street, S1 4DP, Sheffield, UK
Diana Maynard
CSEE Department, UMBC, 1000 Hilltop Circle, MD 21250, Baltimore, USA
Timothy Finin
Department of Computer Science and Engineering, Wright State University, 3640 Colonel Glenn Highway, OH 45435, Dayton, USA
Krishnaprasad Thirunarayan
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schmidt, M., Hornung, T., Küchlin, N., Lausen, G., Pinkel, C. (2008). An Experimental Comparison of RDF Data Management Approaches in a SPARQL Benchmark Scenario. In: Sheth, A.,et al. The Semantic Web - ISWC 2008. ISWC 2008. Lecture Notes in Computer Science, vol 5318. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88564-1_6
Download citation
Share this paper
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative