Part of the book series:Lecture Notes in Computer Science ((LNISA,volume 9366))
Included in the following conference series:
2650Accesses
Abstract
Making available and archiving scientific results is for the most part still considered the task of classical publishing companies, despite the fact that classical forms of publishing centered around printed narrative articles no longer seem well-suited in the digital age. In particular, there exist currently no efficient, reliable, and agreed-upon methods for publishing scientific datasets, which have become increasingly important for science. Here we propose to design scientific data publishing as a Web-based bottom-up process, without top-down control of central authorities such as publishing companies. Based on a novel combination of existing concepts and technologies, we present a server network to decentrally store and archive data in the form of nanopublications, an RDF-based format to represent scientific data. We show how this approach allows researchers to publish, retrieve, verify, and recombine datasets of nanopublications in a reliable and trustworthy manner, and we argue that this architecture could be used for the Semantic Web in general. Evaluation of the current small network shows that this system is efficient and reliable.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Belhajjame, K., Corcho, O., Garijo, D., Zhao, J., Missier, P., Newman, D., Palma, R., Bechhofer, S., Garcıa, E., Cuesta, J.M.G.-P., et al.: Workflow-centric research objects: first class citizens in scholarly discourse. In: Proceedings of SePublica 2012. CEUR-WS (2012)
Berners-Lee, T.: Linked data – design issues (2006).http://www.w3.org/DesignIssues/LinkedData.html
Buil-Aranda, C., Hogan, A., Umbrich, J., Vandenbussche, P.-Y.: SPARQL Web-Querying Infrastructure: Ready for Action? In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 277–293. Springer, Heidelberg (2013)
Carroll, J., Bizer, C., Hayes, P., Stickler, P.: Named graphs, provenance and trust. In: Proceedings of WWW 2005, pp. 613–622. ACM (2005)
Chichester, C., Gaudet, P., Karch, O., Groth, P., Lane, L., Bairoch, A., Mons, B., Loizou, A.: Querying nextprot nanopublications and their value for insights on sequence variants and tissue expression. Web Semantics: Science, Services and Agents on the World Wide Web (2014)
Chichester, C., Karch, O., Gaudet, P., Lane, L., Mons, B., Bairoch, A.: Converting neXtProt into linked data and nanopublications. Semantic Web (2014, to appear)
Clarke, I., Sandberg, O., Wiley, B., Hong, T.W.: Freenet: a distributed anonymous information storage and retrieval system. In: Federrath, H. (ed.) Designing Privacy Enhancing Technologies. LNCS, vol. 2009, p. 46. Springer, Heidelberg (2001)
Cohen, J.P., Lo, H.Z.: Academic torrents: a community-maintained distributed repository. In: Proceedings of XSEDE 2014, p. 2. ACM (2014)
Filali, I., Bongiovanni, F., Huet, F., Baude, F.: A survey of structured P2P systems for RDF data storage and retrieval. In: Transactions on Large-Scale Data- and Knowledge-Centered Systems III, pp. 20–55. Springer (2011)
Fu, K., Kaashoek, M.F., Mazières, D.: Fast and secure distributed read-only file system. ACM Transactions on Computer Systems20(1), 1–24 (2002)
Groth, P., Gibson, A., Velterop, J.: The anatomy of a nano-publication. Information Services and Use30(1), 51–56 (2010)
Jacobson, V., Smetters, D.K., Thornton, J.D., Plass, M., Briggs, N., Braynard, R.: Networking named content. Commun. ACM55(1), 117–124 (2012)
Kuhn, T.: Science bots: a model for the future of scientific computation? In: WWW 2015 Companion Proceedings, pp. 1061–1062. ACM (2015)
Kuhn, T., Barbano, P.E., Nagy, M.L., Krauthammer, M.: Broadening the scope of nanopublications. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 487–501. Springer, Heidelberg (2013)
Kuhn, T., Dumontier, M.: Trusty URIs: verifiable, immutable, and permanent digital artifacts for linked data. In: Presutti, V., d’Amato, C., Gandon, F., d’Aquin, M., Staab, S., Tordai, A. (eds.) ESWC 2014. LNCS, vol. 8465, pp. 395–410. Springer, Heidelberg (2014)
Kuhn, T., Dumontier, M.: Making digital artifacts on the web verifiable and reliable. IEEE Transactions on Knowledge and Data Engineering27(9) (2015)
Ladwig, G., Harth, A.: CumulusRDF: linked data management on nested key-value stores. In: Proceedings of SSWS 2011 (2011)
Markman, C., Zavras, C.: BitTorrent and libraries: Cooperative data publishing, management and discovery. D-Lib Magazine20(3), 5 (2014)
McCusker, J.P., Lebo, T., Krauthammer, M., McGuinness, D.L.: Next generation cancer data discovery, access, and integration using prizms and nanopublications. In: Baker, C.J.O., Butler, G., Jurisica, I. (eds.) DILS 2013. LNCS, vol. 7970, pp. 105–112. Springer, Heidelberg (2013)
Miller, A., Juels, A., Shi, E., Parno, B., Katz, J.: Permacoin: repurposing Bitcoin work for data preservation. In: Proceedings of the IEEE Symposium on Security and Privacy (SP), pp. 475–490. IEEE (2014)
Mons, B., van Haagen, H., Chichester, C., den Dunnen, J.T., van Ommen, G., van Mulligen, E., Singh, B., Hooft, R., Roos, M., Hammond, J., et al.: The value of data. Nature genetics43(4), 281–283 (2011)
Paskin, N.: Digital object identifiers for scientific data. Data Science Journal4, 12–20 (2005)
Patrinos, G.P., Cooper, D.N., van Mulligen, E., Gkantouna, V., Tzimas, G., Tatum, Z., Schultes, E., Roos, M., Mons, B.: Microattribution and nanopublication as means to incentivize the placement of human genome variation data into the public domain. Human mutation33(11), 1503–1512 (2012)
Proell, S., Rauber, A.: A scalable framework for dynamic data citation of arbitrary structured data. In: 3rd International Conference on Data Management Technologies and Applications (DATA2014), 8 2014
Queralt-Rosinach, N., Kuhn, T., Chichester, C., Dumontier, M., Sanz, F., Furlong, L.I.: Publishing DisGeNET as nanopublications. Semantic Web – Interoperability, Usability, Applicability (2015, to appear)
Speicher, S., Arwe, J., Malhotra, A.: Linked data platform 1.0. Recommendation, W3C, February 26, 2015
Verborgh, R., Vander Sande, M., Colpaert, P., Coppens, S., Mannens, E., Van de Walle, R.: Web-scale querying through linked data fragments. In: Proceedings of LDOW 2014 (2014)
Williams, A.J., Harland, L., Groth, P., Pettifer, S., Chichester, C., Willighagen, E.L., Evelo, C.T., Blomberg, N., Ecker, G., Goble, C., et al.: Open PHACTS: semantic interoperability for drug discovery. Drug discovery today17(21), 1188–1198 (2012)
AIDA Nanopubs extracted from GeneRIF. Nanopublication index, 4 March 2015.http://np.inn.ac/RAY_lQruuagCYtAcKAPptkY7EpITwZeUilGHsWGm9ZWNI
Nanopubs converted from neXtProt protein data (preliminary). Nanopublication index, 10 March 2015.http://np.inn.ac/RAXFlG04YMi1A5su7oF6emA8mSp6HwyS3mFTVYreDeZRg
Nanopubs converted from OpenBEL’s Small and Large Corpus 1.0. Nanopublication index, 4 March 2015.http://np.inn.ac/RACy0I4f_wr62Ol7BhnD5EkJU6Glf-wp0oPbDbyve7P6o
Nanopubs converted from OpenBEL’s Small and Large Corpus 20131211. Nanopublication indexhttp://np.inn.ac/RAR5dwELYLKGSfrOclnWhjOj-2nGZN_8BW1JjxwFZINHw, 4 March 2015
Nanopubs extracted from DisGeNET v2.1.0.0. Nanopublication indexhttp://np.inn.ac/RAXy332hxqHPKpmvPc-wqJA7kgWiWa-QA0DIpr29LIG0Q, 5 March 2015
Author information
Authors and Affiliations
Department of Humanities, Social and Political Sciences, ETH Zurich, Zürich, Switzerland
Tobias Kuhn
Department of Computer Science, VU University Amsterdam, Amsterdam, The Netherlands
Tobias Kuhn
Swiss Institute of Bioinformatics, Geneva, Switzerland
Christine Chichester
Yale University School of Medicine, New Haven, CT, USA
Michael Krauthammer
Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA
Michel Dumontier
- Tobias Kuhn
You can also search for this author inPubMed Google Scholar
- Christine Chichester
You can also search for this author inPubMed Google Scholar
- Michael Krauthammer
You can also search for this author inPubMed Google Scholar
- Michel Dumontier
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toTobias Kuhn.
Editor information
Editors and Affiliations
Pontificia Universidad Católica de Chile, Santiago de Chile, Chile
Marcelo Arenas
Universidad Politecnica de Madrid, Boadilla del Monte, Spain
Oscar Corcho
University of Southampton, Southampton, United Kingdom
Elena Simperl
Department of Computational Social Science, GESIS Leibniz-Institut, Köln, Nordrhein-Westfalen, Germany
Markus Strohmaier
The Open University, Milton Keynes, United Kingdom
Mathieu d'Aquin
IBM Research, Yorktown Heights, New York, USA
Kavitha Srinivas
Elsevier Labs., Amsterdam, The Netherlands
Paul Groth
School of Medicine, Stanford University, Stanford, California, USA
Michel Dumontier
Lehigh University, Bethlehem, Pennsylvania, USA
Jeff Heflin
DAYTON, Ohio, USA
Krishnaprasad Thirunarayan
Wright State University, Dayton, Ohio, USA
Krishnaprasad Thirunarayan
University of Koblenz-Landau, Koblenz, Rheinland-Pfalz, Germany
Steffen Staab
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Kuhn, T., Chichester, C., Krauthammer, M., Dumontier, M. (2015). Publishing Without Publishers: A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. In: Arenas, M.,et al. The Semantic Web - ISWC 2015. ISWC 2015. Lecture Notes in Computer Science(), vol 9366. Springer, Cham. https://doi.org/10.1007/978-3-319-25007-6_38
Download citation
Published:
Publisher Name:Springer, Cham
Print ISBN:978-3-319-25006-9
Online ISBN:978-3-319-25007-6
eBook Packages:Computer ScienceComputer Science (R0)
Share this paper
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative