Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Data Commons

From Wikipedia, the free encyclopedia
Knowledge repository integrating open datasets
Data Commons
Screenshot of a query in Data Commons
Results for a query in Data Commons
FounderRamanathan V. Guha
Key peoplePrem Ramaswami (Head of Data Commons)
ParentGoogle
URLdatacommons.org
LaunchedMay 2018; 7 years ago (2018-05)

Data Commons is an open-source platform[1] created byGoogle[2] that provides anopen knowledge graph, combining economic, scientific and other public datasets into a unified view.[3]Ramanathan V. Guha, a creator of web standards includingRDF,[4]RSS, andSchema.org,[5] founded the project,[6] which is now led by Prem Ramaswami.[7]

The Data Commons website was launched in May 2018 with an initial dataset consisting offact-checking data published inSchema.org "ClaimReview" format by several fact checkers from theInternational Fact-Checking Network.[8][9] Google has worked with partners such as theUnited Nations (UN) to populate the repository,[2] which also includes data from theUnited States Census, theWorld Bank, theUS Bureau of Labor Statistics,[10]Wikipedia, theNational Oceanic and Atmospheric Administration and theFederal Bureau of Investigation.[11]

The service expanded during 2019 to include anRDF-styleknowledge graph populated from a number of largely statistical open datasets. The service was announced to a wider audience in 2019.[12] In 2020 the service improved its coverage of non-US datasets, while also increasing its coverage ofbioinformatics andcoronavirus.[13] In 2023, the service relaunched with a natural-language front end powered by alarge language model.[2] It also launched as the back end to the UN data portal withSustainable Development Goals data.[14]

Features

[edit]

Data Commons places more emphasis on statistical data than is common forlinked data andknowledge graph initiatives. It includes geographical, demographic, weather and real estate data alongside other categories,[3] describing states, Congressional districts, and cities in the United States as well as biological specimens, power plants, and elements of thehuman genome via theEncyclopedia of DNA Elements (ENCODE) project.[11] It represents data assemantic triples each of which can have its own provenance.[3] It centers on the entity-oriented integration of statistical observations from a variety of public datasets. Although it supports a subset of the W3CSPARQL query language,[15] itsAPIs[16] also include tools — such as aPandas dataframe interface — oriented towards data science, statistics and data visualization.

Data Commons is integrative, meaning that it does not provide a hosting platform for different datasets, but rather attempts to consolidate much of the information provided by the datasets into a single data graph.

Technology

[edit]

Data Commons is built on agraph data-model. The graph can be accessed through a browser interface and several APIs,[3][11] and is expanded through loading data (typically CSV andMCF-based templates).[17] The graph can be accessed by natural language queries inGoogle Search.[18] The data vocabulary used to define the datacommons.org graph is based uponSchema.org.[3] In particular the Schema.org terms StatisticalPopulation[19] and Observation[20] were proposed to Schema.org to support datacommons-like use cases.[21]

Software from the project is available onGitHub underApache 2 license.[22]

References

[edit]
  1. ^"Custom Data Commons".Docs - Data Commons. Retrieved16 July 2024.
  2. ^abc"Data Commons is using AI to make the world's public data more accessible and helpful".Google. 13 September 2023. Retrieved16 July 2024.
  3. ^abcdeFensel, Dieter; Şimşek, Umutcan; Angele, Kevin; Huaman, Elwin; Kärle, Elias; Panasiuk, Oleksandra; Toma, Ioan; Umbrich, Jürgen; Wahler, Alexander (2020),"Introduction: What Is a Knowledge Graph?",Knowledge Graphs, Cham: Springer International Publishing, pp. 1–10,doi:10.1007/978-3-030-37439-6_1,ISBN 978-3-030-37438-9,S2CID 213620389, retrieved2020-10-16
  4. ^Guns, Raf (2013). "Tracing the origins of the semantic web".Journal of the American Society for Information Science and Technology.64 (10):2173–2181.doi:10.1002/asi.22907.hdl:10067/1111170151162165141.
  5. ^Funke, Daniel (7 December 2017)."This website helps you find related fact checks - and it was built by a 17-year-old".Poynter. Retrieved16 July 2024.
  6. ^Guha, Ramanathan V. (15 October 2020)."Data Commons, now accessible on Google Search".docs.datacommons.org. Retrieved2020-10-16.
  7. ^O'Donnell, James (12 September 2024)."Google's new tool lets large language models fact-check their responses".MIT Technology Review. Retrieved17 September 2024.
  8. ^"Fact Checks".datacommons.org. 29 March 2019. Retrieved14 October 2020.
  9. ^Jiang, Shan; Baumgartner, Simon; Ittycheriah, Abe; Yu, Cong (2020-04-20)."Factoring Fact-Checks: Structured Information Extraction from Fact-Checking Articles".Proceedings of the Web Conference 2020. WWW '20. Taipei Taiwan: ACM. pp. 1592–1603.doi:10.1145/3366423.3380231.ISBN 978-1-4503-7023-3.S2CID 215882520.
  10. ^Raghavan, Prabhakar (2020-10-15)."How AI is powering a more helpful Google".Google. Retrieved2020-10-16.
  11. ^abcSheth, Amit; Padhee, Swati; Gyrard, Amelie; Sheth, Amit (2019-07-01). "Knowledge Graphs and Knowledge Networks: The Story in Brief".IEEE Internet Computing.23 (4):67–75.arXiv:2003.03623.Bibcode:2019IIC....23d..67S.doi:10.1109/MIC.2019.2928449.ISSN 1089-7801.S2CID 204820800.
  12. ^Luong, Daphne; Chou, Charina (5 March 2019)."Doing our part to share open data responsibly".The Keyword. Retrieved14 October 2020.
  13. ^Ramasubramanian, Sowmya (21 September 2020)."Google's open source data to study impact of COVID-19".The Hindu. Retrieved14 October 2020.
  14. ^Manyika, James (19 September 2023)."Using data and AI to track progress toward the UN Global Goals".Google. Retrieved22 July 2024.
  15. ^"Query the Data Commons Knowledge Graph using SPARQL".datacommons.org. Retrieved14 October 2020.
  16. ^"Overview".datacommons.org. Retrieved14 October 2020.
  17. ^"Contributing to Data Commons – Adding datasets".datacommons.org. Data Commons. Archived fromthe original on 2020-09-19. Retrieved2020-10-14.
  18. ^Guha, Ramanathan V. (15 October 2020)."Data Commons, now accessible on Google Search".docs.datacommons.org. Retrieved2020-10-16.
  19. ^"StatisticalPopulation type at Schema.org".schema.org. Retrieved14 October 2020.
  20. ^"Observation type at Schema.org".schema.org. Retrieved14 October 2020.
  21. ^"Proposal for representing Aggregate Statistical Data".GitHub – Schema.org repository. 25 June 2019. Retrieved14 October 2020.
  22. ^"datacommons.org GitHub".GitHub.

External links

[edit]
a subsidiary ofAlphabet
Company
Divisions
Subsidiaries
Active
Defunct
Programs
Events
Infrastructure
People
Current
Former
Criticism
General
Incidents
Other
Software
A–C
D–N
O–Z
Operating systems
Machine learning models
Neural networks
Computer programs
Formats and codecs
Programming languages
Search algorithms
Domain names
Typefaces
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
Y
Hardware
Pixel
Smartphones
Smartwatches
Tablets
Laptops
Other
Nexus
Smartphones
Tablets
Other
Other
Advertising
Antitrust
Intellectual
property
Privacy
Other
Related
Concepts
Products
Android
Street View coverage
YouTube
Other
Documentaries
Books
Popular culture
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Data_Commons&oldid=1322884219"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp