Movatterモバイル変換


[0]ホーム

URL:


Fariz Darari, profile picture
Uploaded byFariz Darari
PPTX, PDF119 views

Comparing Index Structures for Completeness Reasoning

The document discusses the challenges and techniques for completeness reasoning in knowledge graphs, particularly in the context of querying for movies written by Steven Spielberg. It highlights the issue of scalability when dealing with a large number of completeness statements and proposes indexing methods and filtering principles to improve performance. Experimental evaluations demonstrate the effectiveness of these techniques in optimizing completeness reasoning tasks.

Embed presentation

Download to read offline
Comparing Index Structuresfor Completeness ReasoningFariz Darari*, Werner Nutt†, Simon Razniewski‡*Universitas Indonesia, Indonesia†Free University of Bozen-Bolzano, Italy‡Max Planck Institute for Informatics, GermanyIWBIS 2018 - Jakarta, Indonesia
Imagine you are a Spielberg movie fan
Imagine you are a Spielberg movie fanwho happens to be a knowledge graph fan too
Now you want to query to the knowledge graph:Give movies written by Spielberg!Movies written by Spielberg?
The knowledge graph gives some answers..Movies written by Spielberg?AnswersPoltergeistET.....
The knowledge graph gives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete?
The knowledge graph gives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete? Maybe.Maybe yes,maybe no..
Imagine you are a Spielberg movie fanwho happens to be a knowledge graph fan too
Imagine you are a Spielberg movie fanwho happens to be a knowledge graph fan too*Darari, et al. Completeness Statements about RDF Data Sources and Their Use for Query Answering. ISWC 2013.*But now the knowledge graph has been augmented with completeness statements, as in:This knowledge graph is completefor all movies written by Spielberg
Now you want to query to the knowledge graph:Give movies written by Spielberg!Movies written by Spielberg?This knowledge graph is completefor all movies written by Spielberg
The knowledge graph gives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete?This knowledge graph is completefor all movies written by Spielberg
The knowledge graph gives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete? Yes!This knowledge graph is completefor all movies written by Spielberg
Completeness reasoning:Movies written by Spielberg?This knowledge graph is completefor all movies written by SpielbergChecking if completeness statements guarantee query completenessQuery: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statement:(?x, type, Movie), (?x, writtenBy, Spielberg)
Completeness reasoning:Movies written by Spielberg?This knowledge graph is completefor all movies written by SpielbergChecking if completeness statements guarantee query completenessQuery: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statement:(?x, type, Movie), (?x, writtenBy, Spielberg)Can the statement(s) cover all query components?In general, multiple completeness statements may be requiredto cover all components of the query!
Completeness reasoning scalability challenge:What if there were a million completeness statements?**We'd need this many because a large KG (= knowledge graph) naturallywould require many completeness statements!QueryX 1 million!
X 1 million!QueryUnoptimized completeness reasoningtakes minutes!Completeness reasoning scalability challenge:What if there were a million completeness statements?**We'd need this many because a large KG (= knowledge graph) naturallywould require many completeness statements!
X 1 million!QueryUnoptimized completeness reasoningtakes minutes!Completeness reasoning scalability challenge:What if there were a million completeness statements?*Can we do any better?*We'd need this many because a large KG (= knowledge graph) naturallywould require many completeness statements!
We propose techniquesfor scalable completeness reasoningOur contributions:1. Principle for filtering out irrelevant completeness statements2. Indexing methods for completeness statements3. Experimental evaluations ofnaive completeness reasoning vs. indexed completeness reasoning
Constant-relevance principleA statement contributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)
Constant-relevance principleA statement contributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, child
Constant-relevance principleA statement contributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants: type, Movie, writtenBy, SpielbergConstants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, child
Constant-relevance principleA statement contributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants: type, Movie, writtenBy, SpielbergConstants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, child
Constant-relevance principle: subset queryingA statement contributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants: type, Movie, writtenBy, SpielbergConstants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, childReduce the problem into:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
Index structure for subset querying:Standard hashing, inverted index, trieSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsHash indexes map keys to values: Set-equality queriesHow do we perform subset querying using hash indexes?
Index structure for subset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsHash indexes map keys to values: Set-equality queriesHow do we perform subset querying using hash indexes?By enumerating all possible non-empty subsets of query constants!Standard hashing, inverted index, trie
Index structure for subset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bStandard hashing, inverted index, trie
Index structure for subset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bM(a, b) = C1M(a, b, c) = C2, C3M(d) = C4Standard hashing, inverted index, trie
Index structure for subset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bM(a, b) = C1M(a, b, c) = C2, C3M(d) = C4Relevant completeness statements:M(a) U M(b) U M(a, b) = C1Standard hashing, inverted index, trie
Index structure for subset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
Index structure for subset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsStandard hashing, inverted index, trie
Index structure for subset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsBag(Q) = Inv(a) U Inv(b)= C1, C2, C3, C1, C2, C3Constant-relevant statements:Those statements appearing as many times in Bag(Q)as the statements' constantsStandard hashing, inverted index, trie
Index structure for subset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsBag(Q) = Inv(a) U Inv(b)= C1, C2, C3, C1, C2, C3Constant-relevant statements:Those statements appearing as many times in Bag(Q)as the statements' constantsStandard hashing, inverted index, trie
Index structure for subset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
Index structure for subset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
Index structure for subset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsTraverse the above trie usingquery constant sequence (a, b)
Index structure for subset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsTraverse the above trie usingquery constant sequence (a, b)visited node = constant-relevantStandard hashing, inverted index, trie
Experimental evaluation:SetupSynthetic dataset with realistic parametersfrom DBpedia, a large real-world knowledge graphParameters:- Number of completeness statements, default value: 1 million- Max length of completeness statements, default value: 6- Query length, two default values, short = 3 and long = 6
Experimental evaluation:Results – Varying number of statementsShort queries Long queries
Experimental evaluation:Results – Varying max length of statementsShort queries Long queries
Experimental evaluation:Results – Varying query length
Experimental evaluation:Results – Unoptimized vs Indexed Reasoning
Experimental evaluation:Results – Unoptimized vs Indexed ReasoningConclusion: We fast forward completeness reasoning.
Comparing Index Structures for Completeness Reasoning

Recommended

PDF
PPTX
Dissertation Defense - Managing and Consuming Completeness Information for RD...
PDF
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
PDF
Slides Workshopon Explainable Logic-Based Knowledge Representation (XLoKR 2020)
PPT
Slides
 
PDF
Looking for Invariant Operators in Argumentation
PDF
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
PDF
Zen and the Art of Python
PPT
lecture03-tolerant.pptFWGDEJIRTPOKJDHWGFTEO
PPT
lecture03-tolerant12345678876543234567.ppt
PDF
191CSEH IR UNIT - II for an engineering subject
PDF
Information Retrieval and Map-Reduce Implementations
PDF
Enabling Fine-grained RDF Data Completeness Assessment
PPTX
IRT Unit_ 2.pptx
PDF
Managing Completeness of Web Data
PPT
Query compiler
PPT
Information Retrieval
PDF
Poster - Completeness Statements about RDF Data Sources and Their Use for Qu...
PDF
Performance Evaluation of Query Processing Techniques in Information Retrieval
PDF
Fosdem 2013 petra selmer flexible querying of graph data
PPT
Computing FDs
PPTX
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
PPTX
unit -4MODELING AND RETRIEVAL EVALUATION
PDF
Scorer’s Diversity Phase 2.0: Presented by Mikhail Khludnev, Grid Dynamics Inc.
PDF
Search pitb
PDF
Query optimization to improve performance of the code execution
PDF
11.query optimization to improve performance of the code execution
PPTX
Once upon a time in Datatown ...
PPTX
Basic Python Programming: Part 01 and Part 02
PPTX
Python in 30 minutes!

More Related Content

PDF
PPTX
Dissertation Defense - Managing and Consuming Completeness Information for RD...
PDF
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
PDF
Slides Workshopon Explainable Logic-Based Knowledge Representation (XLoKR 2020)
PPT
Slides
 
PDF
Looking for Invariant Operators in Argumentation
PDF
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
PDF
Zen and the Art of Python
Dissertation Defense - Managing and Consuming Completeness Information for RD...
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
Slides Workshopon Explainable Logic-Based Knowledge Representation (XLoKR 2020)
Slides
 
Looking for Invariant Operators in Argumentation
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
Zen and the Art of Python

Similar to Comparing Index Structures for Completeness Reasoning

PPT
lecture03-tolerant.pptFWGDEJIRTPOKJDHWGFTEO
PPT
lecture03-tolerant12345678876543234567.ppt
PDF
191CSEH IR UNIT - II for an engineering subject
PDF
Information Retrieval and Map-Reduce Implementations
PDF
Enabling Fine-grained RDF Data Completeness Assessment
PPTX
IRT Unit_ 2.pptx
PDF
Managing Completeness of Web Data
PPT
Query compiler
PPT
Information Retrieval
PDF
Poster - Completeness Statements about RDF Data Sources and Their Use for Qu...
PDF
Performance Evaluation of Query Processing Techniques in Information Retrieval
PDF
Fosdem 2013 petra selmer flexible querying of graph data
PPT
Computing FDs
PPTX
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
PPTX
unit -4MODELING AND RETRIEVAL EVALUATION
PDF
Scorer’s Diversity Phase 2.0: Presented by Mikhail Khludnev, Grid Dynamics Inc.
PDF
Search pitb
PDF
Query optimization to improve performance of the code execution
PDF
11.query optimization to improve performance of the code execution
PPTX
Once upon a time in Datatown ...
lecture03-tolerant.pptFWGDEJIRTPOKJDHWGFTEO
lecture03-tolerant12345678876543234567.ppt
191CSEH IR UNIT - II for an engineering subject
Information Retrieval and Map-Reduce Implementations
Enabling Fine-grained RDF Data Completeness Assessment
IRT Unit_ 2.pptx
Managing Completeness of Web Data
Query compiler
Information Retrieval
Poster - Completeness Statements about RDF Data Sources and Their Use for Qu...
Performance Evaluation of Query Processing Techniques in Information Retrieval
Fosdem 2013 petra selmer flexible querying of graph data
Computing FDs
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
unit -4MODELING AND RETRIEVAL EVALUATION
Scorer’s Diversity Phase 2.0: Presented by Mikhail Khludnev, Grid Dynamics Inc.
Search pitb
Query optimization to improve performance of the code execution
11.query optimization to improve performance of the code execution
Once upon a time in Datatown ...

More from Fariz Darari

PPTX
Basic Python Programming: Part 01 and Part 02
PPTX
Python in 30 minutes!
PPTX
Testing in Python: doctest and unittest
PPTX
Neural Networks and Deep Learning: An Intro
PPTX
Research Writing - 2018.07.18
PDF
Defense Slides of Avicenna Wisesa - PROWD
PPTX
Artificial Neural Networks: Pointers
PDF
Data X Museum - Hari Museum Internasional 2022 - WMID
PPTX
KOI - Knowledge Of Incidents - SemEval 2018
PPTX
Free AI Kit - Game Theory
PPTX
AI in education done properly
PDF
[PUBLIC] quiz-01-midterm-solutions.pdf
PPTX
Testing in Python: doctest and unittest (Updated)
PPTX
Recursion in Python
PPTX
Foundations of Programming - Java OOP
PPTX
Supply and Demand - AI Talents
PPTX
NLP guest lecture: How to get text to confess what knowledge it has
PPTX
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
PPTX
Research Writing - Universitas Indonesia
PPTX
Open Tridharma at ICACSIS 2019
Basic Python Programming: Part 01 and Part 02
Python in 30 minutes!
Testing in Python: doctest and unittest
Neural Networks and Deep Learning: An Intro
Research Writing - 2018.07.18
Defense Slides of Avicenna Wisesa - PROWD
Artificial Neural Networks: Pointers
Data X Museum - Hari Museum Internasional 2022 - WMID
KOI - Knowledge Of Incidents - SemEval 2018
Free AI Kit - Game Theory
AI in education done properly
[PUBLIC] quiz-01-midterm-solutions.pdf
Testing in Python: doctest and unittest (Updated)
Recursion in Python
Foundations of Programming - Java OOP
Supply and Demand - AI Talents
NLP guest lecture: How to get text to confess what knowledge it has
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Research Writing - Universitas Indonesia
Open Tridharma at ICACSIS 2019

Recently uploaded

PDF
Optimizing DNS Performance in Kubernetes: Challenges and Best Practices
PPTX
Understanding Universal Acceptance (UA) and Technical Challenges
PPTX
ARCHITECTURESACGCHCIUOHCOHCSAKJCOQKCHUO.pptx
PDF
Cybrain Software Solutions – Building Future-Ready Digital Tools
PDF
A Day in the Life of IPv6 Scanning by Matsuzaki ʻmazʼ
PDF
Hybrid Mesh Firewall: Network firewall revolution
PDF
DNSSEC Implementation Journey at Prime Bank’s Domain
PDF
OFFENSIVE OPERATIONS : THE ANATOMY OF A NETWORK TAKEOVER
PDF
Automating ISP Networks Using Ansible and IPAM as a Source of Truth [SoT]
PDF
Ethereum Fusaka Upgrade Set For December 3: Everything you need to know | 3.0 TV
PPTX
BTCFi on Starknet,Troves.fi offers automated strategies for Bitcoin users on ...
PDF
A La Recherche Du Temps Perdu: In Search of the Cozy Web
PDF
DNSSEC Deployment for .BD SLDs by Abdul Awal
PDF
Call For Research Papers.....! International Journal of Ubiquitous Computing...
PPTX
Passive Presentation pasdskpasdasdasdasf
PPTX
STORY-NAMED-SHARKY when he was a kid growing up
PPTX
MEANING OF EMOJIS OF SOCIAL MEDIA - RUKUNDO Emmanuel..pptx
PPTX
AI Presentation it all about what is ai and how to implement in real life
PPTX
[HUN][Hackersuli] Tickets Please - Kerberos
PPTX
Facebook: How to Maximize for Everyday Business
Optimizing DNS Performance in Kubernetes: Challenges and Best Practices
Understanding Universal Acceptance (UA) and Technical Challenges
ARCHITECTURESACGCHCIUOHCOHCSAKJCOQKCHUO.pptx
Cybrain Software Solutions – Building Future-Ready Digital Tools
A Day in the Life of IPv6 Scanning by Matsuzaki ʻmazʼ
Hybrid Mesh Firewall: Network firewall revolution
DNSSEC Implementation Journey at Prime Bank’s Domain
OFFENSIVE OPERATIONS : THE ANATOMY OF A NETWORK TAKEOVER
Automating ISP Networks Using Ansible and IPAM as a Source of Truth [SoT]
Ethereum Fusaka Upgrade Set For December 3: Everything you need to know | 3.0 TV
BTCFi on Starknet,Troves.fi offers automated strategies for Bitcoin users on ...
A La Recherche Du Temps Perdu: In Search of the Cozy Web
DNSSEC Deployment for .BD SLDs by Abdul Awal
Call For Research Papers.....! International Journal of Ubiquitous Computing...
Passive Presentation pasdskpasdasdasdasf
STORY-NAMED-SHARKY when he was a kid growing up
MEANING OF EMOJIS OF SOCIAL MEDIA - RUKUNDO Emmanuel..pptx
AI Presentation it all about what is ai and how to implement in real life
[HUN][Hackersuli] Tickets Please - Kerberos
Facebook: How to Maximize for Everyday Business

Comparing Index Structures for Completeness Reasoning

  • 1.
    Comparing Index StructuresforCompleteness ReasoningFariz Darari*, Werner Nutt†, Simon Razniewski‡*Universitas Indonesia, Indonesia†Free University of Bozen-Bolzano, Italy‡Max Planck Institute for Informatics, GermanyIWBIS 2018 - Jakarta, Indonesia
  • 2.
    Imagine you area Spielberg movie fan
  • 3.
    Imagine you area Spielberg movie fanwho happens to be a knowledge graph fan too
  • 4.
    Now you wantto query to the knowledge graph:Give movies written by Spielberg!Movies written by Spielberg?
  • 5.
    The knowledge graphgives some answers..Movies written by Spielberg?AnswersPoltergeistET.....
  • 6.
    The knowledge graphgives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete?
  • 7.
    The knowledge graphgives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete? Maybe.Maybe yes,maybe no..
  • 8.
    Imagine you area Spielberg movie fanwho happens to be a knowledge graph fan too
  • 9.
    Imagine you area Spielberg movie fanwho happens to be a knowledge graph fan too*Darari, et al. Completeness Statements about RDF Data Sources and Their Use for Query Answering. ISWC 2013.*But now the knowledge graph has been augmented with completeness statements, as in:This knowledge graph is completefor all movies written by Spielberg
  • 10.
    Now you wantto query to the knowledge graph:Give movies written by Spielberg!Movies written by Spielberg?This knowledge graph is completefor all movies written by Spielberg
  • 11.
    The knowledge graphgives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete?This knowledge graph is completefor all movies written by Spielberg
  • 12.
    The knowledge graphgives some answers..Movies written by Spielberg?AnswersPoltergeistET.....But are these answers complete? Yes!This knowledge graph is completefor all movies written by Spielberg
  • 13.
    Completeness reasoning:Movies writtenby Spielberg?This knowledge graph is completefor all movies written by SpielbergChecking if completeness statements guarantee query completenessQuery: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statement:(?x, type, Movie), (?x, writtenBy, Spielberg)
  • 14.
    Completeness reasoning:Movies writtenby Spielberg?This knowledge graph is completefor all movies written by SpielbergChecking if completeness statements guarantee query completenessQuery: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statement:(?x, type, Movie), (?x, writtenBy, Spielberg)Can the statement(s) cover all query components?In general, multiple completeness statements may be requiredto cover all components of the query!
  • 15.
    Completeness reasoning scalabilitychallenge:What if there were a million completeness statements?**We'd need this many because a large KG (= knowledge graph) naturallywould require many completeness statements!QueryX 1 million!
  • 16.
    X 1 million!QueryUnoptimizedcompleteness reasoningtakes minutes!Completeness reasoning scalability challenge:What if there were a million completeness statements?**We'd need this many because a large KG (= knowledge graph) naturallywould require many completeness statements!
  • 17.
    X 1 million!QueryUnoptimizedcompleteness reasoningtakes minutes!Completeness reasoning scalability challenge:What if there were a million completeness statements?*Can we do any better?*We'd need this many because a large KG (= knowledge graph) naturallywould require many completeness statements!
  • 18.
    We propose techniquesforscalable completeness reasoningOur contributions:1. Principle for filtering out irrelevant completeness statements2. Indexing methods for completeness statements3. Experimental evaluations ofnaive completeness reasoning vs. indexed completeness reasoning
  • 19.
    Constant-relevance principleA statementcontributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)
  • 20.
    Constant-relevance principleA statementcontributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, child
  • 21.
    Constant-relevance principleA statementcontributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants: type, Movie, writtenBy, SpielbergConstants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, child
  • 22.
    Constant-relevance principleA statementcontributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants: type, Movie, writtenBy, SpielbergConstants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, child
  • 23.
    Constant-relevance principle: subsetqueryingA statement contributes to query completeness only ifits constants are among the query’sMovies written by Spielberg?Query: (?x, type, Movie), (?x, writtenBy, Spielberg)Completeness statements:(?x, type, Movie), (?x, writtenBy, Spielberg)(?x, type, Movie), (?x, writtenBy, Bob)(?x, type, Song, (?x, writtenBy, Mary)(Spielberg, child, ?y)Constants: type, Movie, writtenBy, SpielbergConstants:type, Movie, writtenBy, Spielbergtype, Movie, writtenBy, Bobtype, Song, writtenBy, MarySpielberg, childReduce the problem into:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
  • 24.
    Index structure forsubset querying:Standard hashing, inverted index, trieSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsHash indexes map keys to values: Set-equality queriesHow do we perform subset querying using hash indexes?
  • 25.
    Index structure forsubset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsHash indexes map keys to values: Set-equality queriesHow do we perform subset querying using hash indexes?By enumerating all possible non-empty subsets of query constants!Standard hashing, inverted index, trie
  • 26.
    Index structure forsubset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bStandard hashing, inverted index, trie
  • 27.
    Index structure forsubset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bM(a, b) = C1M(a, b, c) = C2, C3M(d) = C4Standard hashing, inverted index, trie
  • 28.
    Index structure forsubset querying:Subset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bM(a, b) = C1M(a, b, c) = C2, C3M(d) = C4Relevant completeness statements:M(a) U M(b) U M(a, b) = C1Standard hashing, inverted index, trie
  • 29.
    Index structure forsubset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
  • 30.
    Index structure forsubset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsStandard hashing, inverted index, trie
  • 31.
    Index structure forsubset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsBag(Q) = Inv(a) U Inv(b)= C1, C2, C3, C1, C2, C3Constant-relevant statements:Those statements appearing as many times in Bag(Q)as the statements' constantsStandard hashing, inverted index, trie
  • 32.
    Index structure forsubset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsBag(Q) = Inv(a) U Inv(b)= C1, C2, C3, C1, C2, C3Constant-relevant statements:Those statements appearing as many times in Bag(Q)as the statements' constantsStandard hashing, inverted index, trie
  • 33.
    Index structure forsubset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
  • 34.
    Index structure forsubset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constants
  • 35.
    Index structure forsubset querying:Standard hashing, inverted index, trieconst(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsTraverse the above trie usingquery constant sequence (a, b)
  • 36.
    Index structure forsubset querying:const(C1) = a, bconst(C2) = a, b, cconst(C3) = a, b, cconst(C4) = dconst(Q) = a, bSubset queryingRetrieve completeness statementswhose constants are subsets ofthe query's constantsTraverse the above trie usingquery constant sequence (a, b)visited node = constant-relevantStandard hashing, inverted index, trie
  • 37.
    Experimental evaluation:SetupSynthetic datasetwith realistic parametersfrom DBpedia, a large real-world knowledge graphParameters:- Number of completeness statements, default value: 1 million- Max length of completeness statements, default value: 6- Query length, two default values, short = 3 and long = 6
  • 38.
    Experimental evaluation:Results –Varying number of statementsShort queries Long queries
  • 39.
    Experimental evaluation:Results –Varying max length of statementsShort queries Long queries
  • 40.
  • 41.
    Experimental evaluation:Results –Unoptimized vs Indexed Reasoning
  • 42.
    Experimental evaluation:Results –Unoptimized vs Indexed ReasoningConclusion: We fast forward completeness reasoning.

Editor's Notes

  • #3 https://en.wikipedia.org/wiki/Steven_Spielberg
  • #4 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #5 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #6 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #7 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #8 Say you are at a restaurant, and you are always not sure of the menu you will be order, I can guarantee that the waiter won't be happy with thathttps://emojiisland.com/products/thinking-face-emoji-icon
  • #9 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #10 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #11 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #12 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #13 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #14 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #15 https://en.wikipedia.org/wiki/Steven_Spielberghttps://ontotext.com/knowledgehub/webinars/tagging-with-rich-knowledge-graphs/
  • #42 100000 times faster for short30000 times faster for long
  • #43 100000 times faster for short30000 times faster for long

[8]ページ先頭

©2009-2025 Movatter.jp