KG Quality Flashcards

Question 1

Q

What does ‘Garbage In, Garbage Out’ imply for KG quality?

Answer

A

A KG’s quality directly depends on input data; poor-quality inputs yield poor-quality graphs.

Question 2

Q

Why is KG quality multi-dimensional?

Answer

A

It involves multiple aspects (accuracy, completeness, consistency, etc.) and is use-case dependent.

Question 3

Q

How does KG quality differ from software quality?

Answer

A

Unlike software, KGs are not modular, they feed downstream systems that may be unknown to their builders, and they follow their own development processes.

Question 4

Q

Define ‘Accuracy’ in a KG.

Answer

A

Closeness of recorded values to true values, measured per statement as Boolean correctness or with a distance metric.

Question 5

Q

How can Accuracy be aggregated across a KG?

Answer

A

As the proportion of accurate triples or as a weighted average accuracy.
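A minimal Python sketch of the two aggregations named on this card. The function names and the idea of passing per-triple judgments as plain lists are assumptions for illustration; how each triple is judged, and where the weights come from (for example, usage frequency), is left open.

```python
from typing import Sequence


def proportion_accurate(correct: Sequence[bool]) -> float:
    """Share of triples judged correct (Boolean correctness per triple)."""
    if not correct:
        raise ValueError("no triples to evaluate")
    return sum(correct) / len(correct)


def weighted_accuracy(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted mean of per-triple accuracy scores, each assumed to lie in [0, 1]."""
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(s * w for s, w in zip(scores, weights)) / total
```

With equal weights the weighted average reduces to the plain mean of the scores, so the proportion is the special case where every score is 0 or 1.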

Question 6

Q

Define ‘Completeness’ in a KG.

Answer

A

Degree to which required knowledge is present, assessed at three levels: schema, property, and population completeness.

Question 7

Q

What is ‘Population Completeness’?

Answer

A

Extent to which all entities in the scoped domain are present in the KG.
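One way to turn this definition into a number, sketched in Python. It assumes a reference set of entity identifiers for the scoped domain is available and that KG entities use the same identifiers; the function name is hypothetical.

```python
from typing import Set


def population_completeness(kg_entities: Set[str], reference_entities: Set[str]) -> float:
    """Fraction of reference entities that also appear in the KG."""
    if not reference_entities:
        raise ValueError("reference set must not be empty")
    # Entities in the KG but outside the reference set do not raise the score.
    return len(kg_entities & reference_entities) / len(reference_entities)
```

For example, a KG holding two of four reference entities scores 0.5, no matter how many out-of-scope entities it also contains.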

Question 8

Q

Define ‘Consistency’ in a KG.

Answer

A

Absence of conflicting statements; measured by number of inconsistencies detected.
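As an illustration of counting inconsistencies, the Python sketch below covers only one kind of conflict: a subject with more than one value for a predicate that is assumed to be functional (at most one value per subject). The function name and the triple-as-tuple representation are assumptions; real KGs may need many other checks.

```python
from collections import defaultdict
from typing import Iterable, Set, Tuple

Triple = Tuple[str, str, str]


def count_functional_conflicts(triples: Iterable[Triple], functional_predicates: Set[str]) -> int:
    """Count (subject, predicate) pairs that carry conflicting values."""
    values = defaultdict(set)
    for s, p, o in triples:
        if p in functional_predicates:
            values[(s, p)].add(o)
    # A pair is inconsistent when it has more than one distinct object.
    return sum(1 for objects in values.values() if len(objects) > 1)
```

Predicates outside `functional_predicates` are ignored, so multiple `ex:knows` values for one subject would not count as a conflict.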

Question 9

Q

What are ‘Reference Values’?

Answer

A

Ground-truth facts from experts used to measure accuracy and population completeness.

Question 10

Q

What are ‘Competency Questions’?

Answer

A

Questions the KG must be able to answer, typically formalized as SPARQL queries with known results that serve as unit tests for KG capabilities.

Question 11

Q

What role does SHACL play in KG quality?

Answer

A

Defines shape constraints (cardinality, datatype, class) for automated quality validation.

Question 12

Q

Define ‘Syntactic Validity’.

Answer

A

Conformance to RDF syntax: well-formed triples, correct prefixes, literals, and serialization grammar.

Question 13

Q

Define ‘Timeliness’ for a KG.

Answer

A

Degree to which KG data is up-to-date, measured via timestamps, update latency, and volatility.

Question 14

Q

What is ‘Freshness’ in KG quality?

Answer

A

Coverage and recency of timestamped data and the age-to-volatility ratio.

Question 15

Q

Define ‘Conciseness’.

Answer

A

Avoidance of redundant or duplicate schema elements and data instances in the KG.

Question 16

Q

Name a metric for Conciseness.

Answer

A

Ratio of unique instances to total instances; ratio of unique predicates to total predicates in the schema.
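A minimal Python sketch of the instance-level ratio. It assumes each instance has already been mapped to a canonical key, so that two records for the same real-world entity compare equal; that mapping step and the function name are hypothetical.

```python
from typing import Sequence


def instance_conciseness(instance_keys: Sequence[str]) -> float:
    """Ratio of unique instances to total instances (1.0 means no duplicates)."""
    if not instance_keys:
        raise ValueError("no instances to evaluate")
    return len(set(instance_keys)) / len(instance_keys)
```

The same function could be applied to a list of schema predicates to get the schema-level ratio.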

Question 17

Q

Define ‘Understandability’.

Answer

A

Ease of human comprehension, supported by labels, comments, and readable IRI patterns.

Question 18

Q

What metric measures Understandability?

Answer

A

Proportion of classes/properties with rdfs:label and rdfs:comment annotations.
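The proportion could be computed roughly as follows in Python. The sketch assumes triples are available as (subject, predicate, object) tuples with full IRIs and counts a term as documented only when it has both annotations; the function name is an assumption.

```python
from typing import Dict, Iterable, Sequence, Set, Tuple

RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
RDFS_COMMENT = "http://www.w3.org/2000/01/rdf-schema#comment"


def annotation_coverage(terms: Sequence[str], triples: Iterable[Tuple[str, str, str]]) -> float:
    """Share of schema terms that have both rdfs:label and rdfs:comment."""
    if not terms:
        raise ValueError("no schema terms to evaluate")
    found: Dict[str, Set[str]] = {}
    for s, p, _ in triples:
        if p in (RDFS_LABEL, RDFS_COMMENT):
            found.setdefault(s, set()).add(p)
    required = {RDFS_LABEL, RDFS_COMMENT}
    documented = sum(1 for t in terms if found.get(t, set()) >= required)
    return documented / len(terms)
```

Requiring only one of the two annotations would be a looser variant; which one fits depends on the use case.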

Question 19

Q

Why is Human Judgment valid in KG quality?

Answer

A

Quality is subjective and context-dependent; expert judgment is a valid measure as long as its subjectivity is acknowledged.

Question 20

Q

Define ‘Availability’ in decentralized KGs.

Answer

A

Accessibility of KG data via endpoints, dumps, or dereferenceable URIs.

Question 21

Q

Define ‘Latency’ in KG performance.

Answer

A

Delay between query request and the start of the response from the KG service.

Question 22

Q

What is ‘Interlinking’ in Linked Data?

Answer

A

Degree of external links (e.g., owl:sameAs) per entity, indicating cross-KG connections.
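A rough Python sketch of an interlinking degree based on owl:sameAs. It assumes a link is external when its object IRI does not start with the KG's own namespace; the `local_prefix` parameter and the function name are hypothetical, and other link predicates are not counted.

```python
from typing import Iterable, Set, Tuple

OWL_SAMEAS = "http://www.w3.org/2002/07/owl#sameAs"


def sameas_links_per_entity(
    entities: Set[str],
    triples: Iterable[Tuple[str, str, str]],
    local_prefix: str,
) -> float:
    """Average number of external owl:sameAs links per entity."""
    if not entities:
        raise ValueError("no entities to evaluate")
    external = sum(
        1
        for s, p, o in triples
        if p == OWL_SAMEAS and s in entities and not o.startswith(local_prefix)
    )
    return external / len(entities)
```

An average hides skew: a few heavily linked entities can mask many with no external links, so the share of entities with at least one link may be worth reporting alongside it.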

Question 23

Q

How can interlinking affect KG quality?

Answer

A

Improves completeness but may introduce conciseness and consistency challenges when merging.

(23 cards)