RDF, RDFa and RDFS Flashcards
(257 cards)
Why do we need a semantic web and ontologies?
- Web content is too diverse/scattered to be easily found, integrated and aggregated
- adding machine processable semantics improves search, query answering and knowledge aggregation
How are facts modelled?
As statements with the structure: Subject - Predicate - Object
What are Knowledge Graphs (KGs)?
They provide an intuitive, yet powerful way to combine complex data, background knowledge about a topic and common sense knowledge about the world.
- They model things that are obvious to humans but unknown/ambiguous to machines/software
Why is Computer Automation for activities like querying search results tricky?
How can this be solved?
Because the web doesn’t have semantics, thus computers/software doesn’t have the same knowledge of dependencies, constraints or context.
By annotating data in a way that software can understand it’s context (alone and within an interconnected network of data)
Which type of formats promote standardisation of the web?
HTML and XML
How can we provide better search query results to users?
By creating large networks of data points that are not siloed to human users but can be combined
How can we help a computer recognise information in a web page for example?
By embedding annotations such as “Page title”, “what the page is about” and “keynote speakers” into the html to reference concepts that the software can understand
What is the semantic web?
A web of partially connected/distributed vocabularies (ontologies) and knowledge bases
- facts are scattered in many locations, with the ability for related instances /ontology fragments asserted anywhere
- Scope and association is managed through namespaces
- references to ontologies and instances through URIs
What advantage does the semantic web offer?
Improves retrieval by using intelligent techniques to take advantage of machine-processable annotations (can use autonomous software agents to perform querying)
What does it mean that the semantic web is a dynamic knowledge source?
It’s typically:
- Inconsistent
- Incomplete (open-world assumption)
- Composed of several ontologies
- Generated by novices and machines as well as experts
What is the AAA principle of the semantic web?
What is a downfall of the AAA principle?
Anyone can say Anything about Anything
- any individual can assert data about some entity (foundation for RDF design)
It causes issues of maintenance, trustability and veracity
Why is it important to remember that information in the semantic web is distributed, networked, disparate and maybe inconsistent?
What is the open world assumption?
- The open world assumption means we can’t assume we have all the information in the network or that we know everything that has been asserted about one single topic.
- If a query doesn’t produce a result, it doesn’t mean the answer doesn’t exist. It implies it could exist but a meaningful link is yet to be established.
What is the issue with data represented in a tabular form?
- It has limited search ability, we can only search by row, column or cell. Each method loses necessary context producing problems.
How can XML be used as a data exchange mechanism?
Explain why XML is not the solution for providing semantics:
It consists of components:
Syntax: how to write the data
Data model: how to organise the data
Semantics: how to interpret the data
This is not the solution because there’s no unique way to express the same information. Tag names do not necessarily provide semantics.
- XML makes no commitment to domain specific ontological vocabulary or modelling
- it’s feasible for closed collaboration but not suitable for global use
What does RDF stand for and what is it?
Resource Description Framework is a language for describing metadata on the web (helps create the semantic web)
- it provides a graph data model that formally describes semantics
- models resource metadata on the web using subject-predicate-object triples
What is an RDF triple?
A triple defines the relationship or predicate between two entities (a subject and an object).
- It is a fact or a statement
Describe the RDF data model (and why this is advantageous)?
- Uses a graph (not a table)
- it is domain independent and can be distributed in a seamless way
Describe what semantics and syntax are:
- Semantics are a set of rules for how to interpret the data model. They describe interpretations of the data items with respect to the domain
- Syntax is a standardised exchange format
What is Schema.org an example of?
A specific format for semantics.
What is the role of models?
They are abstractions that help people understand a fragment of the world and assembles knowledge. They may hide some details and spotlight others.
What are the use cases of models?
- They help people communicate ideas
- They explain and make predictions
- They mediate among multiple viewpoints
- They represent commonalities
Describe the building blocks of RDF:
RDF graphs can be one statement or multiple. They consist of:
- Statements, resources and properties
Describe RDF statements
RDF Statements are subject-predicate-object triples.
They assert the properties of a resource, in the form: resource, property, value.
What are valid objects?
Objects can be resources or literals (atomic values e.g. strings, numbers)
- The value of a statement may be a resource, which means nested or linked data
- This means we can create complex tree and graph structures that slot together.