Data Storage and Processing Flashcards
(130 cards)
What challenge do most existing IoT solutions face?
Most IoT solutions are tailored to specific verticals, leading to separate data silos, which makes it difficult to capture the full potential of IoT across multiple domains.
Why is handling IoT data from different domains challenging?
IoT data come from various structures, sources, and descriptions, making it complex to integrate and process them properly across different domains.
What is required to ensure interoperability of IoT devices?
IoT data must be stored in different network databases, shared among multiple nodes, analyzed by various tools, and interpreted by different machines to ensure interoperability.
What is the Semantic Web, and how does it help with IoT data?
The Semantic Web, or linked data web, provides reasoning engines and tools to analyze and link IoT data meaningfully across various domains.
What role does complex event processing play in IoT data analysis?
Complex event processing searches for dependencies and patterns in streaming IoT data, creating real-time insights to help businesses identify opportunities and threats early.
Why is a single server insufficient for handling IoT data?
IoT data are often too large for a single server or database to handle, requiring distributed processing approaches like MapReduce.
How does the MapReduce programming model help manage IoT data?
MapReduce distributes datasets across multiple databases to process the data separately and then recombines the results, making it possible to handle large volumes of structured and unstructured IoT data.
How did the web evolve from its initial phase to the Semantic Web (Web 3.0)?
The web started as a collection of documents linked to each other and gradually evolved into the Semantic Web, where documents and pieces of data are meaningfully connected.
What was unclear about the relationships between documents in the early phases of the web?
In the early phases, relationships between documents were unclear because they were not linked to specific pieces of data.
What does the Semantic Web enable for users and machines?
The Semantic Web provides meaningful links between data, allowing users (both humans and machines) to explore and understand connections between pieces of information.
What is linked data, and what does it create?
Linked data refers to semantically linking and integrating pieces of information across domains, creating a global web that connects data on topics like books, companies, and social media.
How do machines use linked data in the Semantic Web?
Machines can connect distributed data sources, process new data as they appear on the web, and produce integrated results, enhancing applications like data browsers and search engines.
What does a generic linked data browser allow users to do?
A generic linked data browser lets users browse a data source and travel along links to related sources, enhancing data exploration.
What capability do linked data search engines provide?
Linked data search engines allow expressive query capabilities over aggregated data by crawling the global web of linked data.
What is linked data?
Linked data refers to machine-readable, well-defined information published on the web that can be connected to external datasets from various sources.
What format is used in linked data technologies to connect information?
Linked data technologies use the Resource Description Framework (RDF) format to create a web of data by linking different things.
What kinds of data sources can linked data technologies connect?
Linked data can connect data sources ranging from geographically distributed database to heterogeneous systems that cannot interoperate at the data level.
Who specified the rules for publishing data as part of the global web of data?
Tim Berners-Lee, the inventor of the World Wide Web, specified the rules for publishing data as part of the global web of data.
What are the four linked data principles as specified by Tim Berners-Lee?
- Use Uniform Resource Identifiers (URIs) as names for things.
- Use HTTP URIs to help people look up the things’ names.
- Use RDF and SPARQL standards to provide useful data.
- Include links to other URIs to help people discover more things.
Which two fundamental web technologies are relied on by the first two linked data principles?
The first two linked data principles rely on Uniform Resource Identifiers (URIs) and Hypertext Transfer Protocol (HTTP).
How does RDF enhance linked data?
RDF supports a generic, graph-based data model that structures and links data describing things in the world, enhancing linked data.
What does the Resource Description Framework (RDF) syntax encode and represent?
RDF encodes and represents web resources and data in a structure known as triples.
What are the three components of an RDF triple?
- Subject: A resource identified by a URI.
- Predicate: A URI specifying the relationship between the subject and object.
- Object: A resource or literal (a basic string value) identified by a URI, related to the subject.
What does the predicate represent in an RDF triple?
The predicate specifies the relationship between the subject and the object, represented by a URI.