Big Data Flashcards

1
Q

What are the three defining features of big data?

A

The three defining features of big data are Volume, Velocity, and Variety (the three Vs).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explain the volume aspect of big data.

A

Big data refers to a large amount of data that cannot fit on a conventional hard drive or server and requires storage over multiple servers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe the velocity aspect of big data.

A

Velocity refers to the rapid creation and modification of data in big data systems, requiring servers to respond quickly to changing data within milliseconds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the variety aspect of big data refer to?

A

Variety in big data refers to the diverse types of data it encompasses, ranging from binary files to multimedia files like photos and videos.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is big data’s lack of structure challenging?

A

Big data’s lack of structure makes it difficult to analyze because conventional databases require a row and column structure that big data doesn’t conform to.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can machine learning techniques be useful for analysing big data?

A

Machine learning techniques can be employed to identify patterns in big data and extract valuable insights from it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Provide examples of big data

A

Examples of big data include continuously monitored banking interactions and data from surveillance systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the fact-based model for representing big data?

A

The fact-based model represents big data by storing individual pieces of information as immutable facts with associated timestamps, allowing multiple values for the same attribute to be held over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does the fact-based model reduce the risk of data loss?

A

The immutability of facts in the fact-based model reduces the risk of data loss due to human error, as they cannot be accidentally overwritten.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the graph schema for representing big data?

A

Graph schema uses nodes and edges to represent entities and relationships between them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How are properties represented in a graph schema?

A

Properties of entities can be listed inside circles (nodes) or rectangles joined with dashed lines to indicate ownership.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly