Chapter 10 (Final) Flashcards

(41 cards)

1
Q

Big data includes:

A

large volumes of data with many different data types that are processed at very high speeds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

An organization that requires a graph database that is highly scalable would select the ________ database management system.

A

Neo4j

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The philosophical underpinnings of big data are based on schema on write.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

With HDFS it is less expensive to move the execution of computation to data than to move the:

A

data to computation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

JSON is commonly used in conjunction with the ‘document store’ NoSQL database model.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Transaction processing and management reporting tend to fit big data databases better than relational databases.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Economies of storage indicate data storage costs increase every year.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

According to your text, NoSQL stands for:

A

Not Only SQL.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When reporting and analysis organization of the data is determined when the data is used is called a(n):

A

schema on read.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Graph-oriented databases are designed to maintain information regarding the relationships between data items.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The target market for Hadoop is small to medium companies using local area networks.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

________ is an important scripting language to help reduce the complexity of MapReduce.

A

Pig

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Value (related to the five ‘v’s’ of big data) addresses the pursuit of a meaningful goal.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

NoSQL stands for ‘Not only SQL.’

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Big data allows for two different data types (text and numeric).

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Apache Cassandra is a wide-column NoSQL database management system.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

MongoDB databases are composed of:

18
Q

________ is the most popular key-value store NoSQL database management system.

19
Q

The three ‘v’s’ commonly associated with big data include:

A

volume, variety, and velocity.

20
Q

Big data requires effectively processing:

A

many data types.

21
Q

The Hadoop Distributed File System (HDFS) is the foundation of a ________ infrastructure of Hadoop.

A

data management

22
Q

An organization using HDFS realizes that hardware failure is a(n):

23
Q

NoSQL databases DO NOT support ACID (atomicity, consistency, isolation, and durability).

24
Q

NoSQL includes data storage and retrieval:

A

not based on the relational model.

25
The NoSQL model that is specifically designed to maintain information regarding the relationships (often real-world instances of entities) between data items is called a:
graph-oriented database.
26
Although volume, variety, and velocity are considered the initial three v dimensions, two additional Vs of big data were added and include:
veracity and value.
27
Collect everything is a characteristic of a data lake.
True
28
Neo4j is a wide-column NoSQL database management system developed by Oracle.
False
29
________ includes concern about data quality issues.
Veracity
30
Big data:
does not require a strictly defined data model.
31
It is true that in an HDFS cluster the NameNode is the:
single master server.
32
At a basic level, analytics refers to:
analysis and interpretation of data.
33
The schema on write and schema on read are considered synonymous approaches.
False
34
The original three 'v's' attributed to big data include volume, variety, and velocity.
True
35
The NoSQL model that incorporates 'column families' is called a:
wide-column store.
36
The Hadoop framework consists of the ________ algorithm to solve large scale problems.
MapReduce
37
NoSQL focuses on avoidance of replication and minimizing storage space.
False
38
The primary use of Pig is to:
transform raw data into a format that is useful for analysis.
39
Hive uses ________ to query data.
HiveQL
40
HBASE is a wide-column store database that runs on top of HDFS (modeled after Google).
True
41
The 'schema on read' approach often incorporates JSON or XML.
True