Chapter 10 (Final) Flashcards

1
Q

Big data includes:

A

large volumes of data with many different data types that are processed at very high speeds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

An organization that requires a graph database that is highly scalable would select the ________ database management system.

A

Neo4j

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The philosophical underpinnings of big data are based on schema on write.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

With HDFS it is less expensive to move the execution of computation to data than to move the:

A

data to computation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

JSON is commonly used in conjunction with the ‘document store’ NoSQL database model.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Transaction processing and management reporting tend to fit big data databases better than relational databases.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Economies of storage indicate data storage costs increase every year.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

According to your text, NoSQL stands for:

A

Not Only SQL.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When reporting and analysis organization of the data is determined when the data is used is called a(n):

A

schema on read.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Graph-oriented databases are designed to maintain information regarding the relationships between data items.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The target market for Hadoop is small to medium companies using local area networks.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

________ is an important scripting language to help reduce the complexity of MapReduce.

A

Pig

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Value (related to the five ‘v’s’ of big data) addresses the pursuit of a meaningful goal.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

NoSQL stands for ‘Not only SQL.’

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Big data allows for two different data types (text and numeric).

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Apache Cassandra is a wide-column NoSQL database management system.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

MongoDB databases are composed of:

A

collections

18
Q

________ is the most popular key-value store NoSQL database management system.

A

Redis

19
Q

The three ‘v’s’ commonly associated with big data include:

A

volume, variety, and velocity.

20
Q

Big data requires effectively processing:

A

many data types.

21
Q

The Hadoop Distributed File System (HDFS) is the foundation of a ________ infrastructure of Hadoop.

A

data management

22
Q

An organization using HDFS realizes that hardware failure is a(n):

A

norm

23
Q

NoSQL databases DO NOT support ACID (atomicity, consistency, isolation, and durability).

A

True

24
Q

NoSQL includes data storage and retrieval:

A

not based on the relational model.

25
Q

The NoSQL model that is specifically designed to maintain information regarding the relationships (often real-world instances of entities) between data items is called a:

A

graph-oriented database.

26
Q

Although volume, variety, and velocity are considered the initial three v dimensions, two additional Vs of big data were added and include:

A

veracity and value.

27
Q

Collect everything is a characteristic of a data lake.

A

True

28
Q

Neo4j is a wide-column NoSQL database management system developed by Oracle.

A

False

29
Q

________ includes concern about data quality issues.

A

Veracity

30
Q

Big data:

A

does not require a strictly defined data model.

31
Q

It is true that in an HDFS cluster the NameNode is the:

A

single master server.

32
Q

At a basic level, analytics refers to:

A

analysis and interpretation of data.

33
Q

The schema on write and schema on read are considered synonymous approaches.

A

False

34
Q

The original three ‘v’s’ attributed to big data include volume, variety, and velocity.

A

True

35
Q

The NoSQL model that incorporates ‘column families’ is called a:

A

wide-column store.

36
Q

The Hadoop framework consists of the ________ algorithm to solve large scale problems.

A

MapReduce

37
Q

NoSQL focuses on avoidance of replication and minimizing storage space.

A

False

38
Q

The primary use of Pig is to:

A

transform raw data into a format that is useful for analysis.

39
Q

Hive uses ________ to query data.

A

HiveQL

40
Q

HBASE is a wide-column store database that runs on top of HDFS (modeled after Google).

A

True

41
Q

The ‘schema on read’ approach often incorporates JSON or XML.

A

True