Ch 1 Big data Flashcards

1
Q

big data

A

The analysis, processing, and storage of large collections of data that frequently originate from disparate sources.

2
Q

datasets

A

Collections or groups of related data are generally referred to as datasets.

3
Q

data analysis and its goal

A

Data analysis is the process of examining data to find facts, relationships, patterns,
insights and/or trends. The overall goal of data analysis is to support better decision-making.

4
Q

Data analytics

A

Data analytics is a discipline that includes the management of the complete data lifecycle, which
encompasses collecting, cleansing, organizing, storing, analyzing and governing data.

5
Q

four general categories of analytics

A

• descriptive analytics
• diagnostic analytics
• predictive analytics
• prescriptive analytics

6
Q

Descriptive Analytics

A

Descriptive analytics are carried out to answer questions about events that have already
occurred. This form of analytics contextualizes data to generate information.
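
As a rough illustration of contextualizing past data into information, the sketch below totals sales per month. The records, the `sales_by_month` helper, and all figures are invented for this example and are not from the source material.

```python
from collections import defaultdict

# Hypothetical sales records as (month, amount) pairs; values are made up.
sales = [
    ("2024-01", 1200.0),
    ("2024-01", 800.0),
    ("2024-02", 950.0),
    ("2024-03", 400.0),
    ("2024-03", 650.0),
]


def sales_by_month(records):
    """Sum sales amounts per month: a descriptive summary of past events."""
    totals = defaultdict(float)
    for month, amount in records:
        totals[month] += amount
    return dict(totals)


monthly_totals = sales_by_month(sales)
```

A summary like this answers "what happened?" only; it says nothing about why it happened or what will happen next.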

7
Q

Diagnostic Analytics

A

Diagnostic analytics aim to determine the cause of a phenomenon that occurred in the past
using questions that focus on the reason behind the event. The goal of this type of
analytics is to determine what information is related to the phenomenon in order to enable
answering questions that seek to determine why something has occurred.

8
Q

Predictive Analytics

A

Predictive analytics are carried out in an attempt to determine the outcome of an event that
might occur in the future.

9
Q

Prescriptive Analytics

A

Prescriptive analytics build upon the results of predictive analytics by prescribing actions
that should be taken.

10
Q

Business Intelligence (BI)

A

BI enables an organization to gain insight into the performance of an enterprise by
analyzing data generated by its business processes and information systems.

11
Q

Key Performance Indicators (KPI)

A

A KPI is a metric that can be used to gauge success within a particular business context.
KPIs are linked with an enterprise’s overall strategic goals and objectives.

12
Q

Big Data Characteristics

A
  • volume
  • velocity
  • variety
  • veracity
  • value
13
Q

Velocity

A

From an enterprise’s point of view, the velocity of data translates into the amount of time it takes
for the data to be processed once it enters the enterprise’s perimeter.

14
Q

Variety

A

Data variety refers to the multiple formats and types of data that need to be supported by
Big Data solutions.

15
Q

Veracity

A

Veracity refers to the quality or fidelity of data. Data that enters Big Data environments
needs to be assessed for quality, which can lead to data processing activities to resolve
invalid data and remove noise.
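
One way to picture a quality-assessment step is a simple validity filter. The sketch below is only an assumption about what such a rule might look like: the readings, the `is_valid` and `clean` helpers, and the accepted range are all hypothetical.

```python
# Hypothetical sensor readings; None and out-of-range values stand in for invalid data.
readings = [21.5, None, 22.1, -999.0, 23.0, 1000.0]


def is_valid(value, low=-50.0, high=60.0):
    """Assumed validity rule: a reading must be present and lie within [low, high]."""
    return value is not None and low <= value <= high


def clean(values):
    """Keep only the readings that pass the validity rule."""
    return [v for v in values if is_valid(v)]


cleaned = clean(readings)
```

Real pipelines usually apply many such rules (type checks, deduplication, cross-field consistency); a single range check is just the smallest possible instance.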

16
Q

Noise in data

A

Noise is data that cannot be converted into information and thus has no value.

17
Q

signals in data

A

Signals are data that have value and lead to meaningful information.

18
Q

Value, and its dependencies

A

Value is defined as the usefulness of data for an enterprise. The value characteristic is
intuitively related to the veracity characteristic in that the higher the data fidelity, the more
value it holds for the business. Value is also dependent on how long data processing takes
because analytics results have a shelf-life.

19
Q

Human-generated data

A

Human-generated data is the result of human interaction with systems, such as online
services and digital devices.

20
Q

Machine-generated data

A

Machine-generated data is generated by software programs and hardware devices in
response to real-world events.

21
Q

Structured Data

A

Structured data conforms to a data model or schema and is often stored in tabular form, typically in a relational database.

22
Q

Unstructured Data

A

Data that does not conform to a data model or data schema is known as unstructured data.
It is estimated that unstructured data makes up 80% of the data within any given
enterprise.

23
Q

Semi-structured Data

A

Semi-structured data has a defined level of structure and consistency, but is not relational
in nature. Instead, semi-structured data is hierarchical or graph-based.
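
To make the hierarchical shape concrete, the sketch below parses a small JSON document (a common semi-structured format) and flattens it into table-like rows. The document contents and the `flatten_orders` helper are hypothetical and exist only for illustration.

```python
import json

# Hypothetical semi-structured record: a customer with nested orders and items.
doc = (
    '{"customer": "C-1", "orders": ['
    '{"id": 1, "items": ["pen", "ink"]}, '
    '{"id": 2, "items": ["paper"]}]}'
)


def flatten_orders(raw):
    """Turn the nested customer/orders/items hierarchy into flat (customer, order, item) rows."""
    record = json.loads(raw)
    rows = []
    for order in record["orders"]:
        for item in order["items"]:
            rows.append((record["customer"], order["id"], item))
    return rows


rows = flatten_orders(doc)
```

The nesting is consistent enough to walk programmatically, but it has to be flattened before it fits the row-and-column shape of structured data.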

24
Q

Metadata

A

Metadata provides information about a dataset’s characteristics and structure. This type of
data is mostly machine-generated and can be appended to data.

25
The first and most important step in any data analysis project is
The first and most important step in any data analysis project is to establish a clear goal: not a goal defined only by the data or the method, but one that makes sense to the business as a whole.
26
Descriptive analysis
A technique that allows you to view and measure your company and customer characteristics.
27
Customer Profile
A snapshot of exactly who is buying your products or services.
28
Market penetration analysis and wallet share analysis
Techniques for measuring the performance of your customer base in comparison with the performance of the overall market for your industry.
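The card does not give formulas, so the sketch below assumes the commonly used ratio definitions: market penetration as your customers divided by the total addressable market, and wallet share as a customer's spend with you divided by their total spend in the category. The function names and inputs are hypothetical.

```python
def market_penetration(customers, market_size):
    """Assumed definition: fraction of the total market that already buys from you."""
    if market_size <= 0:
        raise ValueError("market_size must be positive")
    return customers / market_size


def wallet_share(spend_with_company, total_category_spend):
    """Assumed definition: fraction of a customer's category spend captured by the company."""
    if total_category_spend <= 0:
        raise ValueError("total_category_spend must be positive")
    return spend_with_company / total_category_spend


# Illustrative numbers only: 250 customers in a market of 1,000 prospects,
# and a customer who spends 300 of their 1,200 category budget with the company.
penetration = market_penetration(250, 1000)
share = wallet_share(300.0, 1200.0)
```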
29
response model
A response model is typically the first type of target model that a company seeks to develop.
30
win-back model
A win-back model is used to invite former customers to reconsider their relationship to the business.
31
activation model
An activation model predicts whether a prospect will become a customer.
32
revenue model
A revenue model predicts the dollar amount of an expected sale.
33
usage model
A usage model predicts the amount of use given to a product or service.
34
cross-sell model
A cross-sell model is used to predict the probability or value of a current customer’s buying a different product or service from the same company.
35
up-sell model
An up-sell model predicts the probability or value of a customer’s buying more of the same product or service.
36
Among three drugs, which one provides the best results?
Prescriptive Analytics
37
When is the best time to trade a particular stock?
Prescriptive Analytics
38
What are the chances that a customer will default on a loan if they have missed a monthly payment?
Predictive Analytics
39
What will be the patient survival rate if Drug B is administered instead of Drug A?
Predictive Analytics
40
If a customer has purchased Products A and B, what are the chances that they will also purchase Product C?
Predictive Analytics
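A question like this can be approximated, very crudely, by a conditional frequency over past transactions. The sketch below is a simple counting estimate under invented data, not a full predictive model; the baskets and the `conditional_purchase_rate` helper are hypothetical.

```python
# Hypothetical purchase history: each set is the products one customer bought.
baskets = [
    {"A", "B", "C"},
    {"A", "B"},
    {"A", "B", "C"},
    {"A", "C"},
    {"B"},
    {"A", "B", "C", "D"},
]


def conditional_purchase_rate(transactions, given, target):
    """Estimate P(target | given) as the share of baskets containing `given` that also contain `target`."""
    matching = [t for t in transactions if given <= t]  # subset test
    if not matching:
        return None  # no evidence either way
    return sum(1 for t in matching if target in t) / len(matching)


rate = conditional_purchase_rate(baskets, {"A", "B"}, "C")
```

Here four baskets contain both A and B, and three of those also contain C, so the estimate is 0.75. With so few observations the figure is only illustrative.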
41
Why were Q2 sales less than Q1 sales?
Diagnostic Analytics
42
Why have there been more support calls originating from the Eastern region than from the Western region?
Diagnostic Analytics
43
Why was there an increase in patient re-admission rates over the past three months?
Diagnostic Analytics
44
What was the sales volume over the past 12 months?
Descriptive Analytics
45
What is the number of support calls received as categorized by severity and geographic location?
Descriptive Analytics
46
What is the monthly commission earned by each sales agent?
Descriptive Analytics