M1 Flashcards

(36 cards)

1
Q

[ ] are fundamental components in various fields, providing tools for data interpretation and decision-making.

A

Statistical analysis and Modeling (SAM)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

[ ] primarily focuses on the collection, analysis, interpretation, presentation, and organization of data.

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

[ ] provides foundational tools for understanding data distributions, variability, and relationships through methods such as hypothesis testing and regression analysis.

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

[ ] encompasses a broader scope, integrating statistical methods with advanced computational techniques to derive insights from data.

A

Analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

[ ] emphasizes predictive modeling, data mining, and the application of algorithms to inform strategic decisions and optimize processes.

A

Analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

A [ ] is an interlinked set of activities that an organization performs to convert inputs to outputs that are valuable to a market.

A

value chain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Insights that prescribe direct and meaningful actions then [ ].

A

drive decision making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is the value of data to organizations and their market is realized?

A

When actions or decisions are implemented from them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the five main data sources?

A

(TransData - ContractSub - Surve - DataPool - Unstruct)
- Transactional Data
- Contractual, Subscription, or Account Data
- Surveys
- Data Poolers
- Unstructured Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

This type of data source consists of structured, detailed information capturing the key characteristics of a transaction.

A

Transaction Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

This type of data source includes information about the type of product combined with customer characteristics.

A

Contractual, Subscription, or Account Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

This type of data source are questionnaires aimed at extracting sociodemographic and behavioral data from a particular group of people.

A

Surveys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

These are companies that gather data in particular settings or for particular purposes and sell them to interested customers looking to enrich or extend their data sources.

A

Data Poolers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

This refers to information that does not reside in a traditional row-column database in the world of big data.

A

Unstructured Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the phases of Data Analytics?

A

(Bu - Du - Dp - M - E - D)
- Business Understanding
- Data Understanding
- Data Preparation
- Modelling
- Evaluation
- Deployment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

[ ] is knowing what the study is for or identifying a business task.

A

Business Understanding

17
Q

[ ] is when you select the related data from many available databases to correctly describe a given business task; identifying relevant data for the problem description.

A

Data Understanding

18
Q

[ ] is also known as data preprocessing.

A

Data Preparation

19
Q

[ ] is to filter, aggregate, and fill-in (impute) missing values.

A

Data Preparation

20
Q

[ ] uses mathematical formulations to convert different measurements into a unified numerical scale.

A

Data Transformation

21
Q

Transforming numerical to numerical scales [ ].

A

shrinks or enlarges the data

22
Q

Transforming categorical to numerical values can be [ ].

A

ordinal (less, moderate, strong) or nominal (red, yellow, blue).

23
Q

What are the two major categories of modeling?

A
  • Predictive Modeling
  • Descriptive Modeling
24
Q

[ ] predicts the value of an attribute based on the values of other attributes.

A

Predictive Modeling

25
[ ] derives patters that summarizes the underlying relationships in the data.
Descriptive Modeling
26
[ ] summarizes the general characteristics or features of a target class of data.
Characterization
27
What are the methods of Characterization and Discrimination?
- data summaries based on stat measures and plots. - user-controlled data summarization using OLAP, EXCEL, Spreadsheet, SQL, Python, etc.
28
What are the outputs of Characterization and Discrimination?
- pie charts, bar charts, curves, crosstabs - Characteristic Rules
29
[ ] compares the general features of the target class data objects against the general features of objects from one or multiple contrasting classes.
Discrimination
30
[ ] is the process of finding a model or function that describes and distinguishes classes or concepts.
Classification
31
[ ] is a statistical methodology that is most often used for numeric prediction.
Regression Analysis
32
[ ] is when objects are grouped based on the principle of maximizing the intraclass similarity and minimizing the interclass similarity.
Clustering
33
[ ] detects objects in data that do not follow norms and its methods may include statistical tests or using distance measures.
Outlier Analysis
34
What are the two things to consider in the Data Interpretation stage?
- How to recognize business value from knowledge patters discovered. - How to visualize the results to properly interpret patterns.
35
A pattern is interesting if:
- easily understood by humans - valid on new or test data with some degree of certainty - potentially useful - novel
36
What are the two primary techniques of descriptive analytics?
- Data Aggregation - Data Presentation