Concepts Flashcards

1
Q

The collection, transformation, and organization of data in order to draw conclusions, make predictions and drive informed decision making.

A

Data Analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the six steps of the data analysis process?

A
Ask
Prepare
Process
Analyze
Share
Act
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The various elements that interact with one another to produce, manage, store, organize, analyze and share data.

A

Data Ecosystem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The graphical representation of information.

A

Visualization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A method for examining and calculating how a process works currently in order to get where you want to be in the future

A

Gap Analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the life cycle of data?

A
Plan 
Capture 
Manage
Analyze 
Archive 
Destroy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A collection of data stored in a computer system

A

Database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

People who have invested time and resources into a project and are interested in the outcome.

A

Stakeholders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A characteristic or quality of data used to label a column in a table.

A

Attribute

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

All of the attributes for something contained in a row of a data table

A

Observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A unique set of guidelines for a programming language

A

Syntax

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the basic SQL structure?

A

SELECT [choose from the column you want]
FROM [from the appropriate table]
WHERE [a certain condition is met]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The process of recognizing the current problem or situation, organizing available information, revealing gaps and opportunities, and identifying the options

A

Structured thinking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The specific and objective measures of numerical facts. This is often the what, how many, and how often about a problem.

A

Quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Subjective or explanatory measures of qualities and characteristics or things that can’t be measured with numerical data. Used to answer why questions

A

Qualitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A static collection of data given to stakeholders periodically.

A

Report

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Monitors live incoming data

A

Dashboard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

A data summarization tool that is used to summarize, sort, recognize, group, count, total or average data stored in a database

A

Pivot table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

A single quantifiable type of data that can be used for measurement.

A

Metric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are the metrics used in ROI (Return on Investment)?

A

Profit and Investments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the types of dashboards?

A

Strategic- focuses on long term goals and strategies at the highest level of metrics
Operational- short term performance tracking and intermediate goals
Analytical- consists of the datasets and the mathematics used in those sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Something made up of specific metrics over a short, well-defined period of time and used to make day-to-day decisions.

A

Small data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Large less specific datasets covering a long period of time and usually have to be broken down and analyzed.

A

Big data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

A collection of two or more cells

A

Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

An agreed upon outline of the work you’re going to perform on a project

A

Scope of Work (SOW)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is the format of SOW

A

Deliverables
Milestones
Timeline
Reports

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Small files stored on computers that contain information about users

A

Cookies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Data collected by an individual or group using their own resources

A

First-party data

29
Q

Data collected by a group directly from its audience and then sold

A

Second-party data

30
Q

Data collected from outside sources who did not collect it directly.

A

Third-party data

31
Q

Part of a population that is representative of the population

A

Sample

32
Q

Data that is counted and has a limited number of values.

A

Discrete data

33
Q

Data that can be measured using a timer and its value can be shown as a decimal with several spaces.

A

Continuous data

34
Q

A type of qualitative data that is categorized without a set order

A

Nominal data

35
Q

A type of qualitative data with a set order or scale

A

Ordinal data

36
Q

Data that lives within a company’s own systems

A

Internal data

37
Q

Data that lives and is generated outside of an organization

A

External data

38
Q

Data that is organized in a certain format, such as rows and columns.

A

Structured data

39
Q

A model that is used for organizing data elements and how they relate to one another.

A

Data model

40
Q

A data model that gives you a high-level view of your data structure, such as how you want the data to interact across an organization.

A

Conceptual data model

41
Q

A data model that focuses on the technical details of the model such as relationships, attributes and entities

A

Logical data model

42
Q

A data model that depicts how the database was built; laying out how each database will be put in place and how the databases., applications and features will interact in specific detail

A

Physical data modeling

43
Q

A specific kind of data attribute that tells what kind of value the data is.

A

Data type

44
Q

What are examples of data types in spreadsheets?

A

number
boolean
text or string

45
Q

The process of changing the data’s format, structure or values.

A

Data transformation

46
Q

The tendency for different people to observe things differently.

A

Observer bias

47
Q

The tendency to always interpret ambiguous situations in a positive or negative way

A

Interpretation bias

48
Q

The tendency to search for or interpret information in a way that confirms preexisting beliefs

A

Confirmation bias

49
Q

Well-founded standards of right and wrong that dictate how data is collected, shared and used

A

Data ethics

50
Q

What are some aspects of data ethics?

A
Ownership
Transaction Transparency 
Consent 
Currency 
Privacy
Oppenness
51
Q

The process of protecting people’s private or sensitive data by eliminating PII (personally identifiable information)

A

Data Anonymization

52
Q

Data about data, it tells you where the data comes from, when and how it was created and what it is all about

A

Metadata

53
Q

A database that contains a series of tables that can be connected to form a relationship

A

Relational Database

54
Q

An identifier that references a column in which each value is unique

A

Primary key

55
Q

A field within a table that is a primary key in another table

A

Foreign key

56
Q

A database in which only related data is stored in each table

A

Normalized database

57
Q

A way of describing the way something is organized

A

Schema

58
Q

The process of storing data in multiple locations

A

Data replication

59
Q

The process of copying data from a storage device to memory or from one computer to another.

A

Data transfer

60
Q

Changing data to make it more organized and easier to read

A

Data manipulation

61
Q

The difference between the sample results and the results if you surveyed the entire population.

A

Margin of error

62
Q

The term for how confident you are in the survey results

A

Confidence level

63
Q

The range of possible values that the population’s results would be at, the confidence level of the study

A

Confidence interval

64
Q

The determination of whether your results could be due to random chance or not.

A

Statistical significance

65
Q

An indication that a value does not exist in a dataset

A

Null

66
Q

A tool for checking the accuracy and quality of data before adding or importing it.

A

Data validation

67
Q

The process of combining two or more datasets into a single dataset

A

Data merging

68
Q

A file containing a chronologically ordered list of modifications made to a project

A

Changelog

69
Q

A process to confirm that data-cleaning effort was well executed and the resulting data is accurate and reliable

A

Verification