Prepare Data for Exploration (Terms) Flashcards

1
Q

Features such as password protection, user permissions, and encryption that are used to protect a spreadsheet

A

Access control

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Metadata that indicates the technical source of a digital asset

A

Administrative metadata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A list of scheduled appointments

A

Agenda

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Digitized audio storage usually in an MP3, AAC, or other compressed format

A

Audio file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A data source that is not reliable, original, comprehensive, current, and cited (ROCCC)

A

Bad data source

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

A conscious or subconscious preference in favor of or against a person, group of people, or thing

A

Bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A data type with only two possible values, usually true or false

A

Boolean data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The tendency to search for or interpret information in a way that confirms pre-existing beliefs

A

Confirmation bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The aspect of data ethics that presumes an individual’s right to know how and why their personal data will be used before agreeing to provide it

A

Consent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data that is measured and can have almost any numeric value

A

Continuous data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A small file stored on a computer that contains information about its users

A

Cookie

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A delimited text file that uses a comma to separate values

A

CSV (comma-separated values) file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The aspect of data ethics that presumes individuals should be aware of financial transactions resulting from the use of their personal data and the scale of those transactions

A

Currency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The process of protecting people’s private or sensitive data by eliminating identifying information

A

Data anonymization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When a preference in favor of or against a person, group of people, or thing systematically skews data analysis results in a certain direction

A

Data bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A piece of information in a dataset

A

Data element

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Well-founded standards of right and wrong that dictate how data is collected, shared, and used

A

Data ethics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

A process for ensuring the formal management of a company’s data assets

A

Data governance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The ability to integrate data from multiple sources and a key factor leading to the successful use of open data among companies and governments

A

Data interoperability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

A tool for organizing data elements and how they relate to one another

A

Data model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Preserving a data subject’s information any time a data transaction occurs

A

Data privacy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Protecting data from unauthorized access or corruption by adopting safety measures

A

Data security

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

An attribute that describes a piece of data based on its values, its programming language, or the operations it can perform

A

Data type

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Metadata that describes a piece of data and can be used to identify it at a later point in time

A

Descriptive metadata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
An electronic or computer-based image usually in BMP or JPG format
Digital photo
26
Data that is counted and has a limited number of values
Discrete data
27
Well-founded standards of right and wrong that prescribe what humans ought to do, usually in terms of rights, obligations, benefits to society, fairness, or specific virtues
Ethics
28
The tendency for different people to observe things differently (Refer to Observer bias)
Experimenter bias
29
Data that lives and is generated outside of an organization
External data
30
A single piece of information from a row or column of a spreadsheet; in a data table, typically a column in the table
Field
31
Data collected by an individual or group using their own resources
First-party data
32
A field within a database table that is a primary key in another table (Refer to primary key)
Foreign key
33
The section of a query that indicates where the selected data comes from
FROM
34
Policy-making body in the European Union created to help protect people and their data
General Data Protection Regulation of the European Union (GDPR)
35
The geographical location of a person or device by means of digital information
Geolocation
36
A data source that is reliable, original, comprehensive, current, and cited (ROCCC)
Good data source
37
Data that lives within a company’s own systems
Internal data
38
The tendency to interpret ambiguous situations in a positive or negative way
Interpretation bias
39
A dataset in which each row is one time point per subject, so each subject has data in multiple rows
Long data
40
Someone who shares knowledge, skills, and experience to help another grow both professionally and personally
Mentor
41
Data about data
Metadata
42
A database created to store metadata
Metadata repository
43
Consistent guidelines that describe the content, creation date, and version of a file in its name
Naming conventions
44
Building relationships by meeting people both in person and online
Networking
45
A type of qualitative data that is categorized without a set order
Nominal data
46
A database in which only related data is stored in each table
Normalized database
47
An interactive, editable programming environment for creating data reports and showcasing data skills
Notebook
48
The tendency for different people to observe things differently (also called experimenter bias)
Observer bias
49
The aspect of data ethics that promotes the free access, usage, and sharing of data
Openness
50
Qualitative data with a set order or scale
Ordinal data
51
The aspect of data ethics that presumes individuals own the raw data they provide and have primary control over its usage, processing, and sharing
Ownership
52
In digital imaging, a small area of illumination on a display screen that, when combined with other adjacent areas, forms a digital image
Pixel
53
In data analytics, all possible data values in a dataset
Population
54
An identifier in a database that references a column in which each value is unique (Refer to foreign key)
Primary key
55
A collection of related data in a data table, usually synonymous with row
Record
56
When the same piece of data is stored in two or more places
Redundancy
57
A database that contains a series of tables that can be connected to form relationships
Relational database
58
In data analytics, a segment of a population that is representative of the entire population
Sample
59
Overrepresenting or underrepresenting certain members of a population as a result of working with a sample that is not representative of the population as a whole
Sampling bias
60
A way of describing how something, such as data, is organized
Schema
61
Data collected by a group directly from its audience and then sold
Second-party data
62
The section of a query that indicates the subset of a dataset
SELECT
63
Websites and applications through which users create and share content or participate in social networking
Social media
64
A professional advocate who is committed to moving forward the career of another
Sponsor
65
A sequence of characters and punctuation that contains textual information (also called text data type)
String data type
66
Metadata that indicates how a piece of data is organized and whether it is part of one or more than one data collection
Structural metadata
67
Data organized in a certain format such as rows and columns
Structured data
68
A sequence of characters and punctuation that contains textual information (also called string data type)
Text data type
69
Data provided from outside sources who didn’t collect it directly
Third-party data
70
The aspect of data ethics that presumes all data-processing activities and algorithms should be explainable and understood by the individual who provides the data
Transaction transparency
71
When the sample of the population being measured is representative of the population as a whole
Unbiased sampling
72
An agency in the U.S. Department of Commerce that serves as the nation’s leading provider of quality data about its people and economy
United States Census Bureau
73
Data that is not organized in any easily identifiable manner
Unstructured data
74
A collection of images, audio files, and other data usually encoded in a compressed format such as MP4, MV4, MOV, AVI, or FLV
Video file
75
The section of a query that specifies criteria that the requested data must meet
WHERE
76
A dataset in which every data subject has a single row with multiple columns to hold the values of various attributes of the subject
Wide data
77
An organization whose primary role is to direct and coordinate international health within the United Nations system
World Health Organization