The Process Phase Flashcards
(31 cards)
Refers to the accuracy, completeness, consistency and trustworthiness of data?
Data Integrity
Process of storing data in multiple locations?
Data Replication
The process of copying data from a storage device memory to another?
Data Transfer
Process that involves changing the data to make it more organized and easier to read?
Data Manipulation
It is the degree to which conforms to the actual entity being measured?
Accuracy
The degree which the data contains all the desired components?
Completeness
The degree to which the data is repeatable from different points?
Consistency
The entire group you are interested in for your study?
Population
A subset of your population?
Sample
It is the difference between the sample’s result from what the result would have been if you had surveyed the entire population?
Margin of Error
Refers to how confident you are in the survey results?
Confidence Level
It is the range of possible values that the population’s result would be at the confidence level of the study?
Confidence Interval
The determination whether your results could be due to random chance or not?
Statistical Significance
Fill in the Blanks
When theres a greater sample size, there’s a _______ confidence level, ____ in margin of error and ____ statistical significance.
- Higher
- Decrease
- Greater
When a sample does not represent the population as a whole
Sampling Bias
It is a sampling technique in which every participants in a population has an equal chance of being chosen?
Random Sampling
When the data is incomplete, incorrect, or irrelevant?
It’s Dirty or Dirty Data
Opposite of Dirty Data
Of course it’s Clean💅✨
It is any data record that shows up more than once?
Duplicated Data
-Caused by manual data entry, data imports and data migration
Any data that is old?
Outdated Data
- people changing roles/companies/softwares/system becoming absolete
Any Data that is missing in important fields?
Incomplete data
- probably due to improper data collection
Any data that is complete but inaccurate?
Incorrect/Inaccurate Data
- human error
Any data that uses different format to represent the same thing?
Inconsistent Data
A group of characters within a cell, often composed of letters, numbers, or both?
Test String