Data Sets and Types Flashcards
(18 cards)
data set
an organized collection of data about a specific topic
case
an individual or thing about which data is collected
variable
any specific type of data collected about a set of cases
dependent variable
The value of a dependent variable depends on the values of one or more other variables in the data set.
independent variable
not dependent.
data point
gives the value of a specific variable in a specific case
record
a list of the data points for one case
Qualitative data
any type of data that doesn’t use numbers to stand for a quantity
Nominal data
any type of qualitative data that’s not ordered in any relevant way. The statistical measures of mean, median, and range don’t apply to nominal data because those measures need an ordering that nominal data lacks. But even in a set of nominal data, some values may appear more often than others. So, the statistical measure of mode does apply to nominal data because the mode is simply the value that appears most often.
Ordinal data
qualitative data ordered in a way that matters in the data set
Binary data
takes only two values, like “true” and “false.”
Partly ordered data
has an order among some cases but not among others. The statistical measure of median does not apply to a set of partly ordered values, though it might apply to a subset whose values are all fully ordered.
Quantitative data
data about quantities measured in numbers
Quantitative data is continuous
if it measures something that can be infinitely divided
discrete
Quantitative data that isn’t continuous
Interval data
uses a measurement scale whose number zero doesn’t stand for a complete absence of the factor measured. So, for interval data, a measurement above zero doesn’t show how much greater than nothing the measured quantity is. Because of this, the ratio of two measurements in interval data isn’t the ratio of the two measured quantities.
Ratio data
uses a measurement scale whose number zero stands for the absence of the measured factor. In ratio data, as in interval data, the difference between two measurements stands for the actual difference between the measured amounts. However, in ratio data, unlike interval data, the ratio of two measurements also stands for the actual ratio of the measured amounts.
Logarithmic data
use a measurement scale whose higher values stand for amounts exponentially farther apart. For logarithmic data, as for ratio data, the number zero stands for a complete absence. But in logarithmic data, the higher two measurements a certain number of units apart are, the greater the real difference between the measured amounts is.