Recap Statistics Flashcards
What is qualitative data?
Outcomes are categorical
What is nominal?
Mutually exclusive categories, labeling
A nominal scale describes a variable with categories that do not have a natural order or ranking. You can code nominal variables with numbers if you want, but the order is arbitrary and any calculations, such as computing a mean, median, or standard deviation, would be meaningless.
Examples of nominal variables include:
genotype, blood type, zip code, gender, race, eye color, political party
What is ordinal?
Natural ordering (e.g. preference for chocolate)
An ordinal scale is one where the order matters but not the difference between values.
Examples of ordinal variables include:
socio economic status (“low income”,”middle income”,”high income”), education level (“high school”,”BS”,”MS”,”PhD”), income level (“less than 50K”, “50K-100K”, “over 100K”), satisfaction rating (“extremely dislike”, “dislike”, “neutral”, “like”, “extremely like”).
Note the differences between adjacent categories do not necessarily have the same meaning. For example, the difference between the two income levels “less than 50K” and “50K-100K” does not have the same meaning as the difference between the two income levels “50K-100K” and “over 100K”.
What is quantitative data?
Outcomes are numerical
What is interval data?
An interval scale is one where there is order and the difference between two values is meaningful.
Examples of interval variables include:
temperature (Farenheit), temperature (Celcius), pH, SAT score (200-800), credit score (300-850).
What is ratio data?
A ratio variable, has all the properties of an interval variable, and also has a clear definition of 0.0. When the variable equals 0.0, there is none of that variable.
Examples of ratio variables include:
enzyme activity, dose amount, reaction rate, flow rate, concentration, pulse, weight, length, temperature in Kelvin (0.0 Kelvin really does mean “no heat”), survival time.
When working with ratio variables, but not interval variables, the ratio of two measurements has a meaningful interpretation. For example, because weight is a ratio variable, a weight of 4 grams is twice as heavy as a weight of 2 grams. However, a temperature of 10 degrees C should not be considered twice as hot as 5 degrees C. If it were, a conflict would be created because 10 degrees C is 50 degrees F and 5 degrees C is 41 degrees F. Clearly, 50 degrees is not twice 41 degrees. Another example, a pH of 3 is not twice as acidic as a pH of 6, because pH is not a ratio variable.
Which data type has the highest degree of information?
Ratio, because it has an absolute zero
all op the operators including times and divided by are included
When can a histogram be use?
When the data is interval
How are the number of calls intervals determined?
Depends on the number of observations
How is the class interval width determined?
(largest-smallest observation)/#classes
How does positive skewness look like?
skewed to the left. (more frequent observations on the left)
What is a modal class?
With a distribution in classes it is the class with the highest frequency.
What are descriptive techniques for qualitative data?
Bar and pie charts
What does the bar chart display?
- emphasizes the frequency of occurrence of the different categories
- nominal or ordinal variable
What does the pie chart display?
- emphasizes the proportion of occurrences of each category
- nominal or ordinary variable
What is an ogive?
In statistics, an ogive is a graphic showing the curve of a cumulative distribution function drawn by hand. The points plotted are the upper class limit and the corresponding cumulative frequency.
What is the arithmetic mean?
mean = sum of all observations/ # of observations
How is the sample mean denoted?
x with a bar on it