Unit 1: Exploring One-Variable Data Flashcards
Categorical Variable/Qualitative Data
A variable that takes on values that are category names or group labels. (Think: WORDS)
(EX: Dominant hand, name, college degree)
Quantitative Variable/Data
A variable that takes on numerical values for a measured or counted quantity. (Think: NUMBERS)
(EX: Age, height, count)
Can be discrete or continuous
Frequency table
gives the number of cases in each category
Relative frequency table
gives the proportion of cases in each category (often reported as a percentage)
[Note: percentage, relative frequency, and rates provide the same information as proportions]
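The two tables can be sketched in a few lines of Python. This is a minimal illustration, not a required method; the function names and the dominant-hand data are made up for the example.

```python
from collections import Counter

def frequency_table(values):
    """Count how many cases fall in each category."""
    return dict(Counter(values))

def relative_frequency_table(values):
    """Divide each count by the total to get a proportion."""
    counts = Counter(values)
    total = sum(counts.values())
    return {category: count / total for category, count in counts.items()}

# Hypothetical data: dominant hand of four people
hands = ["right", "left", "right", "right"]
print(frequency_table(hands))
print(relative_frequency_table(hands))
```

Multiplying each proportion by 100 gives the percentage version of the same table.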
Bar Chart/Graph
A graph to display counts or proportions for a categorical variable only
Pie Chart
A chart to display the proportion of cases in each category of a categorical variable as slices of a whole
Discrete Quantitative Variable
A variable that can take on a countable (finite or countably infinite) number of values.
Continuous Quantitative Variable
A variable that can take on any value in an interval, so its possible values are infinitely many and cannot be counted.
Dot Plots
A graph that shows each data value as a dot above its position on a number line. Best for discrete variables.
Stem and Leaf Plots
Stem: the leading digit(s) of a value, written on the left of the plot (EX: stem of 34 is 3)
Leaf: the final digit of a value, written on the right of the plot (EX: leaf of 34 is 4)
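As a rough sketch, splitting two-digit values into stems and leaves can be done with integer division by 10. The function name and the data are made up for illustration, and the sketch assumes non-negative whole numbers with the ones digit as the leaf.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group values by stem (tens and above) with leaves (ones digit)."""
    plot = defaultdict(list)
    for value in sorted(values):
        stem, leaf = divmod(value, 10)  # 34 -> stem 3, leaf 4
        plot[stem].append(leaf)
    return dict(plot)

# Hypothetical data set
data = [34, 21, 37, 45, 29, 30]
for stem, leaves in stem_and_leaf(data).items():
    print(stem, "|", " ".join(str(leaf) for leaf in leaves))
```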
Histogram
[NOT a bar graph]
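Displays a quantitative variable. The X-axis is a continuous number line divided into intervals (bins), and the height of each bar gives the count of values in that interval. The bars touch because the intervals are adjacent.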
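One way to see how a histogram differs from a bar graph is to compute the bar heights by hand. The sketch below assumes equal-width bins that include their left edge and exclude their right edge; the function name, parameters, and height data are hypothetical.

```python
def histogram_counts(values, start, width, bins):
    """Count how many values fall in each equal-width interval.

    Bin i covers [start + i*width, start + (i+1)*width).
    Values outside the covered range are ignored.
    """
    counts = [0] * bins
    for value in values:
        index = int((value - start) // width)
        if 0 <= index < bins:
            counts[index] += 1
    return counts

# Hypothetical heights in cm, binned as [150,160), [160,170), [170,180)
heights = [150.5, 152.0, 161.2, 168.9, 170.0, 179.9]
print(histogram_counts(heights, start=150, width=10, bins=3))
```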
Population
The collection of all individuals or items under consideration in a statistical study
Sample
The part of a population from which information is obtained
Inferential statistics
Drawing and measuring the reliability of conclusions about a population based on information obtained from a sample of the population
Skewed Right
More data on the left with a long tail extending to the right
Skewed Left
More data on the right with a long tail extending to the left
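A common rule of thumb is that a long right tail pulls the mean above the median, and a long left tail pulls it below. The sketch below uses that heuristic only as a hint; it is not a formal test of skewness, and the function name and data are made up.

```python
from statistics import mean, median

def skew_hint(values):
    """Compare mean and median as a rough indicator of skew direction."""
    m, md = mean(values), median(values)
    if m > md:
        return "possibly skewed right"
    if m < md:
        return "possibly skewed left"
    return "roughly symmetric"

# Hypothetical data sets
print(skew_hint([1, 2, 2, 3, 3, 3, 20]))        # one large value in the right tail
print(skew_hint([1, 18, 19, 19, 20, 20, 20]))   # one small value in the left tail
print(skew_hint([1, 2, 3, 4, 5]))
```

Always check the graph as well, since the mean and median alone can mislead for unusual shapes.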
Symmetric Data
A distribution whose left and right halves are approximate mirror images. It may have one peak in the middle (unimodal) or a peak on each side (bimodal).
Uniform Data
The values are spread evenly, so every value (or interval) occurs with about the same frequency and the graph looks flat
Census
Information collected from the entire population of interest.
Sampling
The process of obtaining an appropriate subset of people/items from the population. There are 2 types of simple random sampling.
[SRSWR] Simple random sampling with replacement
Where a member of the population can be selected more than once.
[SRS] Simple random sampling without replacement
Where a member of the population can be selected at most once.
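Python's standard `random` module offers one way to try both kinds of simple random sampling: `choices` draws with replacement and `sample` draws without. The wrapper function names and the population below are assumptions for illustration.

```python
import random

def sample_with_replacement(population, k, rng=random):
    """SRSWR: the same member may be drawn more than once."""
    return rng.choices(population, k=k)

def sample_without_replacement(population, k, rng=random):
    """SRS: each member can be drawn at most once."""
    return rng.sample(population, k)

# Hypothetical population of 10 ID numbers
population = list(range(1, 11))
print(sample_with_replacement(population, 5))
print(sample_without_replacement(population, 5))
```

Without replacement, the sample size cannot exceed the population size; with replacement it can.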
Statistic vs Parameter
Statistic: a numerical value computed from a sample
Parameter: a numerical value describing a population
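To make the distinction concrete, the sketch below computes a mean twice: once over a whole (made-up) population, which gives a parameter, and once over a random sample of it, which gives a statistic. The ages and variable names are hypothetical.

```python
import random
from statistics import mean

# Hypothetical population: ages of all 8 members of a small club
population_ages = [18, 22, 25, 31, 40, 47, 52, 60]

# Parameter: computed from every member of the population
parameter = mean(population_ages)

# Statistic: computed from a sample of 4 members
sample_ages = random.sample(population_ages, 4)
statistic = mean(sample_ages)

print("population mean (parameter):", parameter)
print("sample mean (statistic):", statistic)
```

The parameter is fixed, while the statistic usually changes from one sample to the next.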
Systematic Random Sampling
Elements from a larger population are selected at regular intervals after choosing a random starting point.
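A minimal sketch of systematic random sampling, assuming the population is stored in a list and every `step`-th element is taken after a random start within the first interval. The function name and parameters are made up for illustration.

```python
import random

def systematic_sample(population, step, rng=random):
    """Pick a random start in the first interval, then take every step-th element."""
    start = rng.randrange(step)  # random index from 0 to step - 1
    return population[start::step]

# Hypothetical population of 100 ID numbers, sampling every 10th
population = list(range(100))
print(systematic_sample(population, step=10))
```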