chapter 1: picturing distributions with graphs Flashcards
(34 cards)
Individuals
Particular things
Population
Set of individuals
Variable
an attribute of an individual.
value (of a variable)
any way that that variable could be exhibited by an individual
True or False: every individual must have only a single value for any given variable
True
Data
numbers with a context, or, values of variables for the individuals in a population
Dataset
The particular data that we are presented with
observation
a member of our dataset
sample
That part of the population from which our observations come
The size of a dataset (or sample)
the number of observations in it
in order to clearly define a statistical problem, we must…
we must first clearly state the population and the variables that it concerns
exploratory data analysis
when you seek simply to describe datasets
distribution
a description of the values a variable takes and how
often it takes them
graph
a visual representation of a distribution
count
a category’s size
If we want to know the proportion of observations in a dataset that are in a given category…
we divide the count of that category by the sample size
If we want to know the percentage of observations in a dataset that are in a given category…
we multiply the proportion of observations in the dataset that are in that category by 100%
two ways we can look at the size of a given category
we might be interested in how it compares to either the sizes of the other categories, or in how it compares to the size of the population
To compare the sizes of the categories with each other
we use a bar graph
One drawback of bar graphs is
it is difficult to look at proportions of observations using them
To compare the sizes of the categories with the size of the dataset
we calculate the percentage of the dataset that fall into each category
to compare the percentage of any given category not only to each other, but… to the dataset as a whole as well
we use a pie chart
roundoff error
when the error in percentage totals reflect the accumulated errors in rounding
when dealing with quantitative data, we must have [blank]
a unit of measurement