AP Statistics Flashcards
Individuals
Individuals are any data set that contains information about a group. The group that is being studied or experimented.
Variable
A variable is an attribute that describes a person, place, thing, or idea.
The value of the variable can “vary” from one entity to another.
For example, suppose we let the variable x represent the color of a person’s hair. The variable x could have the value of “blond” for one person, and “brunette” for another.
Categorical Variable
Categorical. Categorical variables take on values that are names or labels. The color of a ball (e.g., red, green, blue) or the breed of a dog (e.g., collie, shepherd, terrier) would be examples of categorical variables.
Quantitative Variable
Quantitative. Quantitative variables are numerical. They represent a measurable quantity. For example, when we speak of the population of a city, we are talking about the number of people in the city - a measurable attribute of the city. Therefore, population would be a quantitative variable.
Discrete Variables
Suppose we flip a coin and count the number of heads. The number of heads could be any integer value between 0 and plus infinity. However, it could not be any number between 0 and plus infinity. We could not, for example, get 2.5 heads. Therefore, the number of heads must be a discrete variable.
Continuous
Suppose the fire department mandates that all fire fighters must weigh between 150 and 250 pounds. The weight of a fire fighter would be an example of a continuous variable; since a fire fighter’s weight could take on any value between 150 and 250 pounds.
Univariate Data
Univariate data. When we conduct a study that looks at only one variable, we say that we are working with univariate data. Suppose, for example, that we conducted a survey to estimate the average weight of high school students. Since we are only working with one variable (weight), we would be working with univariate data.
Bivariate Data
Bivariate data. When we conduct a study that examines the relationship between two variables, we are working with bivariate data. Suppose we conducted a study to see if there were a relationship between the height and weight of high school students. Since we are working with two variables (height and weight), we would be working with bivariate data
Population
In statistics, population refers to the total set of observations that can be made.
For example, if we are studying the weight of adult women, the population is the set of weights of all the women in the world. If we are studying the grade point average (GPA) of students at Harvard, the population is the set of GPA’s of all the students at Harvard.
Sample
In statistics, a sample refers to a set of observations drawn from a population.
Often, it is necessary to use samples for research, because it is impractical to study the whole population. For example, suppose we wanted to know the average height of 12-year-old American boys. We could not measure all of the 12-year-old boys in America, but we could measure a sample of boys.
Census
A census is a study that obtains data from every member of a population. In most studies, a census is not practical, because of the cost and/or time required.
Distribution
The distribution of a statistical data set (or a population) is a listing or function showing all the possible values (or intervals) of the data and how often they occur. When a distribution of categorical data is organized, you see the number or percentage of individuals in each group.
Inference
The process of using data analysis to deduce properties of an underlying distribution of probability. Inferential statistical analysis infers properties of a population, for example by testing hypotheses and deriving estimates.
Frequency Table
When a table shows frequency counts for a categorical variable, it is called a frequency table Below, the bar chart and the frequency table display the same data
Relative Frequency
A frequency count is a measure of the number of times that an event occurs.
To compute relative frequency, one obtains a frequency count for the total population and a frequency count for a subgroup of the population. The relative frequency for the subgroup is:
Relative frequency = Subgroup count / Total count
The above equation expresses relative frequency as a proportion. It is also often expressed as a percentage. Thus, a relative frequency of 0.50 is equivalent to a percentage of 50%.
Table
The values of the cumulative distribution functions, probability functions, or probability density functions of certain common distributions presented as reference tables for different values of their parameters.
Pie Chart
A pie chart (or a circle chart) is a circular statistical graphic, which is divided into slices to illustrate numerical proportion. In a pie chart, the arc length of each slice (and consequently its central angle and area), is proportional to the quantity it represents.
Bar Graph
A bar chart is a graphical representation of the categories as bars. … A bar chart can be plotted vertically or horizontally. Usually it is drawn vertically where x-axis represents the categories and y-axis represents the values for these categories.
Two-way Table
A two-way table (also called a contingency table) is a useful tool for examining relationships between categorical variables. The entries in the cells of a two-way table can be frequency counts or relative frequencies (just like a one-way table ).
Marginal Distribution
Entries in the “Total” row and “Total” column are called marginal frequencies or the marginal distribution. Entries in the body of the table are called joint frequencies.
Conditional
Conditional probability is the probability of one event occurring with some relationship to one or more other events. For example: Event A is that it is raining outside, and it has a 0.3 (30%) chance of raining today. Event B is that you will need to go outside, and that has a probability of 0.5 (50%).
The formula for conditional probability is:
P(B|A) = P(A and B) / P(A)
which you can also rewrite as:
P(B|A) = P(A∩B) / P(A)
Segmented Bar Graph
A segmented Bar chart is one kind of stacked bar chart, but each bar will show 100% of the discrete value. For example, there are a total of 40 students in your classroom. Out of them, 25 students like Basketball, 30 students like Volleyball, and 20 students like Badminton. There are 25 boys and 15 girls in the class. The data along the vertical side of the box represents sports while the horizontal represents a certain percentage for each sport. Each bar will show the preference of each sport according to the number of boys and girls and the bars will be separated by stacked order, representing one group for the boys and the other for the girls.
Side by Side Bar
In a side-by side bar chart, the bars are split into colored bar segments. The bar segments are placed next to each other. … In a stacked bar chart, the bar segments within a category bar are placed on top of each other, and in a side-by-side bar chart, they are placed next to each other.
Graph
A statistical graph or chart is defined as the pictorial representation of statistical data in graphical form. The statistical graphs are used to represent a set of data to make it easier to understand and interpret statistical data.