Ch11 Quantitative analysis Flashcards
Codebook
a document that describes the procedure for coding variables and their location in a format for computers
4 ways to get quantitative raw data into computer
- code sheet - paper with printed grid - record info so it can be easily entered
- Direct entry - a method of entering data into a computer by typing data without code or optical scan sheets
- optical scan - gather the information then enter it into optical scan sheets by filling the correct dots
- bar code - gather the information, then convert it into different widths of bars that are associated with specific
possible code cleaning
cleaning data using a computer in which the researcher looks for responses or answer categories that cannot have cases. also called wild code checking
contingency cleaning
cleaning data using a computer in which the researcher looks at the combination of categories for two variables for logically impossible cases
descriptive statistics
describe numerical data
univariate statistics
one variable - easiest way to describe numnerical data of one variable is with a frequency distribution = used with any type of data
bimodal
a distribution with two modes
multimodal
distribution with more than one mode
skewed distribution
distribution of cases among the categories of a variable that is not normal
z scores
compare two or more distributions or group - standardized score
number of standard deviations it is above or below the mean
bivariate statistics
only involve two variables
correlation
means that things go together are associated
independence
opposite of correlation - no association between two variables
scattergram/ scatterplot
A diagram to display the statistical relationship between two variables based on plotting each case’s values for both of the variables
Precision
Amount of spread in points on the graph
Cross-tabulation
placing data for two variables in a contingency table to show the number or percentage of cases at the intersection of categories of the two variables
contingency table
a table that shows the cross-tabulation of two or more variables.
it ususally shows bivariate quantitative data for variables in the formof percentages across rows or down columns for the categories of one variable.
Three ways to percentage a table
by row
by column
the total - total columns or marginals
measure of association
a single number that expresses the strength, and often the direction of a relationship.
it condenses information about a bivariate relationship into a single number.
5 measures of association - gamma
used for ordinal level data
based on comparing pairs of variable categories and seeing whether a case has the same rank on each -1 to 1 and 0 is no association
5 measures of association -lambda
nomial level data
it is based on a reduction in errors based on the mode and ranges between 0 - nothing and 1 - strongest possible relationship
5 measures of association -tau
ordinal level data
Takes care of problems that occur with gamma
several statistics named tau and one is kendalls tau -1 to 1 0 = nothing
5 measures of association -rho
Pearson’s product moment correlation coefficient
when they use the term correlation
can only be used for interval and ratio
Used for the mean and SD
5 measures of association -chi squared
two different uses
Can be used as a measure of association in descriptive statistics
or can be used in inferential statistics