skills Flashcards
01 rate versus percent
Rate is per capita, and used to analyze groups of different size. It is per person versus per one hundred people. You divide the number of observations by the total number in the population.
1 how to identify the individuals and variables in a dataset
- Individuals are the objects described by a set of data. Individuals may be people, but they may also be animals or things.
- A variable is any characteristic of an individual. A variable can take different values for different individuals.
1 how to identify categorical VS quantitative variables (and the units of measurement for each quant. var.)
quants are numbers that can be measured, categories can contain words and choices
1 • SPSS • how to make a pie chart
SPSS
- Graphs → Chart Builder
- Gallery → Pie/Polar; Pie Chart → Chart Preview
- drag to determine “slice by” and “angle variable” in chart preview
- In Element Properties, under Statistics, “Statistic:” select “value”
- (Apply if necessary)
- Done
- Double click to edit in Chart Editor

1 how to recognize when a pie chart can and cannot be used
A pie chart must include all the categories that make up a whole. Use a pie chart only when you want to emphasize each category’s relation to the whole.
1 • SPSS • how to make a bar graph of a cat. var. dist.
SPSS
- Graphs → Chart Builder
- Gallery → Bar; Bar Chart → Chart Preview
- Drag to axes
- Element Properties → Statistics → Statistic → Value
- double click to open chart editor
- Properties → Categories
1 • SPSS • how to make a histogram of a quant. var. dist.
- Variable view
- Analyze –> Descriptive Statistics –> Frequencies
- Move desired variable to “Variables” –> click CHARTS
- Under chart type, select histogram
- optional: select show normal curve
- Continue –> OK
1 How to describe skewness
- A distribution is skewed to the right if the right side of the histogram(containing the half of the observations with larger values) extends much farther out than the left side.
- It is skewed to the left if the left side of the histogram extends much farther out than the right side.
- the mean moves towards the skew. a larger mean is right skewed and positive, and vice versa.
1 how to describe a histogram
shape, center and variability or spread. roughly symmetric, distinctly skewed or neither. Existence of outliers.
1 •SPSS • how to make and assess a stem plot,
incl. round leaves and split stems
MAKE
- Analyze –> descriptive statistics –> explore
- move variable to dependent list –> click plots
- Select “Stem and leaf” under descriptive –> continue –> OK
ASSESS
- how many peaks does it have?
- if a long tail goes (down) toward the larger numbers it is “right-skewed”
1 • SPSS • how to make a time plot, and recognize trends and cycles
SPSS
- Graphs –> Chart Builder
- Gallery: select scatter/dot
- Then drag simple scatter to chart preview section
- define axes by dragging (time unit goes to X)
- THEN ADD INTERPOLATION LINE
- click button in attachment in chart editor
- select straight from line type

2 • SPSS • how to find the mean x̄ and standard deviation s for a set of observations
MANUAL
basically the average
SPSS
- Analyze –> descriptive statistics –> frequencies
- moved desired variable to “variables,” –> click “statistics”
- Under percent values, click quartiles, and all other checkboxes needed

2 how to find the Median M and quartiles Q1 and Q3 for a set of observations
To find the median
- arrange all observations in order of size from smallest to largest
- if the number of obs. is ODD, the M is the center obs.
- if the n is EVEN, the M is midway btw the two center obs
- or use (n+1)/2 <– note: this just gives the location of the median
2 know whether the mean or the median are resistant, and know what skewness does
the median is more resistant that the mean, and skewness moves the mean away from the median toward the long tail
2 how to assess a box plot
- look at center, spread, symmetry and skewness
2 • SPSS • how to find the five number summary and draw a box plot
to find the five number summary
- list the lowest and highest observation, the median, and then the medians of each half
To make a box plot in SPSS
- Variable view
- Analyze –> Descriptive Statistics –> Explore
- Move variable to dependent list, click plots
- Under Boxplots –> Factor levels together
2 know how to find the variance and standard deviation
- the standard deviation and variance measure the variability by looking at how far the observations are from their mean
- find the variance by averaging the squares of the difference between each observation and the mean and then dividing by the number of observations minus one
- the standard deviation is the square root of the variance
- you divide by n-1 because the deviations always sum to 0 and you need to not be dividing by zero (kind of)

2 know the basic properties of standard deviation
- s ≥ 0 always
- s = 0 only when all observations are identical and increases as the spread increases
- s has the same units as the original measurements
- s is pulled strongly by outliers and skewness
3 how to approximately locate the mean and median on a density curve
- the median is the equal areas point
- the mean is the balance point
3 how to use the 68-95-99.7 rule and symmetry to state what percent of observations of a normal distribution fall between two points.
- The mean and standard deviation will be listed as N(?,?)
- 68% will fall within one standard deviation σ
- 95% will fall within 2σ
- 99.7% will fall within 3σ
3 how to find the z-score or “standardized value,” and what it means for a normal distribution
The z-score, or standardized value, tells us how many standard deviations the original observation falls away from the mean, and in which direction.
to find it you subtract the mean of the distribution from X and then divide by the standard deviation, as attached.

3 how to calculate the proportion of values above, below or between certain numbers when given a stated mean μ and standard deviation σ
- state the problem and draw a picture
- use Table A backward. Find the given proportion in the body of the table and then read the corresponding z from the left column and top row.
- unstandardize z back to X… x lies the z amount standard deviations away from the mean, so : x = (the mean) plus (the standard deviation) times (the z-score that we are given)
3 • SPSS • how to determine area to the left in a normal distribution (i.e. probability a score will fall within a certain range)
SPSS
- new data set, variable view
- create variable “[variable]” you want to see what is to the left of
- data view, enter desired area to the left (e.g. 0.95 for 95th%)
- Transform –> compute variable
- Target variable: Prob
- Function Group: CDF & Noncentral CDF
- Functions and Special variables: Cdf.Normal
- CDF.NORMAL(?,?,?) <– first ? =”[variable]”, Mean and SD
- Click OK
- You get a two decimal amount, click to see more decimals
3 • SPSS • how to determine percentiles for normal distributions
SPSS
- new data set, variable view
- create variable “area”
- data view, enter desired area to the left (e.g. 0.95 for 95th%)
- Transform –> compute variable
- Target variable: “percentile”
- Put cursor in Numeric Expression then Function Group menu select “Inverse DF”
- From Functions and special variables menu, double -click “Idf.Normal”
- “IDF.NORMAL(?,?,?)” will appear in Numeric Expression box
- select first question mark and place the variable “area” there
- Replace second question mark with Mean
- Replace third question mark with standard deviation
- Click OK
- Data editor will now display percentile

























