Why lean statistics? Flashcards

Prime my brain for higher level concepts and understanding

1
Q

“Statistics” means…

A

Statistical procedures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The uses of statistics

A
  • Organize and summarize information
  • Determine exactly what conclusions are justified based on the results that were obtained
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Goals of statistical procedures

A
  • Accurate and meaningful interpretation
  • Provide standardized evaluation procedures
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Variable

A

Characteristics or condition that changers or has different values for different individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data (Plural)

A

Measurements or observation of a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data set

A

A collections of measurements or observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A datum (singular)

A
  • A single measurement or observation
  • Score or Raw score
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Parameter

A
  • A value, usually a numerical value, that describes a population
  • Derived from measurements of the individual in the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Statistic

A
  • A value, usually a numerical value, that describes a sample
  • Derived from measurements of individuals in the sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Descriptive Statistics

A
  • Summarize data
  • Organize data
  • Simplify data

E.g.,
- Tables
- Graphs
- Averages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Inferential Statistics

A
  • Study samples to make generalizations about the population
  • Interpret environmental data

Common terminology
- “Margin of error”
- “Statistically significant”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Sampling error

A

The Sample is never identical to the population.

Sampling error is the discrepancy, or amount of error, that exists between a sample statistic and the corresponding population parameter.

Example: Margin of error in polls
“This poll was taken from a sample of registered voters and has a margin of
error of plus-or-minus 4 percentage points”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data Structure I: The correlational Method

A
  • One group of participants
  • Measurement of two variables for each participant
  • Goal is to describe the type and magnitude of the relationship
  • Patterns in the data reveal relationships
  • Non-experimental method of study

**Can not establish causation
Characteristics:
Strength
Form (Usually linear)
Direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Data Structure II: Comparing two (or more) groups of scores

A
  • One variable defines groups
  • Scores are measured on the second variable
  • Both experimental and non-experimental studies use this structure

E.g., T-test and ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Experimental Method

A

Goal
- To demonstrate a cause and effect relationship

Manipulation
- The level of one variable (IV) is determined by the experimenter

Control - rules out influence of other variables (confounds)
- Participant variables
- Environmental variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Independent variable

A

The variable manipulated by the researcher (independent because no other variable influences its value - e.g., sex)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Dependent variable

A

The variable that is observed to assess the effect of treatment (dependent because it is thought to depend on the value of the IV)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Experimental method: Methods of control

A
  • Random assignment
  • Matching of subjects
  • Holding level of some potentially influencing variables constant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Experimental Method: Control condition

A
  • Individuals do not receive the experimental treatment.
  • They either receive no treatment or they receive a neutral, placebo treatment
  • Purpose: to provide a baseline comparison with the experimental conditon
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Experimental Method: Experimental condition

A

Those who receive the experimental treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Non experimental Methods

A

Non-equivalent Groups
- Researcher compares groups
- Researcher cannot control who goes into which group

Pre-test/Post-test
- Individuals measured at two points in time.
- Researcher cannot control influence of the passage of time

Independent variable is ‘Quasi-independent’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Quasi-independent

A

Cannot be controlled
e.g., age, sex, traits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Constucts

A
  • Internal attributes or characteristics that cannot be directly observed
  • Useful for describing and explaining behavior
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Observational Definition

A
  • Identifies the set of operations required to measure an external (observable) behavior
  • Uses the resulting measurements as both a definition AND a measurement of a hypothetical construct
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Discrete variable
- Has separate, indivisible categories - No values can exist between two neighboring categories Distinct numbers e.g., 35, 1, 8, 3
26
Continuous variable
- Has infinite number of possible values between any two observed values - Every interval is divisible into an infinite number of equal parts e.g., height, weight, temp, time
27
Scales of measurement
1. Nominal 2. Ordinal 3. Interval 4. Ratio
28
Nominal Scale
Characteristics: - Label and categories - No quantitative distinctions (no numbers just names) Examples: - Gender - Diagnosis - Experimental or control
29
Ordinal Scale
Characteristics: - Categorize observations - Categories organized by size or magnitude Examples: - Rank in class - Clothing sizes (S, M, L, XL) - Olympic metals
30
Interval Scale
Characteristic: - Ordered categories - Interval between categories of equal size - Arbitrary or absent of zero point Examples: - Temperature - IQ - Golf scores (above/below par)
31
Ratio Scale
Characteristics: - Ordered categories - Equal interval between categories - Absolute zero point Examples: - Number of correct answer - Time to complete task - Gain in height since last year
32
X
Independent variable
33
Y
Dependent variable
34
N
Number of scores in the population
35
n
Number of scores in the sample
36
Σ
Summation - Done after operations in parenthesis, squaring, multiplication and division. - Done before other addition or subtraction
37
What is central tendency?
A statistical measure A single score to define the center of the distribution Purpose: find the single score that is the most typical or best represents the entire group
38
Central Tendency Measures
There's no single concept of central tendency that always the "best" Different distribution shapes require different conceptualizations of "center" Choose the one which best represents the scores in a specific situation
39
The mean is...
the sum of all scores divided by the number of scores in the data Population: Sample:
40
Mean as a balance point
41
The Weighted Mean
- Combine two sets of scores Three steps: 1. Determine the combined sum of all the scores. 2. Determine the combined number of scores 3. Divide the sum of scores by the total number of scores Overall Mean =
42
Computing the Mean from a Frequency Distribution Table
43
Characteristics of the Mean
- Changing the value of a score changes the mean - Introducing a new score or removing a score 'changes' the mean (unless the score added or removed is 'exactly' equal to the mean) - Adding or subtracting a constant from each score changes the mean by the same constant - Multiplying or dividing each score by a constant multiplies or divides the mean by that constant
44
The Median
- The median is the midpoint of the scores in a distribution when they are listed in order from smallest to largest - The median divides the scores into two groups of equal size
45
Locating the Median (odd n)
* Put scores in order * Identify the “middle” score to find median 3 5 8 10 11 “Middle” score is 8 so median = 8
46
Locating the Median (even n)
* Put scores in order * Average middle pair to find median 1 1 4 5 7 9 (4 + 5) / 2 = 4.5
47
The Mode
The mode is the score or category that has the greatest frequency of any score in the frequency distribution - Can be used with any scale of measurement - Corresponds to an actual score in the data It is possible to have more than one mode
48
Bimodal Distribution
49
Symmetrical Distributions
- Mean and median have same value - If exactly one mode, it has same value as the mean and the median - Distribution may have more than one more or no mode at all
50
Central Tendency in Skewed Distributions
Mean, influenced by extreme scores, is found far toward the long tail (positive or negative) Median, in order to divide scores in half, is found toward the long tail, but not as far as the mean Mode is found near the short tail. If Mean – Median > 0, the distribution is positively skewed. If Mean – Median < 0, the distribution is negatively skewed
51
Positive Skew
52
Negative Skew
53
Overview of variability
Variability is defined as: 1. Quantitative 'distance' measure based on the differences between scores 2. Describes 'distance' of the spread of scores or 'distance' of a score from the mean Purpose - Describe the distribution - Measure how well an individual score represents the distribution
54
What are the Three Measures of variability?
1. The range 2. The variance 3. The Standard deviation
55
The range
- Distance covered by the scores in the distribution (smallest value to the highest) - For continuous data, real limits are used - Based on two score, not all data (an imprecise, unreliable measure of variability)
56
The most common and most important measure of variability is...
Standard deviation - A measure of standard, or average, distance from the mean - Describes whether the scores are cluster closely around the mean or widely scattered The calculation differs for sample and population Variance is a necessary 'companion concept' to standard deviation but NOT the same concept ADD equation image
57
Defining the Standard Deviation
Step one: determine the deviation score X — μ Step two: Find a sum of deviations ∑(X — μ) Step 2 (revised):First square each deviation then sum the 'squared' deviations (SS) SS= ∑(X — μ)^2 Step 3: Variance = Average the 'squared' deviations Step 4: SD=SqrT(Variance)
58
Variance is known as...
mean squared deviation
59
Standard deviation
Averaged distance of the scores from the mean
60
Sum of squares formulas
61
continue
slide 67