Lesson0.2: Statistical Analysis Flashcards

(55 cards)

1
Q

Descriptive statistics _________ data. It does not seek __________ within it

A

Describe, relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are descriptive statistics used for?

A

Measures of central tendency and measures of dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do measures of central tendency do?

A

Estimate the center position of values in a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do measures of dispersion do?

A

Describe how spread out the values of the data are

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is discreet data?

A

Numerical data restricted to certain (usually integer) values
Example: rolling a die, can only yield 1, 2, 3, 4, 5, or 6. You can’t get a 5.6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is continuous data?

A

Numerical data not restricted to certain number values

Example: the mass of a person can be 63kg, 62.6kg, 62.6523782 kg

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a uniform distribution?

A

A type of continuous probability distribution where all probabilities are equal
Example: date/time of birth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a normal distribution?

A

A type of continuous probability distribution with a bell curve shape
Example: heights of adult Canadian females

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

All normal distributions have the same properties. Name the 3 properties

A

1) They have a bell shape and are symmetrical
2) The mean is in the center of the distribution
3) The area under the curve is 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The Y axis in a continuous probability distribution is the …

A

Frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The X axis in a continuous probability distribution is the …

A

Variable of interest (e.g., mass)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is an advantage and disadvantage of using the mean?

A

Pro: it takes all values into account and can thus help minimize error
Con: it takes into account outliers, which can dramatically skew the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does x̄ represent?

A

Sample mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does µ represent?

A

Population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the median?

A

The middle value of an ordered set; the 50th percentile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In what type of data set are the mean and median the same?

A

In a symmetric distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Which measure(s) of central tendency can be used with nominal data sets?

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Which measure(s) of central tendency can be used with ordinal data sets?

A

Mode, median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Which measure(s) of central tendency can be used with interval data sets?

A

Mode, median, mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Which measure(s) of central tendency can be used with ratio data sets?

A

Mode, median, mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the most appropriate measure of central tendency for interval or ratio data that are skewed or contain outliers?

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the most appropriate measure of central tendency for non-skewed data?

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Measures of dispersion describe …

A

How spread out the data is

24
Q

How is the range calculated?

A

Subtracting the smallest value in a set from the largest value

25
The first quartile Q1 is larger than _____ of the observations
25%
26
The third quartile Q3 is larger than ____ of the observations
75%
27
How do you calculate the interquartile range (IQR)?
IQR = Q3 - Q1
28
What is standard deviation?
A statistical measure of variability that indicates the average amount that a set of numbers deviates from their mean
29
What is variance?
The square of the standard deviation
30
What does s represent?
Standard deviation for a sample
31
What does σ represent?
The standard deviation of a population
32
What is considered an outlier?
Data that is above Q3 + 1.5IQR or below Q1 - 1.5IQR
33
What causes random error and which measure does it decrease?
Caused by human or intstrumental error and decreases precision
34
What causes systematic error and which measure does it decrease?
Caused by observer, instrument, or subject bias and decreases accuracy
35
Which type of error is consistent? Random or systematic?
Systematic
36
What does accuracy measure?
How close the data points are to the actual value
37
What does precision measure?
How close the data points are to each other (how well they cluster)
38
What does a correlation coefficient of 0 indicate?
That there is no LINEAR relationship
39
What is the difference between correlation and simple linear regression?
Correlation does not establish which variable is causing the other. Simple linear regression describes how one variable is associated with another and is an extension of correlation
40
What is a residual and how are they calculated?
A residual is the difference between an observed value of the response variable (DV) and the predicted value. residual = y(observed) - y(predicted)
41
What is a chi-square test?
A test used to calculate p-values when all variables are categorical Example: are people who watch action movies more likely to buy popcorn?
42
What is a t-test?
A test used to calculate p-values and compare the average values of a quantitative variable between two categorical groups Example: is life expectancy different between Canadians and Americans?
43
What is ANOVA
A test similar to a t-test but for more than two groups Example: is the life expectancy different between Canadians, Americans and Mexicans?
44
What is a confidence interval?
An estimated range of values, that is likely to include an unknown population parameter at a given confidence level
45
What is the level of confidence?
The probability that the interval estimate contains the population parameter
46
What is a Type I error?
A false positive: when the null hypothesis is rejected even when it is true
47
What is a Type II error?
A false negative: when the null hypothesis is not rejected when it is false
48
What is internal validity?
The degree to which the independent variable has been demonstrated to cause the dependent variable
49
What is a threat to internal validity?
Confoudning variables
50
How can confounding variables be minimized?
Randomization
51
What is temporality?
The idea that, for variables to be causally related, the independent variable must occur before the dependent variable
52
What is external validity?
The ability of a research design to provide results that can be GENERALIZED to other situations, especially to natural ("real life") situations
53
Name and describe the two factors external validity depends on.
1) The participants included in the sample: they should be representative of the populationto which one wants to generalize 2) The physical realm of the research setting: it should be similar with respect to relevant and important characteristics of the natural situation to which one wants to generalize
54
There is trade-off between ________ validity and ________ validity
Internal, external
55
What is the biopsychosocial (BPS) approach and what are the two central tenets of this model?
An approach to medicine that integrates psychology, sociology, and biology in diagnoses and treatments Two central tenets: 1) illness is a product of more than biology (social and psychological factors) 2) illness has multiple causes (genetic, environmental, psychological0