Test Flashcards

(57 cards)

1
Q

Categorical

A

Most common: graphical display
Pie chart
Bar chart

Pictograms
Frequency tables

Numerical summaries: category counts and percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Quantatative

A

Histogram [=including, (=not including
Stem plot
Box plot (5# summary)
- longer/shorter quartile means spread of data not more data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Mean

A

Average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Median

A

Middle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

More

A

Most often occurring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Standard Deviation

A

Use w/ symmetry and mean

68% fall w/in 1 SD of the mean
95% fall w/in 2 SD of the mean
99% fall w/in 3 SD of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

IQR

A

Inner quartile range

Gives us the middle 50%

Used w/skewed data and median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

1.5 IQR

Used to detect outliers

A

Q1-1.5(IQR)

Q3-1.5(IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explanatory Variable

X

A

Variable that claims to explain, predict or affect the response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Response variable (Y)

A

Outcome of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

C — Q

A

Box plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

C — C

A

Two way tables / contingency table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Q — C

A

Conditional percentile tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Q — Q

A

Scatter plot

Increase in X = increase in Y
Decrease in X = decrease in Y

U shape = not positive or negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

r= linear correlation coefficient

A

( -1 to 1 )

0 to -1 = neg relationship
0 to +1 = pos relationship

Measures strength of linear relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Simpsons Paradox

A

When a lurking variable causes us to think the direction of an association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Population

A

Group chosen for sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Sample Frame

A

List of individuals to be sampled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Sample

A

Actual individuals chosen for sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Simple Random Sample

A

Individuals sampled at random without replacement.

Selecting names out of a hat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Cluster sample

A

Used when population is naturally divided into groups

Students in university divided into majors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Stratified sample

A

Used when population naturally divided into subpopulations

Students in certain college divided by gender or year in college

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Systematic sample

A

Obtain contact information and sample every so many people (I.e. Every 50th person)

24
Q

Observational Study

A

Values of variables are recorded as they naturally occur

25
Experiment
Researcher defines the explanatory variable
26
Prospective
Values of the variables recorded forward in time
27
Retrospective
Values of variables recorded backward in time
28
Blind experiment
Subjects unaware of which treatment they are receiving
29
Double Blind Experiment
Testing procedure designed to eliminate biased results Where identity of those receiving a test treatment is concealed from both administrators and subjects until study is completed.
30
Hawthorne Effect
People in an experiment behave differently from how they would normally behave.
31
Lack of realism
Subjects/treatments/setting of an experiment may not realistically duplicate the conditions we want to study.
32
Noncompliance
Failure to conform to roles / standards
33
Blocking
Divide subjects into groups of individuals who are similar with respect to an outside variable.
34
Matched pairs
Special case of randomized block design. Used when experiment has two treatment conditions and subjects can be grouped into pairs. Then within each pair subjects are randomly assigned to different treatments.
35
Randomized response
Survey technique for eliminating evasive answers.
36
Leading question
Questions that influence the response
37
Sensitive questions
Questions that may make someone answer dishonestly because of how they feel. (I.e. Questions about lowest grade last year).
38
Classical problems (theoretical/true problems)
Games of chance Flipping coins, rolling dice, spinning spinners
39
Empirical problems (relative frequency)
Run a simulation or use a random sample Use a series of trials that produce outcomes that cannot be predicted in advance
40
Law of large numbers
As the number of trials increases, the relative frequency becomes the actual probability
41
Rule #1. Probabilities are between 0 and 1
A +B + C = 1
42
Rule #2 | Something must happen
As number increases should see a change
43
Rule #3. Complement Rule
P(not A) = 1-P(A)
44
Rule #4 addition rule for disjoint events
P(A or B) = P(A) + P(B) P(A or B) = probability that event A occurs or event B occurs or both)
45
Rule #5 multiplication rule for independent events
P(A and B) = P(A)*P(B) P(A and B) = Probability that event A and event B occur.
46
Disjoint events
Whether or not it is possible for the events to occur at the same time
47
Independent events
If event A occurring does not effect the probability of event B will occur.
48
Probability of at least
L = at least / or not P(L) = 1-P(notL)
49
Rule #6 general rule of addition
P(A or B) = larger number P(A and B) = smaller number
50
Distribution means...
What values the variables take and how often the variables take those values.
51
Skewed right
Most data on left, minimal on right. (Right tail (larger values) is much longer than left tail (smaller values)) Example: distribution of salary.
52
Skewed Left
Left tail (smaller values) is much longer than the right tail (larger values). (Example: age of death from natural causes).
53
Stemplot
Retains actual data and organized it.
54
Given that the student is male, what is the probability that he has one or both ears pierced?
P(E) = probability of having one or both ears pierced P(M) = male student P(E | M )
55
Formal definition of probability
P(B | A) = P(A and B) / P(A)
56
If service A has failed to deliver the document on time, what is the probability that it has arrived on time using service B.
P(B | not A) = P( B and not A) / P ( not A)
57
Testing for independence
Compare the overall probability to the conditional probability. ``` Compare P(B | A) to P (B) as well as P(A | B) to P(A) or, P(B | A) to P(B | not A) ``` If the two events are equal then they are independent.