Module 1 Flashcards

(50 cards)

1
Q

What is an individual?

A

A thing (person, object, event, etc.) we collect data from or about (AKA observations or subjects)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a variable?

A

A characteristic of an individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Statistics??

A

the science of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does statistics help us do?

A
  • Describe data visually and numerically
  • Estimate values with a measure of confidence
  • Predict values of unobserved variables
  • Decide when a difference of effect is significant
  • Understand patterns in randomness
  • Learn from data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is population?

A

The set of all individuals we are interested in, usually at a fixed point in time
(number of individuals in a population is denoted by N)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a population parameter?

A

A number summarizing some aspect of the population. GOAL! Population parameters are fixed and usually unknowable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the four main population parameters?

A
  • μ, mu, the mean
  • p, the proportion
  • δ, sigma, the standard deviation
  • ρ, rho, the correlation b/w variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a census?

A

When every individual in the population is studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a sample?

A

It is a subset of the population that data is actually collected from

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are reasons sampling is hard?

A
  • Expensive in time, money or other reasons
  • Lack of access, private records, nonresponse
  • Collecting data may be destructive
  • The entire population doesn’t exist
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a sample statistic?

A

A number summarizing some aspect of the sample. It is used to estimate the population parameter. It is knowable but random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the four main sample statistics?

A
  • sample mean, x-bar
  • sample proportion, P-hat
  • sample standard deviation, s
  • sample correlation, r or R
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is stratified sampling?

A

Partitioning pop. into non-overlapping groups. Take on SRS from each

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the pros and cons of stratified sampling?

A

Pros: Ensures significant subgroups are represented. Allows for comparisons b/w groups as well
Cons: Need proportions of subgroups in the sample to be similar to proportions in the population. If not, results need to be reweighted. Cons:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is cluster sampling?

A

Partitioning pop. into non-overlapping groups. Randomly select a few groups. Sample everything in those groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the pros and cons of cluster sampling?

A

Pros: Convenient and cost effective. Allows for increased sample size
Cons: If clusters are too homogenous there may not be enough diversity to represent the entire population. Increases the probability of an inaccurate answer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is systematic sampling?

A

Where you pick an integer k. Arrange indiv. in some order. Choose a random starting indiv. b/w 1 and k-1. Sample every kth indiv. after that

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the pros and cons of systematic sampling?

A

Pros: If order is random, mimics simple random sampling, but doesn’t not need a frame. If order is sorted, mimics stratified sampling.
Cons: If k is too big your sample will be too small. If k is too small, you could waster resources on a larger sample than is necessary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

A sample is representative if…

A

It has characteristics similar to the pop. from which its drawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is convenience sampling?

A

Researchers choose individuals that are easy to access

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the pros and cons of convenience sampling?

A

Pros: none, except for ease
Cons: Individuals tend to be too similar and lack diversity present in the larger population

22
Q

What is voluntary response sampling?

A

Individuals self select into the sample

23
Q

What are the pros and cons of response sampling?

A

Pros: none except for ease
Cons: Biased in favor of individuals with strong opinions

24
Q

The best sampling techniques are

A

based on random selection of individuals

25
The law of Large numbers
Estimates from random samples will approach the true value as the sample size increases
26
Simple Random Sampling (SRS)
Every sample of a given size is equally likely to
27
Sampling is done with replacement if... and without replacement if...
Individuals can be selected more than once/individuals cannot be selected again
28
Pros and Cons of Simple Random Sampling
Pros: simple, but effective. The probability of a representative sample increases with size Cons: Requires access to a frame- a list of all individuals in the population
29
What is anecdotal evidence?
evidence from personal observations made in a casual nonsystematic manner
30
What is bias?
Certain outcomes are systematically favored
31
What is sampling bias?
Bias due to who or what is in the sample
32
What is under coverage?
Some individuals in the pop. have 0 probability of being selected
33
What is nonresponse bias?
Individuals intended for the sample do not respond
34
What is voluntary response bias?
Individuals w/ strong opinions self-select into the sample
35
What is response bias?
Something about the questions or how they are asked/worded/ordered influences the responses
36
What are the 3 main types of response bias
Wording of questions, ordering of questions (priming), and pride/shame/interviewer effect
37
How do you control pride/shame/interviewer effect?
1. Assure of anonymity and confidentiality 2. Self-administer sensitive questions
38
What is a explanatory/independent variable?
One that explains a change in other variables. Also known as the input variable
39
What is a response/dependent variable?
one that changes in response to other variables
40
What is an observational study?
Researchers do not manipulate the explanatory variables
41
What are the pros and cons of observational studies?
Pros: Observational studies allow us to look for relationships when the explanatory variable is difficult or unethical Cons: No matter how strong the correlation is, you cannot conclude causation.
42
What is a controlled experiment?
Where researchers intentionally manipulate the explanatory variable
43
What does the principle controlling mean?
Eliminate variability in other variable that might affect the response
44
What is the principle of blocking mean?
Group individuals according to a characteristic that might affect the response
45
What does the principle of randomization mean?
Randomly assign individuals to treatment groups
46
What does the principle of replication mean?
Study as many individuals as possible under each treatment
47
Confounding occurs when...
the effects of variables on the response cannot be separated
48
What is the completely randomized design?
Individuals randomly assigned to treatment groups
49
What is the randomized block design?
Individuals are partitioned into similar "blocks" before random assignment to treatment
50
What is the matched pairs design?
Individuals are matched into similar pairs. Each member gets a different treatment