Chapter 1: statistical basis Flashcards

(52 cards)

1
Q

statistics

A

the study of how to collect, organize, analyze, and interpret data collected from a group
- Has two branches
1. descriptive
2. inferential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

descriptive statistics

A

collect and organize data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

inferential statistics

A

analyze and interpret data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

individual

A

person or object you are interested in finding out information about

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

variable (random variable)

A

the measurement or observation of the individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

population

A

-set of all the entire group of individuals about which we are interested
- collection of all outcomes, responses, measurements, or counts that are of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

sample

A

a subset from the population, it looks just like the population, but contain less data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

parameter

A

-a number calculated from the population. Usually denoted with a Greek letter
-this number is a fixed, unknown number that you want to find

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

statistic

A

-a number calculated from the sample. Usually denoted with letters from the Latin alphabet. Sometimes there is a Latin letter with a caret above it
-sometimes just put a bar over the letter
-used to estimate the parameter value
-a number that describes a sample characteristics
ex: average age of people from a sample of three states
- Ask yourself “is this fact about the whole population?”
-if there is a large population than most likely it is a statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

parameter

A

-a number that describes a population characteristics
ex: average age of all people in the US
- Ask yourself “is this fact about the whole population?”
-if there is a small population than most likely it is a parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

qualitative or categorical variable

A
  • answer is a word or name that describes a quality of the individual
    -consist of attributes, labels, or non-numerical entries
    ex: major, favorite color, place of birth
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

quantitative or numerical variable

A

-answer is a number, something that can be counted or measured from the individual
ex: age, weight of a letter, temperature, time, distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

nominal

A

-data is just a name or a category
-there is no order and you can not do arithmetic
ex: gender, ethnicity, race, car name

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

ordinal

A

-data that is nominal, but you can put the data in order, since one value is more or less than another
-still cannot do arithmetic
ex: grades, place value in a race, size of a drink

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

interval

A

-data that is ordinal, but you can now subtract one value from another and it makes sense
-can do arithmetic but only adding and subtracting
ex: temperature, and time on a clock

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Ratio

A

-data that is an interval, but you can now divide one value by another and that ratio makes sense
-you can do all arithmetic on this data
ex: height, weight, distance, time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

counting numbers

A

-Integers, whole numbers, Natural numbers
- no in-between (fraction/decimal)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

measuring numbers

A
  • fractions, decimals, scientific notation
    -for any two distinct (different) numbers, there is always one between
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

random variable

A

-represents a numerical value associated with each outcome of a probability distribution
-denoted by x
ex: x= hours spent on sales call in one day

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

discrete random variable

A

-has a finite or countable number of possible outcomes that can be listed
ex: number of sales calls a salesperson makes in one day

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

continuous random variable

A

-has an uncountable number of possible outcomes, represented by an interval on a number line
ex: hours spent on sales calls in one day

22
Q

census

A

-not really a sample (try to measure all)

23
Q

simple random sample

A

every different possible sample of size n has the same (equal) chance of being selected

24
Q

stratified sample

A

divide into strata, randomly select some from each group
-seems like cluster sample but the strata are chosen specifically to represent different characteristics within the population

25
systematic sample
every nth for example
26
cluster sample
divide into clusters, randomly select clusters, sample some or all in selected clusters
27
convenience sample
-not statistically valid -known as grab or opportunity sampling -type of sampling that involves the sample being drawn from that part of the population which is close to hand -a population selected b/c it is readily available and convenient
28
problem with bias
-bias means that a sample does not represent the population from which it was drawn -conclusions drawn may not apply to the population as a whole -can be anticipated or unanticipated -randomization is the key to avoid bias
29
observational study
-when the investigator collects data merely by watching or asking questions. -do not change anything -very hard to establish cause and effect
30
experiment
-when the investigator changes a variable or imposes a treatment to determine its effect -intended to establish cause and effect
31
simulation
uses a mathematical or physical model to reproduce the conditions of a situation or process -often involves use of computers
32
Guidelines for planning an experiment
1. Identify individuals of interest 2. specify variable or (variables) 3. specify population 4. specify method for measuring or observing 5. determine sampling method 6. collect data 7. use inference to apply knowledge 8. refine (problems, recommendations)
33
randomized two-treatment experiment
-control is key concept, placebo is often used as a control
34
randomized block design
-a block is a group of subjects that are similar, but the blocks differ from each other. Then randomly assigned treatments to subjects inside each block
35
rigorously controlled design
carefully assign subjects to different treatment groups, so that those given each treatment are similar in ways that are important to the experiment
36
matched pair design
-the treatments are given to two groups that can be matched up with each other in some ways
37
replication
-no one is gonna make major changes based on a single study. That is what graduate students are for
38
Single blinded experiment
-subjects do not know if treatment or placebo
39
double blinded experiment
-the experimenters doing the experiment do not know if its a treatment or placebo like the subjects
40
cross-sectional study
data observed, measured, or collected at some point in time
41
retrospective (case-control) study
data collected from the past using records, interviews, and other similar artifacts
42
prospective (longitudinal or cohort) study
-data collected in the future from groups sharing a common factor -will follow and observe a group of subjects over a period of time to gather information and record the development of outcomes
43
lurking or cofounding variables
-when you cannot rule out the possibility that the observed effect is due to some other variable rather than the factor being studied -how to no do statistics
44
overgeneralization
-where you do a study on one group and then try to say that it will happen on all groups -how not to do statistics
45
cause and effect
-where people decide that one variable causes the other just because the variables are related or correlated -very difficult to establish cause from observation alone ex: cigarettes and lung cancer ex: climate change and fossil fuel
46
sampling error
-this is the difference between the sample results and the true population results
47
non-sampling error
-this is where the sample is poorly collected either through a bias sample or through error in measurement
48
statistical significance
-the results are probably not due to the chance of sampling
49
practical or clinical significance
-does the effect really matter
50
hidden bias
-where the questions are asked in a way that makes the person respond a certain way
51
non-response
where you send out a survey but not everyone returns it
52
voluntary response
-where people are asked to respond via phone, online, or email. Problem is only people who care will respond with a call or email