exam 2 Flashcards

(44 cards)

1
Q

descriptive, differences, correlation/regression, association projects (differences, projects)

A

descriptive - do not carry out hypothesis, the goal is to describe the situation (various statistical measures may be important), histogram, density and boxplots
differences - compares two or more sets of data (hypothesis will relate to differences you believe may exist), bar charts, side by side boxplots
correlation/regression - attempts to link variables (looking for strength and direction of links between variables), scatterplots, line plots
association - emphasis on links between variables that are categorical, bar charts, pie charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

null and alternative hypothesis

A

we test the null hypothesis, data is gathered to test null
we do not prove the alternative hypothesis, the most we can do is find support for it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

possible outcomes from hypothesis testing

A

reject and fail to reject null - reject = null is not accurate, fail to reject = null is accurate
p-value - the probability that the null hypothesis is correct from the data gathered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

histogram

A

descriptive test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

boxplots

A

descriptive, difference (side by side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

bar charts

A

differences, association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

scatterplots

A

correlation and regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

line plots

A

correlation and regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

pie charts

A

association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

histogram in r

A

hist(object)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

boxplot from object in r

A

boxplot(object)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

set scale of axis in r

A

ylim=c(0,0) xlim=c(0,0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

add axis label and graph title in r

A

xlab = “Title”
ylab=”Title”
main=”Title”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

change colors of bars or boxes in r

A

col=”Color”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

popperian philosophy

A

we learn by being wrong, no amount of evidence can prove something is true (empirical falsification)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

testing a null/ reshuffling

A

to determine what no change would look like, create data that would be reasonable for the system (after plenty of research about what is realistic) to come up with more data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

level of probability that scientists use as a threshold for deciding how to interpret hypothesis

A

.05 p-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

basic study set up for a t-test

A

create hypothesises, collect data, data must be normally distributed, each data point must be independent

19
Q

what happens to t when variables are changed

A

when t increases, mean difference increases, when t decreases, standard deviation increases, when t increases, n increases

20
Q

what test to do to determine if data are appropriate for t-test, how to interpret

A

find if the data are normal (boxplot or shapiro-wilk test)
greater than .05 = the data is normal and a t-test can be done

21
Q

t-test in r

A

t.test(object)

22
Q

one tailed vs two tailed t-test

A

one tailed - more power to detect directional effect (greater than or less than)
two tailed - shows evidence that the difference between means is greater than expected

23
Q

paired t-test

A

repeated observations collected for a single variable with 2 levels (differences between sample point 1 and sample point 2 are compared for the same sample unit)

24
Q

non-parametric test

A

use the rank of data and rank from smallest to largest, compare the ranks
mann-whitney (two sample) and wilcoxon (paired) tests

25
import data as .csv into r
data<-read.csv(file.choose())
26
how to take one column and create an object in r
object<- dataset$column
27
plotting boxplots and histograms in r
boxplot(object) hist(object)
28
checking for normality in r
shapiro.test(object) OR wilcox.test(object)
29
basic code for t-test in r
t.test(object)
30
how to tell if data are normal/not normal
parametric - do a shapiro wilk test non parametric - do a mann-whitney or wilcoxon test
31
how to deal with not normal data
can be log transformed OR use non-parametric tests
32
how to use a non-parametric test
mann-whitney test - equivalent of 2 paired t test, compares the observed difference in mean of ranks to the maximum possible difference in the mean of ranks wilcoxon test - matched pairs, compares the ranks of differences
33
how to decide how much data is needed to collect
preliminary sampling - small study to refine and evaluate sampling size, acceptability, feasibility, and cost of larger study (determines problems and best methods) dummy data - learn everything and then make up what you think are plausible data primary literature investigation - learn everything you can about your system and others like it (what are other people doing) update analysis as you collect data
34
when do we have independent replicates and when do we not
pseudoreplication - if replicates are "tied" to each other in some way all independent data points must have no connection to other data points
35
simple pseudoreplication
only a single replicate per treatment and subsamples are collected from each area
36
sacrificial pseudoreplication
experimental units are replicated
37
temporal pseudoreplication
only a single replicate per treatment and subsamples are collected from it repeatedly over time
38
phylogenetic pseudoreplication
closely related individuals are the units being sampled (seeds, tadpoles, insect larvae)
39
technical replication
different observers or instruments are used for different parts of the experiment
40
converting continuous variables to catagories
can limit the amount of data visible, can give an inaccurate and varying perspective of the results
41
true positive (p-value)
when Ho is true and we fail to reject
42
true negative (p-value)
when Ho is false and we reject it
43
false positive (p-value)
when Ho is true and we reject it
44
false negative (p-value)
when Ho is false and we fail to reject it