Lets have it lad Flashcards

(203 cards)

1
Q

Add together

A

=sum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what symbol in edit bar means auto sum

A

Ʃ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Multiply

A

*

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Divide

A

/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Subtract

A

-

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Mean

A

=Average()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Median for even numbers

A

Middle two, add together divide by two

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Median

A

Sort and filter icon, smallest to largest to put them in order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Mode

A

=mode()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Range

A

=Max() and =min()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Interquatile range

A

Put in numeric order
Split into 4 quarters
Subtract first value of 2nd quarter from last value of 3rd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Standard deviation

A

=STDEV()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Variance tpye

A

=VAR()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is variance

A

STDEv without square root

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you calculate stdev

A

Calculate mean, x with line above means mean
Difference of each x value and mean then square
Take sum of all square
Then divide by number of parts minus 1
Square root

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Square data

A

data^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the two types of statistical analysis

A

Descriptive stats

Inferential stats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is descriptive stats?

A

Statistical analysis to summarise main points/characteristics of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is inferential stats

A

Statistical analysis to infer something about a whole form sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Nominal data

A

PLace in categroies, labelled- options

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Ordinal data

A

Preferences shown and then presences ranked, scale 1-5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Interval data

A

Any values that have a consistent interval, how hot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Ratio

A

Has a defined 0 point, distance travelled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the mean very sensitive to?

A

extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is the median compared to the mean?
Much less sensitive, more robust
26
For nominal data what should be used, descriptive stats
Mode
27
What should you use for quantitative data?
Mean or median
28
When should you use median over mode
When. there are extreme values because you do not want a distorted average
29
When should you use mean over median
When there are no extreme values
30
What are the 4 measures of spread
Range Standard dev - most common Interquartile range Variance
31
What is the standard deviation used for?
Measure the variability in data
32
What does a high standard dev mean?
high change
33
wHat does a low standard dev mean>
low change
34
Example of standard dev?
LAI values of rainfall
35
What is variance
Calculates variation which is not in the same units as the data (squared units)- less common
36
What are histograms?
Graphic depiction of the shape of the distribution of data- most common
37
What is the issue w to many intervals in a histogram?
Too complex
38
What is the issue with too few intervals in a histogram?
Detail is lost
39
What is the ideal number of intervals in a histogram?
Ideal=10 to 20
40
4 measures to describe the similaries/differences of frequency distributions
Central tendency Spread Skewness Kurtosis
41
Skewness
Measure of asymmetry in distribution
42
Skewness, low numbers
Positively skewed distribution
43
Skewness High numbers
Negatively skewed distribution
44
No skew
0
45
Sew =
Mean-median
46
+ve value skew
+ve skew
47
-ve value skew
- e skew
48
Kurtosis
Measure of how flat or peaked a distribution is
49
Kurtosis, Platykurtic
Relatively flat distribution w no obvious peak
50
Kurtosis, Leptokurtic
strongly pronounced peak in data
51
Culmative frequency graphs, steep slope
Intervals with many data points
52
Culmative frequency graphs, shallow slope
intervals with few data points
53
What are calmative frequency graphs used for )example)
grain size statistics
54
What is used to raph 2 variables
Scatter plots
55
Nominal data os
-Categroical data
56
What is nominal data frequnctly expressed as?
Pie charts, good at showing proportion
57
Cna histograms still be used for nominal data?
-Yes, intervals=categories and frequencies = number of x in each category
58
How is nominal data expressed?
Histograms and pie charts
59
What do frequency distribution histograms show?
Visually describe distribution and indetify skewness and kurtosis
60
How do you decide size of intervals on histogram
Total (highest value) divided by the number of intervals you want
61
How do you calculate the number of individuals in each class interval
``` =fequencey(data cells, class interval cells) highlight range of cells to put the answer into first ```
62
How do you produce freuencey distribution histogram?
insert tab-charts-coumn
63
Skew function
=skew()
64
What is a normal skew distribution?
0
65
Why are sampling and inferential stats important
- Rare to be able to sample a whole population | - Use characteristics of sample t infer
66
What is random sampling
Selecting individuals with no bias
67
What is systematic smapling
Individuals selected in a regular way
68
What is spatial sampling
individuals are selected at regular spatial intervals
69
Criteria for truly random samples
- Every individual has an equal chance of inclusion throughout the procedure - Selection of any individual should not affect the chance of selection for another
70
Positive/negatives of systematic sampling
+allows fair/even coverage of range of individuals | -Not fair and equal chance of being chosen, can produce bunching of sampled individuals
71
Two key assumptions which underpin most inferential stats
- random sampling | - population has a known distribution
72
What is a parameter
number that describes data from a population
73
What is a statsistic
A number that describes data from a sample
74
How to use random generator on excel
=RaNDBETWEEN(1,200), pulldown for cells below, rapsate answers and click paste special
75
Where to find look up feature
Lecture week 4
76
What is a hypothesis?
Proposed explanation for narrow phenomena, based on a range of things e.g. background scientific knowledge, preliminary investigators, logic, etc.
77
What is a theory
structure conceived by human imagination to explain how/why patterns occur in observed data - often broader and can integrate many hypotheses - new oe very well tested - can be used to generate hypotheses
78
What must hypotheses be to be a science?
testable
79
Why can descriptive statistics be used to make hypotheses?
We can make hypotheses based on observed patterns
80
Hypotheses can be formalised for what?
statisical testing
81
Null hypotheses, symbol
H0
82
What is a null hypothesis
accepted fact which is nullify able/invalidatable
83
Research hypotheses
H1, what we want to find the answer to
84
What is a 2 tailored test
Test to look into difference of means , could be 2 stats
85
what is a 1 tailored test
Only one way it could happen
86
What happens if there is evidence of statically significantly different between sample and overall mean?
We reject null and accept h1
87
What happens when we cannot reject null
If null hypohtesis cannot be statically evidence against, then it has to be accepted
88
What does an alternative hypothesis do?
Reject null
89
What is the only purpose of inferential stats?
Answer the quesiton
90
What is simulation modelling?
Testing all outcomes that might occur due to random variation and sampling- it is very slow and inelegant for well known process, but hd to do it top prove a point
91
Null hypotheses is what we?
test to reject, from which we may accept
92
What must we do to apply statistical tests?
We need to generate full hypotheses
93
What do inferential stats allow us to do?
Use sample statistics to comment on a populations paramters
94
Mode function
=Mode()
95
Function for m, slope
=slope()
96
Function for y intercept of line, c
=intercept()
97
Function for correlation between populations, r, Pearsons
=pearson()
98
r, correlation function
=correl()
99
Variance function
=VAR
100
What are sample figures called? (mean, mode and median)
Statistics
101
What are the true mean and median of a population called?
Parameters
102
What is used to estimate a parameter?
Statistics
103
What do statistics describe?
Sample
104
What do parameters describede?
population
105
How likely is it that a sample will give exact estimates of the population characteristics?
unlikely
106
What are different samples of the same population likely to compare?
Unlikely to have the same estimates of the population
107
What is sampling distribution?
Distribution of a large number of sample statistics
108
What is confidence interval?
Used to assess the accuracy of parameter estimates... and this is what allows us to test hypotheses
109
What is standard error?
Standard deviation/square root of sample size
110
Confidence interval
mean of x, plus or minus the standard error over the route of the sample size, times by the chosen Z value
111
What is standard deviation of the means called?
Standard error
112
What does standard deviation quantify?
The variation within a set of measurements
113
What does standard error quantify?
the variation in the means of multiple measurements, can still do form single measurements
114
Why statistical tests are needed?
Sample data, each mean value is to the same degree in error (it differs from the actual or population mean value of that site
115
Why can difference in mean values from same population occur?
Sampling chance
116
What is statistical significance?
level of risk ( a value); the risk of not being correct if we accept or reject null hypothesis
117
What does a lowering statistical significance mean?
Higher confidence
118
What do parametric statistical tests involve?
Assumptions of populations | -Populations should have some variance, independent date and hypothesis usually concern population ean
119
What are non parametric test in comparison?
-Have fewer assumptions and are more robust, handle non classical distributions e.g. chi squared
120
What stats test to use for nominal data?
test for promotion, difference of two proportions, chi squared independence
121
What stats test to use for interval ratio?
Most commonly mean, difference of two means and regression analysis
122
What is ordinal data?
Between both nominal and interval data
123
If there is one sample...
test for proportion and test for mean
124
If there is two samples..
Difference of two proportions and difference of two means
125
What test if there is one sample with two measures?
chi squared , regression and difference of two emasn
126
What if you are testing for a value?
test for proportion, test for mean and difference of two means
127
What if you are comparing 2 statistics?
Difference of tow proportions/means
128
Working out a relationship?
Chi squared, regression analysis
129
What is the T - Test for?
Individual sampels
130
What does t mean?
Difference between means/standard error of that difference
131
What is critical value calculated based on?
Degrees of freedom and significance level
132
What is the rejection reigon?
Part of the probability distribution beyond the a critical value of test statistic
133
What is regression?
Nature of relationship between two variables
134
What is measured by a correlation coefficient?
An association between two variables - poistive or negative - linear or non-linear - visualization or scatter plot
135
Positive correlation
/
136
Negative correlation
\
137
Linear correlation
Close together and ordered
138
+1 correlation
Perfect correlation
139
0 correlation
no association
140
-1 correlation
Perfect negative association
141
0.00-0.19 correlation
very weak
142
0.2-0.39 corerlation
weak
143
0.4-0.69 correlation
modest
144
0.7-0.89
strong
145
0.9-1.0
very strong
146
What are the different estimates of correlation coefficient?
Parametric and no para metric
147
what is the parametric correlation estimate?
Pearsons product, moment correlation coefficient - asses normal distribution needs values
148
What is the non-parametric correlation estimate?
Spearmans rank korrelation coeffiencent- more effective, only needs rank
149
Pearsons rank
_requires interval scale (continuous data) -Parametric assumes variables are normally distributed Relationship between two variables tested is linear -Should be in a consistent idrection
150
Spearmans Rank
Ordinal or interval dara Assumes direction of relationship consistent only 91% of power?
151
Spearmans rank equation
Look online
152
Pearsons rank equation
Look online
153
Steps of a stats test
-1 A Statement of the null hypothesis Set the level of risk, statistical significance (Alpha value) Select the appropriate statistical test and compute the test statistic value Find critical value from published tables Accept or reject hypothesis
154
What is a correlation coefficient?
Numerical index that reflects the linear relationship between two variables, which ranges from -1 to 1
155
Two types of correlation coefficient used
Spearmans rank and Pearsons
156
How can the significance of Roy and rs be tested?
Using the T test
157
How do you produce a scatter plot
highlight data, inset select x-y scatter chat type with only dots. Right click to alter, select chart layout tab to do the
158
What should be done for sample data to get an unbiased estimation of rs values?
Coefficient should be multiplied by n-1/m
159
How do you work out spearman's rank
find difference between each pair of ranks, copy formula down column - sqaure the rank difference, pull down column - Add squared diffrences - Tied dat= complex - Non tied data = simple
160
How do you work out spearman's rank using a scatter graph
scatter graph, add trend line, chart elements-add. Double click trend line and click show more. Square root r2 (squared) value for (Coefficient of determination) Pearsons
161
How to test significance for correlation data
put r value into T=r route over brackets with n-2 over 1-r squared inside
162
Regression
A parametric stats technique for identifying the relationship between dependant variable and one or more independent variables
163
What does a correlation tell us?
If 2 variables vary together
164
What does a regression describe
Functional relationship between 2 variables
165
What is the regression coefficient?
``` y=a+bx y=dependant variable a=intercept on y axis b= slope gradient x=independant variable ```
166
What does the regression coefficient show?
Provides diagnostic details that indicate the quality of the model fit -used to provide a simplified relation between the two variables to evaluate the strength of the relation and correlation of the model based on sample data
167
Dependant variable
Depend on the values of other variables
168
Best fit line+
Sum of the square residuals minimised- determine b (slope) and a intercept for best fit line
169
Slope = b , equation
Sum of (x-(mean X) X (y-(mean y) over Sum of x-(mean X) squared
170
What is the coefficient of determination?
r2 (squared) between 0 and 1.0 | - What proportion of total variance in the dependant variable is accoiunted for by the regression model, squared peasn
171
What do higher rates of the coefficient determination mean?
Less scatter along the line
172
What must be apparent for a relation to exist in the coefficient determination?
b must be different from 0 for a relation to exist
173
What is the T test used for with regardless to the slope?
See whether it is significant or not | T test can be used for both slope (b) and interecpet
174
If there is no relationship between X and Y what would the coefficient be expected to be?
0
175
How can you also test the slope?
Convert calculated value for b into units of T to to compare T statistic with T critical value- the larger the T value the less likely that the slope coefficient for the sample arose from random sampling of variables that are not related
176
Standard error of the estimate
Measures the amount of variability in the points around the regression
177
Limitations of linear regression?
-Interval scale (continuous data) required Data should be approx. normally distributed -Equation should not be used to predict values for beyond limits of original data -Relationship assumed linear shears non linear fit may provide better fit to same data sets. -Residuals of regression should be approx normally distributed with a mean of 0. Variance of y about regression line does not vary markedly over range of x
178
How do you add linear regression line to the plot?
left click on data series to select- layout out tab, analysis group and click on add trend line- more trend line options and linear regression, check boxes to display the lines equation and r squared
179
What is the slope function to obtain b data ?
=Slope (Known y's, known x's)
180
What is the function to obtain intercept data, a?
=intercpet (known ys, known XS)
181
Pearsons function
=Pearson()
182
How do you calculate regression using the tool?
Select regression from data analysis tool data tab- data analysis group, if not options file- options add in
183
what test for slope coefficient hypothesis?
T test
184
The larger the value of t...
the less likely that the slope coefficient for the sample arose from random sampling of variables that are not linearly related
185
What is confidence interval related to?
Standard error of the estimates
186
What is the standard error of the estimates S.E
Average amount of difference between sample and population characteristics- in regression can be calculated in excel
187
What can be SE be used to calculate?
Confidence interval given a confidence level
188
What is the confidence interval for slope coefficient?
(b-2s.e. b +2s.e)
189
What is Anova?
Analysis of variance- test for multiple samples from same n
190
Why are degrees of freedom useful?
In finding the critical value in the table
191
Degrees of freedom regression
K-1
192
Degrees of freedom
N-K
193
Degreees of freedom
N-1
194
What is the significance level
P value
195
How do you find the T stat for the intercept?
a/s.e
196
How do you find the t stat for the x variable?
b/s.e.
197
What does 0.004 p value means
4% chance the relationship occurred randomly
198
What does 0.04 p value means
4% chance the relationship occurred randomly
199
Typically what does a p value less than 5 or 10% mean?
It is significant
200
What are the limitations in regression analysis?
- iinterval data required i.e. continuous - Data should be approx. normally distributed - Regression equation should not be used to predict values for beyond the limits of original data - The relationship is assumed linear shears non linear best fit line may provide better fit - Homoscasticty, variance of y about the regression line does not vary markedly
201
How should residuals of regression be approx distributed?
normally w mean of 0
202
What should residuals not show?
Trend, slope pof regression residuals on x should=0
203
What should residuals not show?
Trend, slope of regression residuals on x should=0