Data, Graphs and Statistics Flashcards

(98 cards)

1
Q

Probabilistic notation form?

A

P(event)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does 0 mean in probability?

A

Impossible event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does 1 mean in probably?

A

Certain event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is theoretical probability?

A

Does not take take experimental data or bias into account

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Formula for theoretical probability?

A

Number of ways even can occur
P(Y)= ————————————————
Total number of sample points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is experimental probability?

A

Determined by performing an experiment and using data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Formula for experiment probability?

A

Number of ways event does occur
P(Z) = ———————————————-
Number of times experiment repeated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an independent event?

A

The results of one event does not affect the result of another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to calculate the probability of multiple independent events?

A

Multiply together probabilities

May need to sum together and multiply more if we do not care about the order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to use a tree diagram?

A

Multiply along the branches

Add up probabilities at end if needed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a depend event?

A

The outcome of the first event affects the second event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the expectation is probability?

A

The value we expect to happen

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How to work out expectation?

A

Multiple each outcome by its probability

Add all these together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the expected frequency?

A

Used when expectation is not possible. E.g. cannot multiple green by 7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How to work out expected frequency?

A

Multiply the probability of the outcome by the amount of times we did the experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is continues data?

A

Data can take any value, any number of decimals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Why is discrete data?

A

Only can be from a specific set of numbers, usually a whole number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is categoric data?

A

Non-numerical categories. Can’t usually be ordered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What graph is used for categoric data?

A
Bar chart (horizontal bars)
Pie chart
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

How to change percentage to degrees for a pie chart?

A

Multiply by 3.6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What type graph is used for discrete data?

A
Bar chart (vertical bars)
Pie chart 
Histogram (useful for date with ranges) they represent frequency by the area of the bar rather than height
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What type of graph is used for continuous data?

A
Line graphs 
Scatter graphs (when both axis are continuous)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What do scatter graphs show?

A

Used to show whether there is a relationship between two sets of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a negative correlation?

A

As one quantity increases the other decreases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is a positive correlation?
As one quantity increases so does the other
26
What does no correlation mean?
No clear relationship between quantities
27
What type of graph has a best fit line?
Line graph NOT scatter graph
28
Reliability definition in regards to line graphs?
If all the points are close to the line of best fit with no anomalies
29
What does proportional mean in regards to line graphs?
Only if the line of best fit passes through the origin
30
What is the name of the equation that the graph represents?
The function
31
What is the domain?
The spread of the x values
32
What is the range?
The spread of y values
33
On a domain/range line what does a hallow or full circle mean?
Hallow means greater than or less than | Full means greater than, less than or equal to
34
How should decimals and significant figures be used in a table?
Going down a column where data is being measured there should always be the same number of decimal places If data is being added or subtracted decimal places should also be maintained If data is multiplied or divided then significant figures should be maintained rather than decimals
35
What is central tendency?
As we get more observations we tent towards a central value rather than an extreme value
36
How to calculate arithmetic mean?
Sum of all the values and divided by the number of values
37
Advantages of arithmetic mean?
Most efficient measure of central tendency
38
Disadvantages of arithmetic mean?
Can be substantially affected by large outliers
39
How to work out the mode?
The most common value
40
Advantages of the mode?
Can be used for categoric data. There can be multiple modes.
41
Disadvantages of the mode?
Poor representation of actual most likely value as could easily be due to chance
42
How to calculate the median?
The middle value once organised in numerical order. Or the average of the two middle terms.
43
Advantages of the median?
Immune to outlier and skewed data, very good measure of central tendency
44
Disadvantages of the mean?
Cannot be used for nominal data
45
What is nominal data?
Categoric data that cannot be ordered
46
What is ordinal data?
Categoric data that does have an order
47
What is the range?
Subtract the lowest value from the highest value
48
Disadvantages of the range?
Affected by outliers
49
What is the interquartile range?
Splits data into four and finds the difference between end of 1st and end of 3rd quartiles
50
Formula to work out 1st and 3rd quartile?
Q1=1/4(number of results+1) | Q3=3/4(number of results+1)
51
Advantages of interquartile range?
Removes outliers
52
How to correct quartiles figures when the formula produces a decimal?
Q1 .25 need to average the terms either side Q1 .75 round the term up Q3 .25 round the term down Q3 .75 average the terms either side
53
How to work out mean from a table of continuous data?
Find an average of each variable and times by frequency | Add all these together and divide by total frequency
54
What is an uncertainty?
How far out a reading may be. E.g. +/- 0.01cm
55
How to calculate absolute uncertainty in data with only one reading?
The value followed by the precision of instrument. This also works for repeat readings with the same values.
56
How to calculate absolute uncertainty with repeat readings?
Multiply the range by 0.5 | Value looking at +/- figure found from range
57
How to calculate percentage uncertainty?
Absolute uncertainty ——————————— X 100 Value Written as value with +/- percentage after (maximum % of 2 significant figures)
58
How to combine uncertainties?
Add the percentage uncertainties together for multiplication and division Add the absolute uncertainties for addition and subtraction
59
How to calculate absolute uncertainty from percentage uncertainties?
Percentage uncertainty ————————————- x value 100 Never more decimal places than the value
60
How to work out variance?
Standard deviation squared
61
Difference between sample and population?
Population measures everyone/thing needed | Sample only measures a few out of a population
62
How to work out sample standard deviation?
``` First calculate mean Subtract the mean from every outcome and square the result Add all results together Then use formula: S = √number found previously ————————————- Number of results - 1 ```
63
What is normal distribution?
A probability distribution that suggests that most values cluster around middle of the range and taper off at each end A bell shaped curve
64
Features of the normal distribution?
Symmetric Centred on mean Continuous All values are possible but those at extremes are unlikely
65
How does normal distribution relate to standard deviation?
68% within 1 SD 95% within 2 SD 99.7% within 3 SD
66
How to work out probability of something being ‘less than’ from normal distribution from standard deviation?
Value - mean ———————- Standard deviation Look at table and see what figure corresponds
67
How to work out probability of something being ‘more than’ from normal distribution from standard deviation?
Calculate the ‘less than’ value and subtract this from 1
68
How to find equation of straight line?
Work out relationship between x and y Eg y =2x Should always start with Y = General equation of Y=Mx+c
69
What describes the gradient?
Value in front of X from equation
70
What does y=mx+c show?
``` Equation of a straight line Y= y axis M=gradient X= x axis C= constant (the Y intercept) ```
71
How to find the gradient?
``` Choose two coordinates on line Use formula: Y2-Y1 ———— X2-X1 ```
72
The independent variable is usually the ..... axis?
X
73
The dependent variable is usually the ..... axis?
Y
74
What type of data is Poisson distribution used for?
Discrete | Usual rare events
75
Lambda meaning?
Mean of all the expected number of results
76
What type of date is binomial distribution used for?
Discrete | But only two options (success or failure)
77
How to predict successes from binomial data?
Probability multiplied by number of experiments then look on probability graph
78
Formula for probability from Poisson distribution?
??
79
What is parametric data?
Date that is normally distributed e.g. height, weight
80
What is a null hypothesis?
No significant correlation
81
Hypothesis testing for parametric data?
Paired t-test Unpaired t-test Pearson correlation ANOVA
82
Hypothesis testing for non-parametric data?
Chi squared test Mann-Whitney U test Spearman correlation Kruskal-Wallis
83
What is a significance level?
How likely something is due to chance | P=
84
What is the chi squared test used for?
Discrete data | Difference between what is expected and what is observed
85
Chi-squared test procedure?
1) state null hypothesis 2) calculate expected frequency 3) use formula: (Observed value - expected) (2) X(2)= The sum of—————————————————- Expected value Easiest to calculate in a table 4)calculate degrees of freedom formula: DF= number of classes-1 5) look up in table 6) accept or reject null hypothesis including significance level and degrees of freedom
86
Degrees of freedom formula when working with distributions?
DF= number of classes- 1 - number of estimated parameters Estimated parameters depends on whether binomial, Poisson or normal. Binomial is 0 Poisson is 1 Normal is 2
87
What is a tangent?
A straight line that to measure the gradient of a curved graph
88
How to add an error bar to graph?
Add lines up/down/side to side of the uncertainty associated with value Helps to add line of best fit, must pass through all error bars
89
What is a worst best fit line?
As steep as it can go but still within all error bars and as shallow as it can go with another line
90
Purpose of worse best fit line?
Can be used along side best fit line to see the range of gradient readings Find different between best gradient and worts gradient to find uncertainty
91
How to find the area under a graph?
Divide space to rectangles and triangles Base x height for rectangles 1/2 base x height for triangles Add together If graph is curved Count all squared, not including any less that half full
92
What is the students T-test for?
When we have two samples to compare and then generate claims about the population
93
Assumptions of students unpaired T-test?
Data normally distributed Unpaired data Unequally variance
94
What is unpaired data?
Data from samples that do not impact on each other (different samples)
95
What is paired data?
Data that does effect one another (same sample)
96
How to use unpaired T-test?
1) find amount measured, mean and standard deviation for both groups 2) state null hypothesis 3) t= mean of first group- second group ——————————————————————— Variance of 1st group Variance of 2nd group √(————————————) +(——————————-) Number of obs. Number of obs. 4) ignore any negatives 5) calculate degrees of freedom Obs. in group 1 + group 2 - 2 5) look up value in table 6) accept or reject null hypothesis at required significance level 7) state a conclusion
97
How to use paired T-test?
1) find mean and stand deviation of differences 2) state null hypothesis 3) t=mean of differences √ number obs. —————————————————————— Standard deviation of differences 4) ignore any negatives 5) calculate degrees of freedom Number of obs. - 1 5) look up value in table 6) accept or reject null hypothesis at required significance level 7) state a conclusion
98
Assumptions of students paired T-test?
Data normally distributed Paired data Unequally variance