Statistics, graphs and maps Flashcards
(38 cards)
Measures of central tendency methodology (mean, median and mode)
- Tells us the middle point in a set of data
- Mode is the number that appears most often in a set of data
- Median is the middle point in a sorted list of numbers from low to high. If there are an even amount of numbers in the set then the median is the mean of the two middle numbers
- Mean is calculated by adding all the numbers in a set up and dividing by how many there are
Measures of central tendency considerations and limitations
Outliers have large effect on the mean and can throw off an average if a data set
Measures of dispersion methodology (range and IQR)
- Tells us spread of data around a central value
- Range is the maximum minus the minimum value on a data set
- The interquartile range (IQR). A quartile is one of three point that divide a range of data into four equal parts. Each quartile is 25%. The IQR is the difference between Q3 and Q1. IQR = Q3 - Q1. Less sensitive to outliers.
Measures of dispersion considerations and limitations
Normal distribution is assumed
Chi-squared analysis advantages and disadvantages
Advantages:
1. Can test associations between variables
2. Useful in measuring the differences between observed and predicted data
3. Useful when data can be grouped into classes
4. Chi-squared value can be compared to significance tables
Disadvantages:
1. Cannot use percentages
2. Data must be in the form of frequencies
3. Number of observations must be more than 20
4. Difficult formula so fairly complicated and time consuming to get right
Chi-squared analysis definition
Used to compare observed data with data we would expect to obtain according to a specific hypothesis
Spearman rank correlation coefficient purpose
Measures strength and direction of a relationship between 2 variables
Spearman rank correlation coefficient considerations and limitations
- Extreme values won’t effect result
- Fairly quick and easy to calculate if data set isn’t too large
- Only tests linear relationship
- Can be time consuming if data set is large
- Pearson’s tends to be more accurate as it uses actual ,rather than ranked, values
Pearson product moment correlation coefficient (PPMCC) purpose
Measures strength of a linear relationship between two variables
Pearson product moment correlation coefficient considerations and limitations
- Uses actual values rather than ranks so more accurate and reliable than Spearman’s when drawing conclusions
- Differences between values in the data set are taken into account so more accurate than ranking
- Larger, more tedious and more complicated than Spearman’s
- Assumes that the data had normal distributions
- Affected by extreme values
Linear regression (best fit line on scatter graph) purpose
To determine the linear relationship between one dependent variable and one independent variable
Linear regression considerations and limitations
- Allows accurate best fit line to be drawn
- More accurate than drawing best fit line by eye or your own judgement
- Cannot be used for non-linear relationships
- Time consuming and complicated to carry out due to using a complicated mathematical formula rather than arithmetical calculations
Nearest neighbour analysis purpose
Used to measure distributions according to where they are clustered, random or regular
Nearest neighbour analysis considerations and limitations
- Better than a visual observation which can be subjective
- Allows you to compare distribution of either 2 similar or 2 different areas
- Need a minimum of 30 points
- Large number of points can make the calculation tedious and time consuming
- Doesn’t take other controlling factors into account e.g natural resources
Bipolar analysis purpose (EQS)
Used to compare areas, people or gathered data
Bipolar analysis advantages and disadvantages
+visual, patterns identified easily
+can compare lots of diff aspects on same graph
- can become cluttered if too many areas are compared
- results are based on people’s perceptions which may be biased
Dispersion diagram purpose (test tube with dots)
Shows the range of a group of drama and their tendency to group or disperse
Dispersion diagram advantages and disadvantages
+ shows spread from the mean
+ patterns and anomalies easily stand out
+ other calculations can be made e.g. mean, mode, median and range
+ can calculate standard deviation
- can become easily crowded when more than one thing has the same value
- further statistical analysis requires as the spread does not tell you everything about a set of data
Kite diagrams purpose
Shows density and distribution of vegetation or species along a transect
Kite diagrams advantages and disadvantages
+ useful for displaying changes over a distance
+ patterns and trends are made clear
- calculating a % to put on the diagram can be difficult
- time consuming to plot
Logarithmic graph purpose
Used when values on a scale are too large to show on linear graph paper (increases by multiplication rather than addition)
Logarithmic graphs advantages and disadvantages
+ allows you to plot data that greatly differs in size on the same graph
+ can easily see patterns between values which would have been hard to see from raw data alone
- complicated to draw to appropriate scale
- complicated to read
- if changes are small they can be hard to see, particularly with higher values
- difficult to plot values accurately
- time consuming to draw
Polar graphs purpose
Used to show direction as well as magnitude
Polar graph advantages and disadvantages
+ directions can be seen clearly, very visual
+ info from different variables on the same graph
+ quick to read
+ quick to draw IF given a template
- hard to draw a suitable scale
- hard to draw concentric circles
- time consuming without template
- can become cluttered in the middle
- if there are huge variations in the data the circle may be huge