Lecture 10 - Spatial Data Analysis Flashcards
What is the objective of spatial analysis?
To transform data into useful info to satisfy requirements/objectives of decision-makers at all levels of detail
What does spatial analysis help us to do?
- distinguishes GIS from other info systems
- identify trends in data
- create new relationships from the data
- view complex relationships b/w data sets
- make better decisions
Why do analyses?
Analysis functions use spatial and non-spatial attributes in database to answer questions about real world
Why do we do spatial analysis of massive data sets?
- search for patterns, anomalies, and trends
- semi-automated b/c of data volumes
- widely used in practice in business (unusual credit card usage), epidemiology (detection of disease outbreak), and climate change (data patterns lead to discovering ozone hole)
What are descriptive summaries?
- attempt to summarize useful properties of data sets in one or two statistics
- mean/average widely used b/c measures central tendency
How do you find a spatial mean?
Partial (2D) equivalent of the mean would be a centre. Several ways of defining centres:
- centroid
- dispersion
What is a centroid?
The 2D equivalent of the mean.
- for a set of points, the centroid is found by taking weighted average of x and y coordinates
- balance point: the point about which the 2D pattern would balance if it were transferred to a weightless, rigid plane and suspended
- ex. population centroids in US
What is dispersion?
A measure of spread of points around a centre (useful for determining positional error)
- w/o knowing about how data is dispersed, measure of central tendency may be misleading
- measures of dispersion include: range, average deviation, variance and standard deviation
What are spatial patterns? How can we identify spatial patterns in the data?
Can be clustered, dispersed, or random.
Techniques for identifying:
- measures only looking at physical location and geometric patterns of features (ex. unlabelled points)
- measures looking at geographic patterns in attribute data of features (ex. labelled points)
What is Tobler’s first law of geography?
Everything is related to everything else, but near things are more related than distant things
- fundamental concept for understanding/analyzing spatial phenomena
What is spatial autocorrelation?
Correlation of a variable with itself through space
- measures and analyzes the degree of spatial dependency by comparing values of a sample with values of their neighbours
- if there is any systematic pattern in the spatial distribution of a variable, it is said to be spatially autocorrelated
What is positive spatial autocorrelation?
All similar values are located close together
- may violate assumptions about independence of residuals (diff b/w observed and predicted value)
What is negative spatial autocorrelation?
Dissimilar values appear in close association (neighbouring areas unalike)
What is random patterns in spatial autocorrelation?
Exhibits no spatial autocorrelation
Why is spatial autocorrelation important?
- most stats based on assumption that values of observations in each sample independent of one another
- most GIS provide tools to measure the level of spatial autocorrelation
Goals:
- to measure strength of spatial autocorrelation in a map
- to test the assumption of independence or randomness
What are the main indices of spatial autocorrelation?
- Moran’s I
- Geary’s C
- Ripley’s K
- Join Count Analysis
What do indices of spatial autocorrelation do?
Measures level of interdependence b/w variables and nature and strength of that interdependence
- tests whether the observed value of a variable at one locality is independent of the values of the variable at neighbouring localities
How does Moran’s I work?
Based on p-values.
Ex. Moran’s I = 0.66, then p-value is not statistically significant and you cannot reject null hypothesis
Ex. 2 Moran’s I = 0.015, then p-value is statistically significant, so you can reject null hypothesis
What are spatial modes?
Attempt to represent variation over the earth’s surface (GIS provides suitable platform)
- manipulate geographic info in multiple stages (representing either a single point in time or predictions about future points in time)
What are the benefits of the modelling approach?
- allows experiments to be conducted on simulated systems rather than on the real thing (cheaper and less invasive)
- allows alt scenarios to be evaluated (compare diff policy options and their impacts on the future; planners can experiment “what-if” scenarios)
What are the different types of spatial models?
- static models and indicators
- individual and aggregate models
- cellular models
What are static models and indicators?
- represents a system at a single point in time
- take multiple GIS inputs and compute useful indices
- ex. groundwater protection model
What are individual and aggregate models?
Individual: simulates behaviour of every individual in system (ex. every person in a crowd)
Aggregate: used when too many individual elements to model (ex. impossible to model every molecule of water, grain of sand, etc.)
- ex. simulation of movement of individuals during a parade
What are cellular models?
- model a system based on using raster GIS
- each cell in the raster can be in one of a number of states (ex. change through time is represented by change of cell state)
- ex. urban growth simulation where each cell is either undeveloped or developed