Week 2: Data capture, Acquisition and Big Data Flashcards
(78 cards)
The two types of data sources
Primary and secondary
Data requirements are driven by
Question parameters
Layers of data requirements (data slices) that represent the real world
- Your data
- Survey control
- Water features
- Boundaries
- Addresses
- Transportation
- Elevation
- Imagery
Foundation of all GIS processes
Spatial data
Data determines:
- What theme (issues/problems) you can answer
- What types of analysis (e.g forestry vs. emergency services dispatch etc.)
- Where (study area dimensions and location - a building in Dunedin or the entire South Island)
- When (time / date stamp of data)
- Quality of your analysis (e.g historical data, paper media, confidentiality)
What determines what spatial data is needed
- What questions are trying to be answered
- What data sets do you need to answer the questions
- E.g location of all trees over 2 metres in height in Dunedin
2 ways of acquiring spatial data for GIS?
- Data capture
- Data transfer
2 methods of data capture
- Create from primary sources
- Derive from secondary sources
How is data captured from primary sources
Field collection - GPS, imagery, tree heights, measurement
How is data captured from secondary sources
Scan or digitize from existing maps/images
1 method of data transfer
Acquire existing data
How is existing data acquired
- Government
- Commercial vendors
- Data agreements for limited use
- Open source
- Crowd sourced
Pros of collecting data yourself
- Control how and when data is collected
- Can specify spatial resolution, extent, and spatial and temporal collection parameters
Cons of collecting data yourself
- Can be costly (hire equipment, personel, aircraft, computer hardware)
- Often time consuming to plan and execute
- May not be possible to achieve required without specific equipment, expertise or permits
Pros of using existing data sets
- Can generally be acquired quickly if in digital format
- Numerous data source options (open source, government, commercial vendors)
Cons of using existing data sets
- No control over collection parameters - resolution, extent…
- May have limited metadata to describe how it was collected
- Fit for purpose - does it meet all data requirements
- Cost
5 stages in data collection projects
- Planning
- Preperation
- Digitizing / transfer
- Editing / improvement
- Evaluation
- Repeat
High quality georeferenced data represents how much of GIS projects resources
3/4
Data collection is a balance between
- Speed of data capture
- Data quality
- Price
What should be conducted if possible to test equipment and processes
Pilot study
How is data obtained from government sources
- Land information New Zealand (LINZ) - hardcopy topo maps, landonline for land parcels
- Department of conservation (DOC) - infrastructure and ecological regions
- Landcare research (land cover)
- GNS (surface geology, faults)
- Territorial local authorities: city councils - DCC
How is data obtained from commercial sources
- Eagle technology (ArcGIS dealer) - for StatsNZ census
- Critchlow associates (Mapinfo products), cadastral
- GeographX (digital elevation models, shaded relief images)
- Digital globe - high resolution global satellite imagery
How is data obtained from data sharing agreements
- Limited usage licenses
- Educational data sets
How is data obtained from open source
- Government agencies - LINZ - Topo Hydro, Elevation
- City councils - DCC - aerial imagery, street maps, rates maps
- USGS - Earth explorer - elevation, imagery, land cover
- Data warehouses and portals - NSIDC - National Snow Ice Data Center
- Commercial and independent sites - Koordinates.com - Pacific Data Hub (pacific nations data portals)