Topic 8 Flashcards

(36 cards)

1
Q

what is a histogram

A

graph showing the frequency of measurements/obeservations plotted against the range of observations

an important data exploration and summary tool

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

explain modality and symmetry

A

median - 50% higher and 50% lower
skew… what way is the tail facing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

symmetrical distribution

A

mode, median, mode are coincident

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

modality

A

when there are more than one value with a high frequency

greatly impacts the use of median and mean measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

sample mean vs population mean

A

divided by the sample number or entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

we use deviation from mean ro deviation from the median

A

median is more popular because it doesnt get so easily affected by outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

standard deviation is important with data transformations T/F?

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is data normalization

A

raw totals (Numerator) are standardized against a denominator
min-max scaling
comparing something to make it comprehensible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

standardization (z-score normalization)

A

types of normalization
denominator is standard deviation
max is 1
min is 0
goes to 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

min-max scaling

A

important for rasters
range of data
min and max values
does not go to 0 only to the min value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

how do you know when you need to standardize your data?

A

know your data before normalizing it. Normalizing unrelated data is like mixing apples and oragnes. It makes fruit salad, not a good analysis

not all variables need to be normalized

results can be proportions or percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

data classification considerations

A

grouping of numerical data into classes for mapping, with each class represented by an individual symbol

class interval: where to put breaks in the data
number of intervals : 4-7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

describe equal intervals

A

equal intervals or steps along the number line
determine data range
not very good
susceptible to outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

describe quantiles

A

each class contains the same number of observations/values
easy tp understand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

describe mean standard deviation

A

derive classes from the descriptive statistics of overall data distribution
worst method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

maximum breakes (defined interval)

A

derive classes from groups of similar data values according to local citerion
calculation of classes order data from low to high
use largest differences as class breaks
can be good
susceptible to outliers
you dont see contrast

17
Q

natural breaks

A

subjective, visual/manual determination of logical breaks in data distribution in dispersion graph or histogram
depends on what you want to highlight

18
Q

geoeetrical intervals

A

class breaks are based on a geoetric series
good for highly skewed data
good for computers

19
Q

optimal (fisher-jenks)

A

computational approaches to mimimizing classification error
most common method
indentifies low points in data

20
Q

rating of classification methods major points

A

quantiles is only good for ordinal data
optimal is only good one for helping assist with selecting number of classes

21
Q

enumeration and spatial fallacies

A

areal aggregation
census tracts
best when units are similar sizes

MAUP
depending on the geometry of spatial organization impacts your outcome and how it will look on the map
change the area = change results

22
Q

jenks (optimal) tends to stay away from using mean as a central measure T/F?

23
Q

what is multivariate mapping

A

encoding two or more variables into the symbolization

trade off between the information content and the complexitiy of the map

two main groups
inter-symbol encoding
intra-symbol encoding

bivariate choropleth maps
bivariate normalization (value by alpha)

24
Q

what is inter-symbol encoding

A

symbolize 2 symbols concurrently (complimentary symbols)

25
what is intra-symbol coding
multiple visual variables in one symbol combination of size and hue could be applied
26
selection of colour for classed maps
kind of data colour vision impairment simultaneous contrast colour associations - cognitive aesthetics purpose - exploration vs presentation cost of production
27
uni polar vs bipolar data characteristics
unipolar - sequential lightness usually preferred - varying saturation cna enhance visual contrast bipolar - diverging two hues diverge from a common light hue or grey
28
representing uncertainty
accuracy and precision uncertainty - difference between the real geographic phenomena and the users understanding of the phenomena ex. inaccuracy of inerpolated maps completeness of census data nature of data collection - standardized methods or not
29
grouping techniques intrinsic-extrinsic coincident-adjacent
intrinsic - extrinsic intrinsic = vary existing object extrinsic = use new object coincident-adjacent coincident = shown in same frame adjacent = small multiple static-dynmic dynamic = animations or interactivity
30
colour schemes and classified maps
map readers process colour by seeing differences in hue, stauration, and value mapping different "things" saturation (changing lightness) hues ( best used to identify map featueres and differentiate) value (lightness )
31
Look up tables
LUTs are used in lots of ways data structures that map values of attributes to something else colour scale can be lenghtened by adding saturation keep hue constant but vary saturation and lightness look up tables = enhancement
32
what type of data is pseudo colour table used on
unclassed data
33
what is an anthrom?
human controlled ecosystem
34
3 type of histogram graphs with LUTs
linear stretch (min-max enhancements) exponential stretch logarithmic stretch
35
are stretch and enhancement synonymous terms?
yes
36
what method is best used with contrast stretch
standard deviation