Module 3 Assessment Flashcards
(35 cards)
New values created through the algebraic combination of two or more bands in order to emphasize particular Earth features are known as:
Spectral Indices
Which of the following is NOT a commonly used spectral index?
a. NDVI
b. NDWI
c. EVI
d. BRDF
d. BRDF
That’s right, the BRDF (bidirectional reflectance distribution function) describes the angular reflectance properties of a surface and is not a spectral index.
Assess whether the following statement is true or false: Most spectral vegetation indices are built around the concept of the “red edge” which describes the strong reflectance of green wavelength light compared to red and blue wavelengths of light by vegetation
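False. The “red edge” is the sharp increase in vegetation reflectance between red and near-infrared wavelengths, not the contrast between green light and red or blue light.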
Many spectral indices use a mathematical approach that contrasts two bands while controlling for their overall brightness. This approach is known as:
a. Band Ratio
b. Normalized Difference
c. Band Differencing with Thresholds
d. Linear Combination
b. Normalized Difference
Band ratios do indeed quantify the contrast between two bands, but do not control for overall brightness. The correct answer is “normalized difference”. Differencing two bands quantifies the contrast and dividing by their sum controls for their overall brightness. This is used, for example, in the NDVI and NDWI indices
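The effect of dividing by the sum can be sketched in a few lines. This is a minimal illustration, assuming made-up NIR and red reflectances for one surface under bright and dim illumination; the function name and values are hypothetical.

```python
def normalized_difference(band_a, band_b):
    """Return (a - b) / (a + b) per pixel; None where the sum is zero."""
    result = []
    for a, b in zip(band_a, band_b):
        total = a + b
        result.append((a - b) / total if total != 0 else None)
    return result

# Hypothetical reflectances: the same vegetated surface, with the dim case
# simulated by halving both bands.
nir_bright, red_bright = 0.50, 0.10
nir_dim, red_dim = 0.25, 0.05

# NDVI is the normalized difference of NIR and red.
ndvi = normalized_difference([nir_bright, nir_dim], [red_bright, red_dim])

# The plain difference changes with brightness; the normalized one does not.
difference = [nir_bright - red_bright, nir_dim - red_dim]
```

Here the plain difference halves when the illumination halves, while the normalized difference stays the same for both cases.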
This is a popular linear transformation of spectral bands designed to create new bands that are less correlated and have clear physical meaning
a. Change Vector Analysis
b. Tasseled Cap
c. Random Forest
d. Enhanced Vegetation Index
b. Tasseled Cap
Yes, the tasseled cap transformation is a linear combination of multispectral values that is akin to the axes rotation accomplished by principal components analysis.
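A linear combination of bands is just a weighted sum per output component. The sketch below uses invented weights for three bands (green, red, NIR) purely to show the mechanics; they are NOT the published tasseled cap coefficients for any sensor, and the pixel values are hypothetical.

```python
def linear_transform(pixel, weights):
    """Apply one row of weights per output component to a pixel's band values."""
    return [sum(w * v for w, v in zip(row, pixel)) for row in weights]

# Hypothetical weights (illustration only, not real tasseled cap coefficients).
WEIGHTS = [
    [0.5, 0.5, 0.5],     # "brightness-like": all bands contribute positively
    [-0.3, -0.5, 0.8],   # "greenness-like": NIR contrasted with visible bands
]

# Hypothetical (green, red, NIR) reflectances.
vegetation = [0.05, 0.04, 0.50]
bare_soil = [0.15, 0.20, 0.30]

veg_brightness, veg_greenness = linear_transform(vegetation, WEIGHTS)
soil_brightness, soil_greenness = linear_transform(bare_soil, WEIGHTS)
```

With these assumed weights, the vegetation pixel scores higher on the greenness-like component than the soil pixel, which is the kind of physical interpretation the new axes are meant to support.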
An analytical technique that attempts to assign labels to pixels using an algorithm and examples of known categories (i.e. training data) is called:
a. Unsupervised classification
b. Principal Components Analysis
c. Supervised classification
d. Hierarchical Clustering
c. Supervised classification
Yes, supervised classification can be conducted when the analyst can provide the algorithms with examples of each category or class that is desired in the output map.
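As a rough sketch of the idea, the example below trains a minimum distance to means classifier, one of the simplest supervised approaches, from a handful of labeled samples. The class names, the (red, NIR) reflectance values, and the function names are all hypothetical.

```python
def train_class_means(training):
    """training maps a class label to a list of (red, nir) training samples."""
    return {
        label: tuple(sum(dim) / len(samples) for dim in zip(*samples))
        for label, samples in training.items()
    }

def classify(pixel, class_means):
    """Assign the label whose class mean is nearest in squared Euclidean distance."""
    return min(
        class_means,
        key=lambda label: sum((p - m) ** 2 for p, m in zip(pixel, class_means[label])),
    )

# Hypothetical training data supplied by the analyst.
training = {
    "water": [(0.03, 0.02), (0.05, 0.04)],
    "forest": [(0.04, 0.45), (0.06, 0.55)],
}
class_means = train_class_means(training)

label_a = classify((0.05, 0.40), class_means)
label_b = classify((0.04, 0.05), class_means)
```

The output labels are semantic ("water", "forest") because the analyst provided them with the training data.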
Techniques which identify clusters or groupings that occur naturally in the data, without the need for training data are known as:
a. Unsupervised Classification
b. Supervised Classification
c. Artificial Neural Networks
d. Random Forest
a. Unsupervised Classification
ANNs, when used in image classification, rely on labeled training data in order to “learn” over time. The correct answer is “unsupervised classification”, the family of approaches that identify clusters or groupings in the data based on intrinsic characteristics of the data.
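K-means is one common member of this family. The sketch below is a minimal version, assuming hypothetical (red, NIR) pixel values and fixed starting centers so the result is reproducible; practical implementations usually start from randomly chosen centers.

```python
def kmeans(points, centers, max_iter=100):
    """Assign each point to its nearest center, move each center to the mean
    of its members, and repeat until the centers stop changing."""
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(max_iter):
        labels = [
            min(range(len(centers)),
                key=lambda k: sum((p - c) ** 2 for p, c in zip(point, centers[k])))
            for point in points
        ]
        new_centers = []
        for k, old in enumerate(centers):
            members = [pt for pt, lab in zip(points, labels) if lab == k]
            if members:
                new_centers.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                new_centers.append(old)  # keep the center of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers

# Hypothetical unlabeled pixels: three dark in NIR, three bright in NIR.
pixels = [(0.03, 0.02), (0.04, 0.03), (0.05, 0.02),
          (0.04, 0.45), (0.05, 0.50), (0.06, 0.48)]
labels, centers = kmeans(pixels, centers=[(0.0, 0.0), (0.1, 0.6)])
```

No training data is involved, and the returned labels are only cluster numbers (0 and 1); deciding what each cluster represents is left to the analyst.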
Assess whether the following statement is true or false: Unsupervised classification techniques do not result in semantic classes
True - Correct, unsupervised classification can only provide data classes and the analyst must then supply the semantic meaning
Which of the following is not true of spectral signatures as they relate to image classification?
a. Ancillary data may help to discriminate when different classes have the same or similar spectral signatures
b. To be successful, a classification scheme should be devised such that the spectral signatures for each class show no within-class variability
c. Classes with distinct theoretical spectral signatures may nonetheless be indistinguishable based on which portions of the EM spectrum our sensor has observed
d. Even pixels with identical surface states (e.g. the same land cover/use) will often have different spectral signatures
b. To be successful, a classification scheme should be devised such that the spectral signatures for each class show no within-class variability
This is false. There will almost always be some variability in the spectral signatures, even for classes that are narrowly defined and relatively homogeneous throughout the image.
Which of the following is not a land cover/use product available for use by the scientific community?
a. ESA World Cover
b. Copernicus DEM (e.g. GLO30)
c. National Land Cover Database (NLCD)
d. MODIS Global Land Cover Product
b. Copernicus DEM (e.g. GLO30)
This is a global digital elevation model product and not a land cover/use product.
Which of the following is not a parametric classification technique:
a. Parallelepiped Classifier
b. Minimum Distance to Means Classifier
c. Maximum Likelihood Classification
d. Artificial neural networks
d. Artificial neural networks
Correct, ANNs do not require assumptions about the statistical distribution of the predictors or the calculation of their parameters
In contrast to minimum distance to means, the maximum likelihood classifier takes into account:
a. Uneven variance in predictors between classes
b. Non-normal distributions in predictors
c. Uneven variance in predictors between classes and covariance amongst predictors
d. Covariance amongst predictors
c. Uneven variance in predictors between classes and covariance amongst predictors
Yes, MLC takes both uneven variance and covariance into account
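The difference can be seen with one pixel and two made-up classes. This is a sketch under assumptions: two predictors, a tight class "A" and a broad class "B" with correlated predictors, and invented means and covariances. The maximum likelihood score is the log density of a two-dimensional normal distribution.

```python
import math

def euclidean_sq(x, mean):
    """Squared distance to the class mean (all that minimum distance uses)."""
    return sum((a - b) ** 2 for a, b in zip(x, mean))

def gaussian_log_likelihood(x, mean, cov):
    """Log density of a 2-D normal distribution with a 2x2 covariance matrix."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # Squared Mahalanobis distance via the explicit inverse of a 2x2 matrix.
    maha = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return -0.5 * (maha + math.log(det) + 2 * math.log(2 * math.pi))

# Hypothetical class statistics: (mean vector, covariance matrix).
CLASSES = {
    "A": ((0.10, 0.10), ((0.0001, 0.0), (0.0, 0.0001))),   # tight, uncorrelated
    "B": ((0.30, 0.30), ((0.01, 0.008), (0.008, 0.01))),   # broad, correlated
}
pixel = (0.17, 0.17)

min_distance_label = min(CLASSES, key=lambda k: euclidean_sq(pixel, CLASSES[k][0]))
max_likelihood_label = max(
    CLASSES, key=lambda k: gaussian_log_likelihood(pixel, *CLASSES[k]))
```

The pixel is closer to the mean of class A, so minimum distance picks A. Class A is so tight that the pixel is very unlikely under it, while it lies along the correlated spread of class B, so the maximum likelihood rule picks B.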
In which situation would using prior probabilities in a maximum likelihood classification approach be most useful?
a. When there is strong covariance amongst one or more of the predictors within some or all classes
b. When there are two similar classes but one of them is much rarer than the other
c. When we suspect that some of the training data has inaccurate labels
d. When all classes have an equal amount of area in the study region
b. When there are two similar classes but one of them is much rarer than the other
Yes, when we have rare classes, especially when they’re easily confused with other, less rare classes, using prior probabilities can help keep these classes from being overrepresented in the output
In which case would MLC most likely not work well due to a violation of its core assumption(s)?
a. When there is strong covariance among one or more predictors in the classification. For example: NDVI and NIR reflectance will usually increase in tandem for vegetated classes like forest
b. When two classes have a high degree of overlap in their predictors and other characteristics. For example: shrublands and woody savannas.
c. When one class has much greater variability in one or more predictors than the others. For example: a forest class containing many different types and densities of forests
d. When a particular class has one or more predictors showing non-normal, such as bimodal, distributions. For example: dark water and shallow/turbid water
d. When a particular class has one or more predictors showing non-normal, such as bimodal, distributions. For example: dark water and shallow/turbid water
Correct, in this case the normality assumption would be violated, so the class mean and covariance matrix estimated from the training data would not adequately describe the class distribution
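To see why a bimodal class is a problem, consider the parameters a single normal distribution would be fitted with. The sample values below are hypothetical NIR reflectances for one "water" class that mixes dark and turbid water.

```python
import statistics

# Hypothetical NIR reflectance samples for a single "water" training class.
dark_water = [0.01, 0.02, 0.02, 0.03]
turbid_water = [0.11, 0.12, 0.12, 0.13]
water = dark_water + turbid_water

# Parameters a single normal distribution would be fitted with.
mean = statistics.mean(water)
std = statistics.pstdev(water)

# The fitted normal curve peaks at the mean, yet no sample lies near it.
gap_to_nearest_sample = min(abs(value - mean) for value in water)
```

The fitted curve treats values around the mean as the most typical water, although every training sample sits in one of the two modes on either side. Splitting the class into two spectral classes and merging them after classification is one common workaround.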
Which is not a normal part of the workflow for an unsupervised classification approach?
a. Compiling a set of predictors (spectral, derived indices, and ancillary data) that we expect will lead to the ability to differentiate real surface states
b. Refining the feature set to better distinguish between categories/classes/clusters that contain more than one surface type that we hope to differentiate
c. Merging or collapsing multiple data/spectral classes into semantic classes for which a label can be determined
d. Designing an appropriate sampling scheme for the training data to ensure the variability in each class is appropriately captured
d. Designing an appropriate sampling scheme for the training data to ensure the variability in each class is appropriately captured
Which of the following is a feature of ISODATA but not K-means approaches to clustering?
a. An iterative process whereby the mean centers of potential clusters are refined until an exit condition is met
b. Starting with randomly assigned cluster centers
c. Can be conducted without any training data at all
d. The ability to split, merge, and delete clusters
d. The ability to split, merge, and delete clusters
Yes, ISODATA is identical to K-means, but with an additional step wherein potential clusters are split if they are too highly variable in one or more dimensions, merged if their centers are very close, or deleted altogether if they do not contain enough points/pixels/entities.
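The extra step can be sketched as a single refinement pass. This is a simplified illustration, not a full ISODATA implementation: it works on one predictor, represents each cluster as a list of its member values, and uses hypothetical thresholds and function names.

```python
import statistics

def refine_clusters(clusters, min_size=2, merge_dist=0.05, max_std=0.10):
    """One ISODATA-style refinement pass over 1-D clusters (lists of values)."""
    # Delete: drop clusters with too few members. (In this sketch their
    # members are simply discarded rather than reassigned.)
    kept = [c for c in clusters if len(c) >= min_size]

    # Merge: combine neighbouring clusters whose means are very close.
    kept.sort(key=statistics.mean)
    merged = []
    for cluster in kept:
        if merged and statistics.mean(cluster) - statistics.mean(merged[-1]) < merge_dist:
            merged[-1] = merged[-1] + cluster
        else:
            merged.append(list(cluster))

    # Split: divide clusters that are too variable, cutting at their mean.
    result = []
    for cluster in merged:
        if len(cluster) >= 2 * min_size and statistics.pstdev(cluster) > max_std:
            center = statistics.mean(cluster)
            result.append([v for v in cluster if v < center])
            result.append([v for v in cluster if v >= center])
        else:
            result.append(cluster)
    return result

# Hypothetical clusters after a K-means style assignment step.
clusters = [[0.02, 0.03, 0.04], [0.05, 0.06], [0.90], [0.20, 0.22, 0.60, 0.62]]
refined = refine_clusters(clusters)
```

In this run the single-member cluster is deleted, the two clusters with nearby means are merged, and the highly variable cluster is split in two.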
All of the following are true of validation data for land cover classification accuracy assessment except which one?
a. The validation data must be collected according to a probabilistic scheme, such as a stratified random approach
b. The validation data must contain samples from every category/class, but the number of samples may not be equal
c. The precision of the resulting accuracy measurements will be proportional to the size (number of samples) of our validation data
d. The training data may be used for validation as long as you are using a machine learning or other nonparametric classification technique
d. The training data may be used for validation as long as you are using a machine learning or other nonparametric classification technique
This quantity reflects the total proportion of validation samples that were correctly classified in the map
a. Overall accuracy
b. Cohen’s Kappa
c. Producer’s accuracy
d. User’s accuracy
a. Overall accuracy
The user’s accuracy reflects the probability that something labeled as class X on the map is actually class X in the real world. The total proportion correct in the validation sample is the overall accuracy.
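Both measures can be read off a confusion matrix. The sketch below assumes a made-up three-class matrix whose rows are map labels and whose columns are reference (validation) labels; the class names, counts, and function names are illustrative only.

```python
# Hypothetical confusion matrix: rows = map label, columns = reference label.
CLASSES = ["water", "forest", "urban"]
MATRIX = [
    [40, 5, 5],    # mapped as water
    [2, 28, 0],    # mapped as forest
    [3, 7, 10],    # mapped as urban
]

def overall_accuracy(matrix):
    """Correctly classified samples (the diagonal) divided by all samples."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total

def users_accuracy(matrix, i):
    """Share of samples mapped as class i that are class i in the reference data."""
    return matrix[i][i] / sum(matrix[i])

overall = overall_accuracy(MATRIX)
users = {name: users_accuracy(MATRIX, i) for i, name in enumerate(CLASSES)}
```

Overall accuracy is a single number for the whole map, while user's accuracy is computed per class along each row. If a matrix is laid out with the reference data in rows instead, the row and column roles swap.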
A measure of overall accuracy that corrects for chance agreement is:
a. Cohen’s Kappa
b. F1 Score
c. Quantity disagreement
d. Allocation disagreement
a. Cohen’s Kappa
Yes, Cohen’s Kappa is related to overall accuracy, but it accounts for chance agreement, or the fact that even some randomly assigned labels would be correct
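Kappa compares the observed agreement with the agreement expected by chance from the row and column totals: kappa = (observed - expected) / (1 - expected). The sketch below assumes a hypothetical confusion matrix with map labels in rows and reference labels in columns.

```python
def cohens_kappa(matrix):
    """Cohen's Kappa from a square confusion matrix of sample counts."""
    n = sum(sum(row) for row in matrix)
    # Observed agreement: the overall accuracy.
    observed = sum(matrix[i][i] for i in range(len(matrix))) / n
    # Chance agreement: product of matching row and column totals, summed.
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(col) for col in zip(*matrix)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical counts: rows = map label, columns = reference label.
MATRIX = [
    [40, 5, 5],
    [2, 28, 0],
    [3, 7, 10],
]
kappa = cohens_kappa(MATRIX)
```

For these counts the overall accuracy is 0.78 and the chance agreement is 0.375, so Kappa comes out lower than the overall accuracy.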
For a particular class, the total errors of omission are captured by which accuracy measure?
a. User’s accuracy
b. Overall accuracy
c. Cohen’s Kappa
d. Producer’s accuracy
d. Producer’s accuracy
Yes, the producer’s accuracy is the probability that something of class X in the real world is labeled as class X, and therefore entities of the validation data that were incorrectly classified as belonging to another class have been omitted from their true class. So, producer’s accuracy captures the total omission error.
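In a confusion matrix, producer's accuracy is computed down a reference column, and the omission error is its complement. The sketch below assumes hypothetical counts with map labels in rows and reference labels in columns.

```python
def producers_accuracy(matrix, j):
    """Share of reference samples of class j that the map also labels as j."""
    column_total = sum(row[j] for row in matrix)
    return matrix[j][j] / column_total

# Hypothetical counts: rows = map label, columns = reference label.
CLASSES = ["water", "forest", "urban"]
MATRIX = [
    [40, 5, 5],
    [2, 28, 0],
    [3, 7, 10],
]

water_producers = producers_accuracy(MATRIX, CLASSES.index("water"))
water_omission = 1 - water_producers   # reference water mapped as something else
```

Here 45 reference samples are water and 40 of them are mapped as water, so 5 of the 45 were omitted from the water class.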
Confusion matrix question
Confusion matrix question
Confusion matrix question
Producer’s accuracy question
Yes, since everything is classified as water and producer’s accuracy for the water class is the probability that real water is correctly classified as water, the producer’s accuracy for water is 100%
The correct answer is ‘True’.