Application Of Statistics In Geography Flashcards

1
Q

What is statistics

A

Is the study and practice of collecting, analyzing and presenting data that have geographical and real world dimension such as census and demography

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define population

A

Is a group of people, objects or units from which we intend to know something

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a sample

A

Is a collection of representative members of the population from which the statistical measures are taken

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give two types of statistics

A

Descriptive and inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Explain what is descriptive statistics

A

This is a type of statistics concerned with describing and summarizing the entire population or sampled data with the aim of bringing out the important facts about the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Point out examples of descriptive statistics

A

Mean, median, mode, range, standard deviation, and percentage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Explain what is inferential statistics

A

It’s concerned with studying the sample of the population for the sake of drawing conclusions based on the analysis of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the roles of statistics

A

*statistics present data in a comprehensive and definite form
*statistics present interpretation and predict conditions
*provides easy way of classifying numerical data
*statistics is used to evaluate the effectiveness of the policies
*statistics is useful in budgeting at both family and national level
*statistics is the basis of projection of different geographical events eg climate and outputs of agriculture
*helps to predict future trends and impact of classifying particular social or geographical phenomenon
*statistics is applied in monitoring the proper utilization of fund, human resources, construction and maintenance of learning facilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is data

A

Data can be numeric, text, graphic, art, image or symbol that researchers obtain from the subjects, respondents or participants of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the four categories of the nature of data

A

Discrete, continuous, individual, and grouped data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is discrete data

A

This is a numerical type of data that can only be given in whole, concrete numbers with specific and fixed data values determined by counting eg no of people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is continuous data

A

They include complex numbers and varying data values that are measured within a range eg wind speed, temperature, altitude height, and distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is individual data

A

Are data through which an exact value is given for each individual item in a sample eg number of students which maybe 200 students

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What grouped data

A

Are data by which no exact figure I quoted but several values fall within certain classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Classify statistical data into three categories

A

-Basing on sorting or distribution
-Basing on unit of measurement
-Basing on methods and sources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Mention statistics data Basing on sorting classes

A

Ungrounded and grouped data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Elaborate ungrouped data

A

It’s concerned with raw facts that have been collected from the study and usually not sorted into categories eg 12,34,23.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Explain grouped data

A

Is a set of numerical facts that has been sorted into categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Mention statistical data Basing on unit of measurement classes

A

Categorical(qualitative) and numerical(quantitative)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is categorical data

A

Are measured normally describe the characteristics of the studied subject and can be in form of text, graphics, art, image or symbol

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Mention the further categories of categorical data

A

Nominal and ordinal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is nominal data

A

Are data that are named without quantitative measure and often without logical sequence eg martial status

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is ordinal data

A

Are named without quantitative measure but with logical sequence eg opinion or feelings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Elaborate numerical data

A

Are data measured and expressed in numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What the further categories of numerical data
Discrete and continuous data
26
Mention statistical data Basing on methods and source classes
Primary and secondary data
27
What are the merits of secondary data
Easy manageability, time saving and low cost of accessing
28
What is the deference between primary and secondary data
*source *method of collection *statistical process *originality of data *use of purpose *method of data collection *time consumed *cost and accuracy
29
What is a variable
Is any characteristic, number of a person, object or phenomenon that can be measured or counted
30
Mention the three categories of variable
Unit of measurement, study design, and casual relationship
31
Mention variable based on unit of measurement categories
Numerical and categorical variable
32
Mention variable based on study design categories
Active and attribute variables
33
Mention variable based on casual relationship categories
Independent and dependent variables
34
What are the numerical variables
The measurements for these variables are number
35
36
What are the categories numerical variables
Continuous and discrete variables
37
What are categorical variables
Are variables that are qualitative in nature as they are represented by non-numeric values
38
What are active variables
Are those variables that can be manipulated, changed or controlled. They are independent variables
39
What are attribute variables
Are variables that cannot be manipulated, changed or controlled. They are dependent variables. Eg age, height, weight, income and level of education
40
What are independent variables
They are not usually affected with external factors for change. For instance someone's age can't be influenced by either eating or better health services
41
What are dependent variables
Are easily affected by external factors. Eg temperature, rainfall amount and body weight
42
43
What is correlation
It's the statistical measure that indicates the extent to which two or more variables are related
44
Name the three ways whereby variables are related
Positive, negative and no relationship
45
What do you know about positive relationship of variables
Is the relationship which exists when an increase in independent variables results to an increase in dependent variable
46
What do you know about negative relationship of variables
This is a relationship where the variables are negatively related whereby an increase in independent variables lead to a decrease in dependent variables
47
What do you know about no relationship of variables
The relationship that occurs when the change in one variable does not cause any change to the other. For instance the number of students who scored division one in a certain school for five consecutive years
48
Mention the three statistical measures
Measures of central tendency, measure of dispersion and measures of asymmetry(skewness)
49
What does summary statistics compress of
Measures of central tendency and measures of dispersion
50
What is parameter
Is the measures of central tendency computed from the entire population. It's the descriptive property of the population
51
The measures of central tendency computed for the sample and are the descriptive property of a sample
Statistics
52
What does measures of central tendency cover
It deals with the characteristics of the data concentrate the values around a central part of distribution. In other words central tendency is average An average is a single value within a range of data that is used to represent the entire values in a series
53
Name the most commonly used measures of central tendency
Mean, mode, and median
54
What are the characteristics of good measure
*easy to understand *simple to compute *possesses sampling stability and not unduly affected by extreme values *has possibility of further algebraic treatment
55
What is mean
This is sometimes referred to as arithmetic mean. It should be noted that mean is not a synonym of average instead measures of central tendency and averages are used interchangeably. It is often used in reporting the weather and climate for elements like temperature and rainfal. For mean formula PG12
56
Mention advantages of mean
*it is easy and fast to calculate *it includes all values in the distribution *it is useful for statistical summarization of data *it used for making comparison in statistical data *it is also widely understood compared to other averages and suited for further statistical analysis
57
What are the demerits of mean
*it is highly distorted by outliers[outliers are extremely high/low values in the observation] *it is impossible to to locate the mean by inspection as in the case of mode and median *value of mean will be effective only if the frequency I normally distributed
58
59
What is median
Is the value of the middle item
60
Hint at the advantages of median
*it is easy to understand because of considering halfway point of the data *it is suitable for distributions with extreme values *the median is not distorted by the extreme values in distribution *it is relevant in skewed data distribution
61
What are the limitations of median
*unsuitable for further mathematical treatment/advanced statistical processing *not based on all values under observation *highly affected by the fluctuation of sampling *not reliable in testing geographical hypothesis
62
What is mode
Is the most frequent score in a distribution. It can be either uni-modal, bimodal, or multi-modal. Mostly used in climatology
63
Give advantage of mode
*it is not affected by the occurrence of a few extreme values under the distribution *determines various phenomena such as the magnitude of agricultural production or the trend of commodities *it is the only average to be opted for categorical data *easy to read and interpret in the given data
64
What are the limitations of mode
*it is not strictly defined in such a way that it is unstable with large samples *it is not based on all data values under observation *it is not suitable for further mathematical treatment *its existence when data are the same in some of the distributions discourages its use in analysis of data and it's usefulness is less common in advanced statistical geography *it is highly affected by fluctuation of sampling
65
Name types of relationship between different measures of central tendency
-Normal distribution curve -Postive skewed distribution curve -Negative skewed distribution curve
66
What do you know about normal distribution curve
Is the curve that can be divided into two equal halves. Mean=median=mode
67
What is postive skewed distribution curve
Is the curve in which longer tail is on the right. Mean>median>mode. It implifies that a number of large populationare low income earners and only a few with extremelyhigh income. In this case mean is pulled up by the extreme high values
68
What do you know about negative skewed distribution curves
Is a Curve skewed to the left and in this case mean is pulled below the median by extremely low values
69
What are the general limitations of measures of central tendency
*various measures give us one single figure that represents the entire data from sample[average only could not adequately describe a set of observation unless all the observation are the same] *they fall short of indicating the extent of dispersion or variability in distribution *two or more distributions can have same central value yet there can be a wide discrepancy in the formation of distribution
70
What are the measures of dispersion
The degree to which numerical data tend to spread about an average value. They show how spread out data are It is also known as measure of variation or spread
71
What is variability
Is the difference of two or more distributions
72
Name the commonly used measures of spread
Range, mean deviation, variance and standard deviation
73
What is range
Is the difference between the smallest and the largest value in the distribution formula PG25
74
What are the advantages of range
*easy to calculate and understand *it also gives a quick estimate of variability in distribution *it is used in circumstances which one intends to get a quick understanding of variability *its also an appropriate measure in weather forecast
75
What are some of range drawbacks
*its highly affected by the extreme values in distribution *range can mislead the interpretation of data if there were some errors during collection of raw data *it is also based on only two items hence does not cover all the items in distribution *it is susceptibility to wide fluctuations from sample based on the same population *fails to give any idea about the pattern of distribution
76
What is mean deviation (MD)
It denotes the amount by which an individual value deviates from the mean in irrespective of sign(+or-) it is also known as average deviation Formula PG27
77
78
What are the advantages of mean deviation
*it is easy to understand and easily calculated *it takes into consideration all the dataset under observation *it is less affected by extreme values *it is possible to make a meaningful comparison of the setup of different distributions
79
What are the limitations of mean deviation
*it falls short of allowing further algebraic treatment *it sometimes fails to give accurate results *it is not a satisfactory measure and from mathematical perspective the method is wrong for it disregards the algebraic signs when deviations are taken from the mean
80
What is variance
It's the mean of the squares of the differences between each data values and the mean It's is denoted hy s square and for sample variance is denoted hy S square Formula PG29
81
What is the limitation for variance
It tends to give more weight to extreme values by squaring them up
82
What is standard deviation
It's the value which shows how far the scores are spread from the normal FORMULA PG30
83
How to interpret results when using standard deviation for individual data
*the greater the SD in one of the group say stream A compared to stream B will mean that the spread of the scores in stream A is greater compared to stream B *the greater the SD the greater the spread of the scores. The closer the scores are to the mean, the less spread they are hence smaller th SD
84
What are the strengths of standard deviation
It's strictly defined and it's value is always definite and based on all observations *it uses the actual signs of observations *it is based on arithmetic mean *gives possibility for further algebraic expression *it I less affected by extreme values
85
86
What are the limitations of standard deviation
*It is complex to understand and calculate *it tends to give more weight to extreme values by squaring them during computation *its an absolute measure of variability [it cannot be used for comparison purposes]
87