AS Statistics Flashcards

1
Q

Population

A

The whole set of items that are of interest

2
Q

Census

A

Observes or measures every member of a population

3
Q

Sample

A

A selection of observations taken from a subset of the population which is used to find out information about the population as a whole

4
Q

Sampling frame

A

List of sampling units, with each unit given an identifying name or number

5
Q

Advantages/disadvantages of a census

A

Advantages:
Should give a completely accurate result
Disadvantages:
Time consuming, expensive, cannot be used when testing process destroys the item, hard to process large quantity of data

6
Q

Sample advantages/disadvantages

A

Advantages:
Less time consuming/expensive, fewer people have to respond, less data to process than in a census
Disadvantages:
Data may not be as accurate, sample may not be large enough to give data about small subgroups of the population

7
Q

Sampling units

A

Individual units of a population

8
Q

Simple random sample

A

Where every sample of size n has an equal chance of being selected

9
Q

Systematic sampling

A

Required elements are chosen at regular intervals from an ordered list

10
Q

Stratified sampling

A

Population is divided into mutually exclusive strata and a random sample is taken from each
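The three random sampling methods defined in these cards can be compared with a short Python sketch. The sampling frame (units numbered 1 to 100), the two strata names and the sample size of 10 are all hypothetical, chosen only for illustration. The stratified version assumes each stratum is sampled in proportion to its size, which is one common convention rather than the only option.

```python
import random

def simple_random_sample(frame, n, rng):
    """Every possible sample of size n is equally likely to be chosen."""
    return rng.sample(frame, n)

def systematic_sample(frame, n, rng):
    """Take every k-th unit from the ordered list, from a random start in the first k."""
    k = len(frame) // n          # sampling interval
    start = rng.randrange(k)     # random starting index between 0 and k - 1
    return [frame[start + i * k] for i in range(n)]

def stratified_sample(strata, n, rng):
    """Take a simple random sample from each stratum, sized in proportion to the stratum."""
    total = sum(len(units) for units in strata.values())
    sample = {}
    for name, units in strata.items():
        size = round(n * len(units) / total)
        sample[name] = rng.sample(units, size)
    return sample

rng = random.Random(0)                      # seeded so the run is repeatable
frame = list(range(1, 101))                 # hypothetical sampling frame
strata = {                                  # hypothetical mutually exclusive strata
    "year12": list(range(1, 61)),
    "year13": list(range(61, 101)),
}

srs = simple_random_sample(frame, 10, rng)
sys_sample = systematic_sample(frame, 10, rng)
strat = stratified_sample(strata, 10, rng)
```

With 60 units in one stratum and 40 in the other, a sample of 10 takes 6 and 4 units respectively. Note that all three functions need the sampling frame as input.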

11
Q

Simple random sampling advantages/disadvantages

A

Advantages: free of bias, easy and cheap to implement for small samples/populations, each sampling unit has a known and equal chance of selection
Disadvantages: not suitable for large populations (time consuming, disruptive, expensive), a sampling frame is needed

12
Q

Systematic sampling advantages/disadvantages

A

Advantages: simple and quick to use, suitable for large samples/populations
Disadvantages: a sampling frame is needed, can introduce bias if the sampling frame is not random

13
Q

Stratified sampling advantages/disadvantages

A

Advantages: sample accurately reflects population structure, guarantees proportional representation of groups within a population.
Disadvantages: population must be classified into distinct strata, selection within each stratum suffers from same disadvantages as simple random sampling

14
Q

Quota sampling

A

an interviewer or researcher selects a sample that reflects the characteristics of the whole population

15
Q

Opportunity sampling

A

Consists of taking the sample from people who are available at the time the study is carried out and who fit the criteria you are looking for

16
Q

Quota sampling advantages/disadvantages

A

Advantages: allows a small sample to still be representative of the whole population, no sampling frame required, quick, easy and inexpensive, allows for easy comparison between different groups within a population
Disadvantages: non-random sampling can introduce bias, population must be divided into groups which can be costly and inaccurate, increasing scope of study increases number of groups (adding time and expense), non-responses are not recorded as such

17
Q

Opportunity sampling advantages/disadvantages

A

Advantages: easy to carry out, inexpensive
Disadvantages: unlikely to provide a representative sample, highly dependent on individual researcher

18
Q

Quantitative data/variables

A

variables or data associated with numerical observations

19
Q

Qualitative data/variables

A

variables or data associated with non-numerical observations

20
Q

continuous variable

A

a variable that can take any value in a given range

21
Q

discrete variable

A

a variable that can take only specific values in a given range

22
Q

grouped frequency table (gft)

A

the specific data values are not shown but are included in groups (or classes)

23
Q

mid-point (gft)

A

average of class boundaries

24
Q

Daily mean temperature units

A

degrees Celsius

25
Daily total rainfall units
mm | Amounts less than 0.05 mm are recorded as 'tr' or 'trace'
26
Daily total sunshine
recorded to the nearest tenth of an hour
27
Daily mean windspeed/daily maximum gust units
knots
28
Daily maximum relative humidity
Given as a percentage of air saturation with water vapour. | Above 95% gives rise to misty and foggy conditions
29
Daily mean cloud cover units
'oktas' (eighths of the sky covered by cloud)
30
Daily mean visibility units
decametres (Dm)
31
Daily mean pressure units
hectopascals (hPa)
32
measure of location
a single value which describes a position in a data set
33
Measure of central tendency
a single value which describes the centre of the data
34
Mode/modal class
the value or class that occurs most often
35
Median
the middle value when the data values are put in order
36
Mean formula
sum of the values/number of values
37
Lower quartile
one-quarter of the way through the data set
38
Upper quartile
three-quarters of the way through the data set
39
Range
difference between the largest and smallest values in the data set
40
Interquartile range (IQR)
the difference between the upper and lower quartiles
41
Interpercentile range
the difference between the values for two given percentiles
42
Standard deviation
square root of the variance
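The measures of location and spread on these cards can be checked on a small data set with Python's standard `statistics` module. The data values are made up for illustration. The quartiles come from `statistics.quantiles`, whose default interpolation rule may differ from the rule taught on a particular course, so treat those two values as one possible convention.

```python
import math
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data set

mean = statistics.mean(data)               # sum of the values / number of values
median = statistics.median(data)           # middle value of the ordered data
mode = statistics.mode(data)               # value that occurs most often
data_range = max(data) - min(data)         # largest value minus smallest value

# Quartiles: quantiles() with n=4 returns the three cut points Q1, Q2, Q3.
q1, q2, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1                              # interquartile range

variance = statistics.pvariance(data)      # population variance
std_dev = math.sqrt(variance)              # standard deviation = sqrt(variance)

print(mean, median, mode, data_range, iqr, std_dev)
```

For this data set the mean is 5, the median is 4.5, the range is 7 and the standard deviation is 2.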
43
Coding
a way of simplifying statistical calculations
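As an illustration of coding, the sketch below assumes the usual linear code y = (x - a) / b, with hypothetical values a = 1000 and b = 10 and made-up data. Under that code the mean decodes as a + b × (mean of y), and the standard deviation decodes as b × (standard deviation of y), because subtracting a constant does not change the spread.

```python
import statistics

x = [1010, 1020, 1030, 1040, 1050]   # hypothetical raw data
a, b = 1000, 10                      # hypothetical coding constants

# Code the data: y = (x - a) / b gives the much simpler values 1, 2, 3, 4, 5.
y = [(xi - a) / b for xi in x]

mean_y = statistics.mean(y)
sd_y = statistics.pstdev(y)

# Decode the results back to the original scale.
mean_x = a + b * mean_y              # the mean is shifted and scaled
sd_x = b * sd_y                      # the standard deviation is only scaled

print(mean_x, sd_x)
```

The decoded values agree with the mean and standard deviation calculated directly from the raw data.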
44
Outlier
an extreme value that lies outside the overall pattern of the data
45
Outlier common definition
Greater than Q3 + k(IQR) | Or less than Q1 - k(IQR)
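The rule above can be applied as in the following sketch. The value of k is normally given in the question; k = 1.5 is used here only as a common choice. The data set and the quartile values Q1 = 13 and Q3 = 20 are hypothetical and are taken as given rather than calculated.

```python
def find_outliers(data, q1, q3, k=1.5):
    """Return the values above Q3 + k(IQR) or below Q1 - k(IQR)."""
    iqr = q3 - q1
    lower_fence = q1 - k * iqr
    upper_fence = q3 + k * iqr
    return [x for x in data if x < lower_fence or x > upper_fence]

data = [3, 12, 14, 15, 16, 18, 19, 21, 40]    # hypothetical data
outliers = find_outliers(data, q1=13, q3=20)  # quartiles assumed given
print(outliers)
```

Here the IQR is 7, so the boundaries are 2.5 and 30.5. Only 40 lies outside them; 3 is inside the lower boundary, so it is not an outlier.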
46
cleaning the data
the process of removing anomalies from the data
47
anomalies
outliers that should be removed from the data because they are clearly errors and would be misleading
48
frequency polygon
formed by joining the middle of the top of each bar in a histogram with straight lines
49
frequency density equation
frequency/class width
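A short sketch of the grouped frequency calculations: for each class it works out the class width, the mid-point (the average of the class boundaries) and the frequency density. The class boundaries and frequencies are invented for illustration.

```python
# Hypothetical grouped frequency table: (lower boundary, upper boundary, frequency)
classes = [(0, 10, 5), (10, 20, 12), (20, 40, 8)]

rows = []
for lower, upper, frequency in classes:
    class_width = upper - lower
    midpoint = (lower + upper) / 2                # average of the class boundaries
    frequency_density = frequency / class_width   # height of the histogram bar
    rows.append((class_width, midpoint, frequency_density))

for row in rows:
    print(row)
```

The last class is twice as wide as the others, so its frequency of 8 gives a frequency density of only 0.4, lower than the 0.5 of the first class even though that class has a smaller frequency.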
50
Bivariate data
data which has pairs of values for two variables
51
Independent/explanatory variable
the variable controlled by the researcher (x-axis)
52
Dependent/response variable
the variable measured by the researcher (y-axis)
53
correlation
describes the nature of the linear relationship between two variables
54
causal relationship
when a change in one variable causes a change in the other. Correlation does not imply causation: use the context of the question and common sense to decide whether a causal relationship is plausible
55
experiment
a repeatable process that gives rise to a number of outcomes
56
event
a collection of one or more outcomes
57
sample space
the set of all possible outcomes
58
mutually exclusive
when events have no outcomes in common
59
Addition rule (probability)
For mutually exclusive events: | P(A or B) = P(A) + P(B)
60
Independent
when one event has no effect on another
61
Multiplication rule (probability)
For independent events: | P(A and B) = P(A) x P(B)
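Both rules can be checked by listing a sample space. The sketch below assumes a hypothetical experiment of rolling two fair dice, with every outcome equally likely, and uses exact fractions. The events are chosen for illustration: "total is 2" and "total is 12" are mutually exclusive, while "first die shows 6" and "second die is even" are independent.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 equally likely outcomes of rolling two fair dice.
sample_space = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event, given as a function that is True for its outcomes."""
    favourable = [outcome for outcome in sample_space if event(outcome)]
    return Fraction(len(favourable), len(sample_space))

# Mutually exclusive events: no outcome has a total of both 2 and 12.
def total_is_2(o): return sum(o) == 2
def total_is_12(o): return sum(o) == 12
p_or = prob(lambda o: total_is_2(o) or total_is_12(o))

# Independent events: the first die has no effect on the second.
def first_is_6(o): return o[0] == 6
def second_is_even(o): return o[1] % 2 == 0
p_and = prob(lambda o: first_is_6(o) and second_is_even(o))

print(p_or, prob(total_is_2) + prob(total_is_12))         # addition rule
print(p_and, prob(first_is_6) * prob(second_is_even))     # multiplication rule
```

Each printed pair matches: 1/18 for the addition rule and 1/12 for the multiplication rule.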
62
tree diagram
can be used to show the outcomes of two or more events happening in succession
63
random variable
a variable whose value depends on the outcome of a random event
64
Sample space
the range of values that a random variable can take
65
probability distribution
fully describes the probability of any outcome in the sample space
66
discrete uniform distribution
when all of the probabilities are the same
67
The probabilities of all outcomes in a sample space add up to 1
ΣP(X=x) = 1
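A minimal sketch of a probability distribution, assuming a fair six-sided die as the random variable. Every value has the same probability, so this is a discrete uniform distribution, and the probabilities sum to 1.

```python
from fractions import Fraction

# Probability distribution of X, the score on a fair six-sided die (hypothetical example).
distribution = {x: Fraction(1, 6) for x in range(1, 7)}

total = sum(distribution.values())                # should equal 1
is_uniform = len(set(distribution.values())) == 1  # all probabilities the same

print(total, is_uniform)
```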
68
test statistic
the result of the experiment or the statistic that is calculated
69
null hypothesis, H0
the one you assume to be correct
70
alternative hypothesis, H1
tells you about the parameter if your assumption is wrong
71
critical region
region of the probability distribution which, if the test statistic falls within it, would cause you to reject the null hypothesis
72
critical value
first value to fall inside the critical region
73
actual significance level
probability of incorrectly rejecting the null hypothesis
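The hypothesis-testing terms above can be tied together with a worked sketch. It assumes a binomial model, which these cards do not mention: a hypothetical test statistic X ~ B(20, p) with H0: p = 0.5, H1: p > 0.5 and a 5% significance level. The function name and parameters are illustrative only.

```python
from math import comb

def binomial_pmf(n, p, x):
    """P(X = x) for X ~ B(n, p)."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

def upper_tail_critical_value(n, p, alpha):
    """Return (c, P(X >= c)) for the smallest c with P(X >= c) <= alpha under H0.

    Returns (None, 0.0) if even P(X = n) exceeds alpha (empty critical region).
    """
    tail = 0.0
    critical, actual = None, 0.0
    for c in range(n, -1, -1):          # grow the upper tail one value at a time
        tail += binomial_pmf(n, p, c)
        if tail > alpha:                # adding c would exceed the significance level
            break
        critical, actual = c, tail
    return critical, actual

critical_value, actual_level = upper_tail_critical_value(n=20, p=0.5, alpha=0.05)
print(critical_value, round(actual_level, 4))
```

Under these assumptions the critical region is X ≥ 15, so the critical value is 15. The actual significance level is P(X ≥ 15) ≈ 0.0207, which is below the 5% target because X can only take whole-number values.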