Statistics Data Collection Flashcards

(26 cards)

1
Q

Census

A

Observes/measures every member of the population .
✅completely accurate result
❎time consuming + expensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample

A

Observations taken from subset of population which is used to find out info about population as a whole.
✅less time consuming + cheaper as less data to process than a census.
❎sample not large enough to give info about small sun groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Simple random sample of N

A

Every sample size N has equal chance selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sampling units

A

Individual units of population
Often individually named/ numbered to form sampling frame.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Types of random sampling

A

Simple random, systematic, stratified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Simple random sample

A

Of size N is where every sample of size N has equal chance of selection.
Need a sampling frame (list of people/things).
Each item allocated number + selected at random.
Can generate random number e.g. using calculator or lottery sampling (items written on tickets + put into hat).
✅each sampling unit has equal chance of selection.
❎unsuitable when population size large as time consuming + expensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Systematic sampling

A

Required elements chosen at regular intervals form ordered list.
First person to be chosen should be chosen at random.
✅suitable for large samples + large populations.
❎bias if sampling frame isn’t random.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Stratified sampling

A

Population divide to mutually exclusive strata (e.g males + females) + random sample taken from each.
(N in strata/ population x overall sample size)
✅proportional representation of groups in population
❎population must be clearly classified into distinct strata.
POPULAITON divided to strata which is expensive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Quota sampling

A

Select sample that reflects characteristics of whole population.
✅quick- no sampling frame.
❎must divide population to groups can be expensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Opportunity sampling

A

Sample who’s available at the times the studies carried out.
✅easy carry out
❎depends on when the researchers available
Unlikely to be representative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Continuous

A

Any value in a given range e.g time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Discrete

A

Specific values elf number of apples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe type of data presented by daily total rainfall

A

Continuous quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain why Alison’s process may not generate a sample of size 5

A

Some data values are (n/a)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Daily maximum relative humidity

A

As a % of air saturation with water vapour.
Relative humidities above 95% have more foggy + misty conditions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Daily max gust

A

Knots
Highest existing windspeed recorded, direction from which it’s blowing is also recorded.

17
Q

Daily mean temp

A

Average of hourly temp in 24hour period.

18
Q

Daily tr

A

Includes snow and hail , melted before measuring. Amounts less than 0.05mm are ‘tr’

19
Q

Daily mean wind direction + windspeed

A

Knots in Kn
Averaged over 24 hours midnight to midnight.
Mean- as bearings and compass directions.
Data categorised according to Beaufort scale.

20
Q

Okta

A

Maximum figure for cloud cover is 8

21
Q

How to clean data that contains TR

A

replace TR with a numerical value from 0 to 0.05 e.g. 0.025

22
Q

What does the data cover and why is this is a limitation

A

Data only covers May- October, not a representative of the whole year

23
Q

What months are you,issuing and what impact does this have

A

Winter months are missing.
We would expect the mean rainfall to be larger including them

24
Q

Sampling frame

25
Disadvantage using sample survey
Uncertainty due to biss
26
Standard deviation in coding
At the end, multiply by the denominator