1 Collecting Data Flashcards

1
Q

raw

A

data before it is sorted
eg data from a survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

quantitative

A

numerical
eg height

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

qualitative

A

non numerical
eg colour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

continuous

A

can take any value on a scale
eg weight, length

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

discrete

A

can only take particular values on a scale
eg shoe size, no. siblings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

categorical

A

can be sorted into non overlapping/ranked categories
eg gender

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ordinal

A

can be written in order / be given a numerical ranking scale
eg test scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

bivariate

A

involves pairs of related date
eg working hours and pay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

multivariate

A

3+ sets of data
eg plants: colour, leaf size and height

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

do intervals (usually) need to be equal widths

A

no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

primary

A

collected by/for the user

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

secondary

A

collected by/for someone other than the current user
eg websites, newspapers, research articles, databases, census returns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

primary vs secondary (adv and disadv)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

census

A

survey/investagation with data from EVERY MEMBER of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

sampling units

A

people or items that are to be sampled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

sampling frame

A

a list of all the sampling units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

eg of population, SU and SF
(number of hours spent on hw is more in Y7 and Y9

A

P: all Y7 and Y9 students
SU: students in Y7&9
SF: list of Y7&9 students

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

petersen capture recapture formula

A

m/n = M/N
no. marked in recapture/number in recapture = original number marked/total population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

assumptions made in capture recapture method

A
  • P(caught) is same for all individuals
  • marks are not lost and always recognisable
  • sample size is large enough to be representative of population
  • population has not changed (no members have entered or left, no births or deaths between release and recapture)
  • marked individuals have mixed with rest of population between release and recapture
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

random sample

A

every sampling unit (member of population) has an equal chance of being included

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

pros and cons of random sample

A

P: more likely to be representative of population if sample size is large
- choice of members of sample is unbiased

D: need a full list of population
- need a large sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

problems of random sample

A

random numbers may be out of range
random numbers may be repeated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

how to generate random sample

A

1) RNG (eg calc) / names from a hat / random number table
2) ignore numbers out of range and duplicates
3) do this X times

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

judgement sampling

A

using your judgement to choose a sample which is representative of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
opportunity sampling
using the people or items available at the time
26
cluster sampling
use natural groups which occur in data list of clusters = sampling frame and some clusters are randomly selected to make up the sample (eg geographical areas)
27
systematic sampling
choose a start point in the sampling frame at random and choose items at regular intervals
28
quota sampling
group the population by chosen characteristics and take a quota from each group (eg age/gender)
29
how to decide if a sampling method is suitable
- will it be biased - will SS be sensible - how quick and easy is method - how expensive
30
stratified sample
contains members of each stratum in proportion to the size of that stratum. sample from each stratum is selected randomly
31
describe how to do a stratified sample
1) calculations THEN -order each group into order - assign each a random no. - choose the relevant no. of people to survey
32
data collection sheet
table/tally chart for recording your results
33
direct observation
recording behaviour patterns systematically as you observe them
34
independent variable
explanatory variable what you control (but change)
35
dependent variable
response variable affected based on your changes to the explanatory variable
36
extraneous variable
variable that you are not interested in but that could affect your results
37
laboratory experiment
38
field experiment
39
natural experiment
40
LabE pros and cons
41
FieldE pros and cons
42
NatE pros and cons
43
simulation
can be used to model random real life evens to predict what could actually happen. easier and cheaper than collecting and analysing real data
44
an experiment is valid/reliable if…
when replicating an experiment gives very similar data
45
questionnaire
set of questions designed to obtain data
46
open vs closed question
open: no suggested answers closed: answers to choose from
47
con of open questions
every respondent gives a different answer so is hard to summarise and analyse the answers
48
problem with opinion scales
most will answer somewhere near the middle unlikely to indicate a strong opinion - do not want to seem extreme
49
what to do in questionnaires
- short questions, simple language - no biased/leading questions - intervals that don’t overlap - options cover all possibilities (0/never/don’t know) - include time frame - avoid questions respondents are unlikely to answer honestly
50
interview pros and cons
51
anonymous questionnaire pros and cons
52
pilot survey
conducted on a small sample to test the design and methods of the survey checks: - respondents understand questions - closed questions include all likely answer options - questionnaire collects the information needed
53
random response method
54
how to answer estimate question about RRM
55
outliers/anomalous data
values that do not fit the pattern of the data can be ignored if it is due to a measuring/recording error
56
cleaning data
- identifying and correcting/removing inaccurate/extreme values - removing units or other symbols form data - deciding what do to with missing values
57
control groups? and where are they often used
58
matched pair test
where two group of people are used to test theffects of a particular factor each individual in a group is paired with an individual in the second group with similar characteristics barring the factors which is to be studied
59
pros and cons of matched pairs
P: can control for different factors C: may have to test a large group at first to find enough matched pairs for a good test
60
who is often used in MPTs
identical twins- easier to see different results disadvantage: limited supply of willing twins
61
hypothesis
an idea that can be tested by collecting and analysing data
62
designing investigations- what do you need to consider?
63
difference between field and natural experiments
a field experiment the researcher manipulates the independent variable (IV), while in a natural experiment the researcher does not