L2 Where data comes from Flashcards

(77 cards)

1
Q

what does one zettabyte equal?

A

a trillion gigabytes
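
The conversion can be checked with a little arithmetic. This is a minimal sketch, assuming decimal (SI) units where 1 gigabyte = 10^9 bytes and 1 zettabyte = 10^21 bytes; the constant and function names are illustrative, not from the lecture.

```python
# Assumption: decimal (SI) prefixes, not binary ones (GiB / ZiB).
GIGABYTE_BYTES = 10**9    # giga  = 10^9
ZETTABYTE_BYTES = 10**21  # zetta = 10^21


def zettabytes_to_gigabytes(zettabytes):
    """Convert a number of zettabytes to gigabytes."""
    return zettabytes * ZETTABYTE_BYTES // GIGABYTE_BYTES


# One zettabyte is 10^12 gigabytes, i.e. a trillion gigabytes.
gigabytes_in_one_zettabyte = zettabytes_to_gigabytes(1)
print(f"{gigabytes_in_one_zettabyte:,}")
```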

2
Q

in 2024, what was the volume of data, in zettabytes?

A

147

3
Q

where does all of this data now come from?

A

electronics (phones / wearables / home electronics)

4
Q

by 2025: what is estimated?

A

that 80% of global data will be unstructured

5
Q

what is meant by unstructured data?

A

it isn't stored in fixed, known locations - the data is everywhere

6
Q

what are surveys good at collecting?

A

large amounts of data

7
Q

what kind of questions do surveys typically have?

A

standardised questions

8
Q

who are surveys typically given to?

A

a sample of the population

9
Q

why are surveys often not given to more people?

A

time consuming + expensive

10
Q

more data = an increase in…

A

…data scepticism

11
Q

more data = more space for…

A

…misreporting and misrepresenting

12
Q

what in recent years has increased the amount of data produced?

A

online tools and surveys

13
Q

more bad data is being produced now because of what?

A

poor-quality surveys

14
Q

what are fewer people also doing now, and what does this lead to?

A

answering surveys - which causes issues for the data produced

15
Q

what happens every time we google something on our phones?

A

data is produced

16
Q

what percentage of British people said they believe television news readers tell the truth?

A

52%

17
Q

……. of British people trust journalists to tell the truth

A

28% (just over 1/4)

18
Q

Less than …… of British people think that politicians and gov. ministers generally tell the truth

A

20%

19
Q

who are widely regarded as people who do tell the truth?

A

scientists and professors

20
Q

where is the UK ranked for trust in its media?

A

very low

21
Q

which media are trusted the least / the most?

A

least = social media
most = radio

22
Q

what is administrative data originally used for?

A

keeping records (by governmental departments and agencies)

23
Q

administrative data covers…

A

…entire populations of registered people

24
Q

what is an example of a record that is administrative data?

A

health / tax / benefits / car reg / work permits

25
administrative data is not ........... available
publicly
26
why is administrative data not publicly available?
as much of it is very personal information
27
what is administrative data often used for when helping with surveys?
as a sampling frame to get a sample
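
As a rough illustration of using a sampling frame, the sketch below draws a simple random sample from a list of registered people. The frame contents, the function name and the sample size are all hypothetical; real administrative registers are not publicly available, and real surveys often use more elaborate designs than simple random sampling.

```python
import random


def draw_simple_random_sample(sampling_frame, sample_size, seed=None):
    """Pick sample_size distinct entries from the frame, each equally likely."""
    rng = random.Random(seed)  # a seed makes the draw repeatable
    return rng.sample(sampling_frame, sample_size)


# Hypothetical sampling frame: made-up IDs standing in for an administrative register.
frame = [f"person_{i:04d}" for i in range(1, 1001)]

# Draw 50 people to survey instead of contacting all 1,000.
sample = draw_simple_random_sample(frame, 50, seed=42)
print(len(sample), "people selected from a frame of", len(frame))
```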
28
will administrative data be the same as data from other surveys?
no
29
administrative data is of high quality - true or false + why?
true - as the government collected it
30
what is in place to protect people's data?
strict security protocols
31
how is big data generated?
digitally (through online and transactional data)
32
big data is in huge ........... and high ...........
volume / velocity
33
what are examples of different sources of big data?
clicks / shares / purchases
34
is big data open access data? and why?
no - as it's not collected for research purposes and is often used commercially
35
as humans we create d... t.....
data trails
36
google trends example =
spike in "unemployment" google searches over the pandemic
37
who was William Petty?
17th century demographer
38
17th century demographer =
William Petty
39
what did William Petty do? (3 parts)
- began surveying
- collecting numbers on people living in London
- surveying GDP
40
what does census mean in Latin?
to estimate
41
a census is essentially a big...
...survey
42
how many people are in a census? + 2 examples
all people from a population under study - i.e. all in country / all in education
43
how often does a census happen in the UK? + why?
every 10 years - as they are expensive and time consuming
44
when did a census last not happen in the UK?
during the peak of WW2
45
when was the most recent UK census?
2021
46
when were censuses first conducted?
in BC / early AD
47
who can now answer a census?
one household member on behalf of the whole household
48
primary data =
data collected directly by researchers for a very specific purpose and used for that purpose
49
example of creating primary data =
conducting a survey + using results in report for a dissertation
50
primary data is directly for what?
one's own use
51
pros of primary data =
- up to date / current
- specific to the question
- researcher has full control (can pick the q's asked)
52
cons of primary data =
- time consuming
- sometimes impossible
- can be expensive
53
what happens if collecting primary data is impossible?
have to rely on and use secondary data instead
54
secondary data =
data that has previously been collected by someone else for a different purpose - but is available for others to use
55
example of using secondary data =
viewing others' surveys and collected data + analysing them to answer Q's for a report / coursework
56
secondary data is the number 1 source of data in the UK - true or false?
true
57
pros of secondary data =
- affordable
- easily accessible
- longitudinal studies are possible
58
what does the pro of longitudinal studies (in secondary data) mean?
can compare old data to more recent data - see how it has developed and changed over time
59
cons of secondary data =
- can be outdated
- not specific to your question
- can be time consuming to begin with
60
why might secondary data sometimes be time consuming?
having to find specific data that relates to what you want
61
how to tell good from bad sources of data 1 =
sources - who produced the data?
62
how to tell good from bad sources of data 2 =
purpose - why was it produced?
63
how to tell good from bad sources of data 3 =
time - when was it produced?
64
we can trust what data sources?
- public institutions (ONS)
- respected research companies
65
we should be sceptical of what data sources?
- unknown institutions
- sources with dubious reputations
66
what data sources can we NEVER trust?
- gov. sources known to use 'fake news' to influence opinions
- info published by satirical newspapers
67
lots of data has ............ .........
underlying motives
68
example of satirical news =
2012: The Onion newspaper said rural whites prefer the president of Iran to Obama - which some believed
69
Trump running Twitter polls =
2016: where he had underlying motives + leading questions = example of push polling
70
are push polls real surveys?
no
71
what is push polling aiming to do?
sway and influence voters
72
when is push polling most often used?
during political campaigns
73
surveys should never promote what kind of ideas, nor try to...
propaganda ideas / change the minds of respondents
74
leading questions =
questions that lead respondents to answer in a certain way (unbalanced)
75
loaded questions =
questions that force respondents to answer in a way they might not agree with (loaded with an assumption)
76
it is important to always consider ........ data was collected
...when...
77
a good first critical question when looking at data is...
...where did the data come from?