Module 6 Data Science Flashcards

(50 cards)

1
Q

What is the fundamental distinction between different types of data?

A

The distinction between quantitative and qualitative data is crucial for effective data science.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define quantitative data.

A

Numerical data that can be measured and compared using mathematical methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Give examples of quantitative data.

A
  • Age
  • Height
  • Weight
  • Salary
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define qualitative data.

A

Descriptive data that cannot be measured using mathematical methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Give examples of qualitative data.

A
  • Opinions
  • Emotions
  • Perceptions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the primary use of quantitative data?

A

To draw conclusions and make predictions based on numerical analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the primary use of qualitative data?

A

To understand experiences and perspectives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the two categories of data collection methods?

A
  • Active
  • Passive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Manual Active data collection?

A

Involves actively engaging with participants to collect data, such as surveys and interviews.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Manual Passive data collection?

A

Involves collecting data without actively engaging with participants, such as observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Computerised Active data collection?

A

Using technology to actively collect data from participants or customers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Computerised Passive data collection?

A

Collecting data automatically through technology, without engaging with participants.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define primary data.

A

Data collected by the researcher themselves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Give examples of primary data.

A
  • Diaries
  • Original documents
  • Government documents
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define secondary data.

A

Data obtained from existing sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Give examples of secondary data.

A
  • Journal articles
  • Textbooks
  • Encyclopedia websites
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the four criteria for evaluating data quality?

A
  • Relevance
  • Accuracy
  • Validity
  • Reliability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What does relevance refer to in data quality?

A

The extent to which the data is directly related to the research question.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does accuracy refer to in data quality?

A

The degree to which the data reflects the true values of the variables being measured.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does validity refer to in data quality?

A

The degree to which the data accurately measures what it is supposed to measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What does reliability refer to in data quality?

A

The consistency of the data over time and across different sources.

22
Q

What are errors in data collection?

A

Errors can arise in collection, entry, or processing.

23
Q

What is uncertainty in data?

A

A lack of precision or ambiguity.

24
Q

What are limitations in data?

A

Inherent constraints of the data, such as sample size or coverage.

25
Define informatics.
The study of the processing and management of information.
26
What is data visualisation?
The interpretation and presentation of data using tools like graphs and infographics.
27
What is structured data?
Data organized into a specific format, such as a table or spreadsheet.
28
Give examples of structured data.
* Numerical data * Categorical data * Dates and times
29
What is unstructured data?
Data that does not have a specific format or structure.
30
Give examples of unstructured data.
* Text * Images * Audio files
31
What is alternative data?
Data types like likes, emoticons, and memes used for feedback.
32
What is blockchain technology?
A distributed ledger that provides secure and transparent data management.
33
What is data management in blockchain?
Each block contains verified transactions that are nearly impossible to alter.
34
What is data verification in blockchain?
Ensured by cryptographic algorithms and consensus among network participants.
35
What are smart contracts?
Self-executing agreements that automate data verification and management.
36
What are the risks of using autofill features?
They pose risks to data privacy if used on public devices.
37
What is big data?
Large and complex datasets that can't be processed by traditional systems.
38
What are the three Vs of big data?
* Volume * Variety * Velocity
39
What is data warehousing?
Centralising and storing large amounts of data in a single repository.
40
What is data mining?
Extracting and analysing large amounts of data to uncover patterns and insights.
41
What does the impact of data scale refer to?
How size and complexity of a dataset affect analysis and interpretation.
42
What are digital footprints?
The accumulation of large-scale datasets raises concerns about sensitive personal information.
43
What are the key ethical considerations in data use?
* Privacy * Consent * Data security * Transparency * Responsibility * Data accuracy * Confidentiality
44
What is the primary legislation regulating data collection in Australia?
The Privacy Act 1988.
45
What are the Australian Privacy Principles (APPs)?
13 principles governing various aspects of handling personal information.
46
What does the Notifiable Data Breaches (NDB) Scheme require?
Notification of data breaches likely to cause serious harm to individuals.
47
What does data sovereignty refer to?
The right of Aboriginal and Torres Strait Islander Peoples to control their data.
48
What is the importance of data literacy?
Essential for interpreting, analysing, creating, and communicating data.
49
What are data swamps?
Disorganised data repositories that can lead to misleading conclusions.
50
What is the role of educating users in data communication?
Promoting ethical and effective data communication and understanding privacy concerns.