Data analysis Flashcards

(25 cards)

1
Q

What are the types of main variables?

A

Numerical and Categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the types of numerical variables?

A

Continuous and discrete

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are continuous variables?

A

Real numbers, e.g. height in metres

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are discrete variables?

A

Integer numbers, e.g. number of inhabitants in a town

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the types of categorical variables?

A

Nominal and ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are nominal variables?

A

Unordered categories, e.g. colours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are ordinal variables?

A

Ordered categories, e.g. clothes sizes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are descriptive statistics?

A

Analysis of data that helps describe, show or summarise data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is wanted to summarise?

A

The central tendency of the data
The variability of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is central tendency?

A

A value that is used to describe the centre of the data, e.g. the mean, median or mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the range?

A

The difference between the highest and lowest data point, often reported as the highest minus the lowest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the interquartile range (IQR)?

A

Represents from 25th to 75th percentile
Contains approximately one half of the observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is standard deviation?

A

Conveys how widely or tightly the data is distributed from the centre

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What do different standard deviations (SD) show?

A

Low - Data points are close to the mean
High - Data points are spread out

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does SD show with normally distributed data?

A

68% is within 1 SD either side of the mean
95.5% falls within 2 standard deviations either side of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does data analysis show?

A

Shows obvious data patterns
Shows potential problems (outliers, correlations of independent values)
Helps to find problems (typos, mismatch of units)

17
Q

What is the use of plotting data?

A

Looking for trends/patterns
Checking distribution
Whether data conforms to assumptions of a test

18
Q

How does a boxplot work?

A

The box has 50% of all the data
The bottom is the 1st quartile and the top is the 3rd quartile, with in between being the IQR
The solid line indicates the median, whilst the dashed line is the mean
The t-shaped whiskers are the highest and lowest point within 1.5x the IQR
Anything further is an outlier

19
Q

What is standard error?

A

Standard deviation/ sqR sample size

20
Q

What do scatter plots show?

A

X-axis - Numerical
Y-axis - Numerical

21
Q

What do scatter plots and box plots show?

A

X-axis - Categorical
Y-axis - Numerical

22
Q

What does a bar chart show?

A

X-axis - Categorical
Y-axis - Frequency

23
Q

What does a histogram show?

A

X-axis - Numerical
Y-axis - Frequency

24
Q

What does a contingency table show?

A

X-axis - Categorical
Y-axis - Categorical

25
What do charts with error bars show?
X-axis - Categorical Y-axis - Numerical (means)