Descriptive Statistics Flashcards

1
Q

What is a histogram?

A

A diagram with rectangles whose area is proportional to the frequency of a variable and whos consist of bins (x-axis) and its y-axis is the frequency (number of observations).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are descriptive statistics?

A

A visual descriptor of a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are inferential statistics?

A

Using your data to infer something about the general population

Making predictions about a broader set of people, situations or events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the R command for histograms?

A

hist()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the mean?

A

The average of the results of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you calculate the mean?

A

Add up each of the data points and divide by the number of data points total (N)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the median?

A

The middle observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the mode?

A

The most frequently observed value in the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the measures of central tendency?

A

Mean

Median

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the measures of spread?

A

Range

Interquartile Range

Standard Deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the range?

A

The full range of values in the data set.

(0%-100%)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do you calculate the range?

A

Maximum - Minimum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the interquartile range?

A

The middle half of the data set

(25%-75%)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the standard deviation?

A

How far each point is from the centre of mass.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

If your data is normal, what percentage of the population is within 1 standard deviation of the mean?

A

68%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If your data is normal, what percentage of the population is within 2 standard deviations of the mean?

A

95%

17
Q

If your data is normal, what percentage of the population is within 3 standard deviations of the mean?

A

99%

18
Q

What is the R command for calculating standard deviations?

A

sd()

19
Q

The symbol s refers to?

A

The standard deviation of a sample

20
Q

The letter σ represents?

A

The standard deviation of a population.

21
Q

What is the formula for standard deviations?

A

The square root of the variance.

22
Q

What is variance?

A

The average of the squared differences from the Mean.

23
Q

What are the four steps for calculating the variance?

A
  1. Work out the Mean (average)
  2. For each number: subtract the Mean and square the result (the squared difference - to avoid negatives and positives cancelling each other out)
  3. Calculate the average of those squared differences
  4. Divide the average of the squared differences by N for population data or N-1 for sample data
24
Q

What is the difference between calculating variance when using population data or sample data?

A

Population data = divide by N when calculating Variance

Sample data = divide by N-1 when calculating variance

25
Q

Why can SD’s help determine clinical significance?

A

Tell us what is normal or abnormal.

26
Q

What descriptive statistics should you use to describe nominal scale variables?

A

Frequency Tables

27
Q

What are frequency tables?

A

A table consisting of the number of times you see each observation.

28
Q

When do you use ‘cross tabulation’?

A

When you want to look at nominal variable relationships.

(e.g. males with brown eyes, females with green eyes etc.)