CLINICAL STATS 1 Flashcards

(38 cards)

1
Q

Why is it important to learn clinical statistics?

A

To develop research skills, interpret data effectively, and make independent judgments in clinical practice.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the main types of qualitative (categorical) data?

A
  1. Binary – Two categories (e.g. Dead/Alive, Yes/No, Disease/No disease)
  2. Nominal – Named, unordered categories (e.g. Blood type, Occupation)
  3. Ordinal – Ordered categories (e.g. Cancer stages, Education level, Socioeconomic status)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Give examples of binary categorical data.

A

Dead/Alive
Male/Female
Treatment/Placebo
Disease/No disease
Exposed/Unexposed
Caries (Yes/No)
Heads/Tails

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give examples of nominal categorical data.

A

Blood type
Marital status
Hair colour
Country of residence
Occupation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Give examples of ordinal categorical data.

A

Stages of cancer
Socioeconomic status
Education level
Rating on a scale (e.g., 1–5)
Age categories
Rank order (e.g., race position)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the two main types of quantitative (numerical) data?

A

Discrete – Countable values (e.g. number of teeth, number of patients)
Continuous – Can take any value within a range (e.g. age, BP, time, speed)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Classify the following data types:
Age
Age last birthday
Number of teeth
Pocket depth
Socioeconomic status
Has patient visited dentist in last year
Hardness of filling material
Colour of filling
Type of radiograph
Calcium:phosphorus ratio
Severity of gum disease

A

Age – Continuous
Age last birthday – Discrete
Number of teeth – Discrete
Pocket depth – Discrete
Socioeconomic status – Ordinal
Visited dentist (Y/N) – Binary
Hardness – Can be Nominal, Ordinal, or Numerical depending on how it’s recorded
Colour of filling – Nominal
Type of radiograph – Nominal
Calcium:phosphorus ratio – Continuous
Severity of gum disease – Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

❓ What is the difference between descriptive and inferential statistics?

A

👉

Descriptive statistics: Organise, summarise, and present data (graphs + numbers)
Inferential statistics: Interpret data, make predictions or conclusions from sample to population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are common graphical techniques for different data types?

A

Categorical data → Bar charts, Frequency plots
Continuous data → Histograms, Boxplots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do boxplots show?

A

Minimum
Q1 (25th percentile)
Median (Q2)
Q3 (75th percentile)
Maximum
Outliers
Also shows distribution shape and spread.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

❓ What does it mean if a distribution is left or right skewed?

A

Left skewed: Tail on the left, most values are high, median closer to Q3
Right skewed: Tail on the right, most values are low, median closer to Q1
Symmetric: Mean ≈ Median, bell-shaped

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the three measures of central tendency?

A

Mean: Average (sensitive to outliers)
Median: Middle value (not affected by outliers)
Mode: Most frequent value (can be used for categorical or numerical data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When is the median preferred over the mean?

A

When there are extreme values or outliers
Because the median is more robust

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the main measures of dispersion/variation

A

Range = Max - Min
Interquartile Range (IQR) = Q3 - Q1
Variance = Average of squared deviations
Standard deviation = √Variance
Coefficient of variation = SD ÷ Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are quartiles?

👉

A

Q1: 25% of observations below it
Q2: Median (50%)
Q3: 75% of observations below it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you calculate Interquartile Range (IQR)?

A

IQR=Q3−Q1
Shows spread of middle 50% of the data.

17
Q

What is a deviation in statistics?

A

The difference between a data point and the mean:

Deviation = Value − Mean
Deviation=Value−Mean
Shows how far a point is from the average.

18
Q

Why do we square deviations when calculating variance?

👉

A

To eliminate negatives
So that deviations don’t cancel each other out
Gives greater weight to values far from the mean

19
Q

What is sample variance?

A

The average of squared deviations from the mean:

20
Q

What is sample standard deviation?

A

The square root of the variance:

Most commonly used measure of spread
Same units as original data
Small SD = data close to mean
Large SD = data more spread out

21
Q

Why is the standard deviation useful?

A

Gives a clear measure of variation around the mean
Helps you understand how spread out the data is
Commonly used in clinical and research data analysis

22
Q

what does it mean if a data is left skewed

23
Q

what does it mean if a data is right skewed

24
Q

What is the first thing you should do when you get new data?

A

Plot your data!

25
26
Which of the following graphics should be used for categorical variables?
Bar Chart
27
Left-skewed
28
All of the following are measures of data variation EXCEPT: Variance Interquartile range Standard deviation Mean
mean
29
All of the following are influenced by outliers EXCEPT:
Interquartile range
30
If you have right-skewed data, which of the following will be true?
Mean > median
31
Which of the following data/variables would be considered a continuous variable? a. Favorite fruit b. Gender c. Decade of birth d. Weight
weight
32
Which of the following variables would be considered a nominal (categorical) variable? a. Favorite fruit b. Height c. Decade of birth d. Weight
favourite fruit
33
34
If the distribution of my data is left-skewed, which of the following is correct? a. Mean < median b. Mean = median c. Mean > median d. Unable to judge
mean
35
Which of the following graphics should be used for categorical variables? a. Histogram b. Box plot c. Bar Chart d. Stem-and-leaf plot
bar chart
36
37
What is the first thing you should do when you get new data? a. Run a t-test b. Calculate a p-value c. Plot your data d. Run multivariate regression
plot your data
38