DVT: Data, Variables and Tables Flashcards

Question 1

Q

What 2 ways can data variables be classified?

Answer

A

Numerical:

Quantitative
Individuals measured or count

Categorical:

Qualitative
Individuals classified into groups

Question 2

Q

What are examples of numerical variables?

Answer

A

Weight
BP
Prothrombin time
Age
No. long distance flights in last month
No. cigarettes per day

Question 3

Q

What are examples of categorical variables?

Answer

A

Smoker/non-smoker
On anticoagulation medicine?
History of cancer
Alive after 6 months?
Blood group type
Causes of death
Pain assessment
Stage of cancer

Question 4

Q

How are numerical values measured?

Answer

A

On interval scales (interval or distance b/w points on scale has precise numerical meaning

Question 5

Q

What is binary data?

Answer

A

Subtype categorical

Can only take 2 values (often yes/no)
Also known as dichotomous

Question 6

Q

What is nominal data?

Answer

A

Subtype categorical

More than 2 categories, but no natural order (A, B, AB, O)

Question 7

Q

What is ordinal data?

Answer

A

More than 2 categories, with a natural order e.g. Stage I, II, III, IV

Question 8

Q

How can data be summarised?

Answer

A

Numerical - Measures of central tendency (mean, median), measures of spread (standard deviation, range)
Categorical - Frequencies, proportions, percentages. Use tables and charts to do this

Question 9

Q

Why should data be summarised?

Answer

A

Data monitoring - Ensure what’s being collected is valid to spot errors that can be corrected
Data checking/cleaning - Ensure collected data correct, identify any outliers
Summary of results - Basic description, potential precursor to more complex analysis

Question 10

Q

How can central tendency be measured?

Answer

A

Mean - Average of all values, good measure of centre at a symmetrical distribution. Much more useful in practice but over influenced by extreme values
Median - Value at which 50% data points lie, better for skewed distributions because only slightly affected by extreme values

Question 11

Q

Describe symmetrical bell shape

Answer

A

Mean = Median

Question 12

Q

Describe negatively skewed bell shape

Answer

A

Mean < median, long tail to left

Question 13

Q

Describe positively skewed bell curve

Answer

A

Mean > Median, long tail to right

Question 14

Q

Can range be a measure of spread?

Answer

A

Dependent on outliers (i.e. extreme values)

Range doesn’t indicate whether these values are distinct from main body of data (larger sample, wider range)

Useful if data not normal (symmetrical)
Splits data so there are equal frequencies in each group

Question 15

Q

Define reference range and how it can be estimated?

Answer

A

A set of values within which a specific test result is considered to be within the normal or healthy range for a particular population

Can be estimated by a large sample of individuals from the defined population is recruited, and their results for the specific test or measurement are collected.

The collected data is analyzed to calculate:
Mean: The average value of the test or measurement across the sample.
Standard Deviation (SD): A measure of how much the individual results deviate from the mean.

The reference range is usually defined as the mean plus or minus a certain number of standard deviations. Commonly, the 95% reference range is calculated as mean ± 2 SD. This means that approximately 95% of individuals in the defined population would be expected to have values within this range.

Question 16

Q

How can numerical data be further classified?

Answer

Study These Flashcards

A

As continuous or discrete:

Continuous - All possible values within range, Continuous numerical data refers to numerical data that can take on any value within a given range, including decimals and fractions. Continuous data is measured rather than counted

Discrete - Takes certain values in given range, Discrete numerical data refers to data that can only take on certain, separate values, typically whole numbers, and are usually counted rather than measured

Question 17

Q

How do we calculate confidence intervals?

Answer

Study These Flashcards

A

Mean +/- 2 Standard error

DVT: Data, Variables and Tables Flashcards

(17 cards)