Stats Flashcards
(53 cards)
Qualitative Variable
A non-numerical variable, like hair colour or eye colour
Quantitative Variable
A numerical variable, like length, time
Continuous variable
Can take any value within a given range, e.g. height, time, age,
Discrete variable
Can only take certain values, e.g. shoe size, cost in £ and p, number of coins.
What type of data do histograms deal with?
Continuous data –> no spaces between the bars
Define the mode
The value which occurs most often.
Define the mean average
Where you add up all the numbers and then divide by the number of numbers
Define the median
The median is the middle number in an ordered list.
When is the mode used?
You should use the mode if the data is qualitative (colour etc.) or if quantitative (numbers) with a clearly defined mode (or bi-modal). It is not much use if the distribution is fairly even.
When is the mean used?
This is for quantitative data (numbers), and uses all pieces of data. It gives a true measure, and should only be used if the data is fairly symmetrical (not skewed).
When is the median used?
You should use this for quantitative data (numbers), when the data is skewed, i.e. when the median, mean and mode are probably not equal, and when there might be extreme values (outliers).
What is the range?
The range is the largest number minus the smallest (including outliers).
What’s an outlier?
An extreme value
Ordinal variables
An ordinal variable is a categorical variable for which the possible values are ordered.
Nominal variables
Nominal variables have two or more categories without having any kind of natural order. They are variables with no numeric value, such as occupation or political party affiliation.
Binary variables
Binary variables are variables which only take two values.
What are the various sources of data?
Routinely collected data: - Mortality and census data - Hospital activity data - Primary care data - Infectious disease notifications - Regular national surveys (e.g. Health Survey for England) Research study data
Continuous variables
Have numerical values
Measurements are on a continuous scale i.e. can take an infinite number of distinct values
Discrete variables
Also known as count variables
Have numerical values, but they must be integers e.g. Number of fillings
What are the appropriate graphical presentations to use for 1 categorical variable?
Bar chart
Pie chart
Frequency table
What are the appropriate graphical presentations to use for 1 continuous variable?
Histogram
Bar chart
What is the appropriate graphical presentation to use for a categorical outcome and categorical exposure?
Contingency table
What is the appropriate graphical presentation to use for a numerical outcome and categorical exposure?
Box and whisker plot
What is the appropriate graphical presentation to use for a numerical outcome and numerical exposure?
Scatter plot