Lecture 1 Flashcards
What is a variable?
a characteristic of a unit that may vary for different observations
What are the two main types of variables (they each go by 2 terms)?
qualitative (categorical) & quantitative (numerical)
Qualitative uses which 2 scales of measurement?
nominal & ordinal
Nominal
order does not matter e.g. gender
Ordinal
order does matter e.g. education levels
Quantitative uses which 2 scales of measurement?
interval & ratio
Interval
difference of quantities that are meaningful but ratios of quantities cannot be compared e.g. temperature in C
Ratio
ratios of quantities that are meaningful
What is an observational study?
the investigator observes a variable of interest of an existing sample in order to draw conclusions
What is an experimental study?
the investigator examines how a response variable behaves when the researcher manipulates one or more factors to determine the effect of those factors on the response
Cross-sectional data
data collected at the same or approximately the same point in time
Time series data
data collected over several time periods
Spatio-temporal data
data collected at different locations over several time periods
Statistical sampling
the procedure to select a subset from a statistical population that is representative of the population
Frequency for a particular category
the number of times the category appears in the data set
Relative frequency for a particular category
the fraction or proportion of the time that the category appears in the data set
How are qualitative (categorical) variables typically summarized/visualized?
frequency table, bar chart & pie chart
Frequency table
displays the possible categories along with the associated frequencies or relative frequencies
How are quantitative (numerical) variables typically summarized/visualized?
stem-and-leaf plot, histogram & box-and-whisker plot
What does a measure of center attempt to do?
report a typical value for the variable e.g. mean, median & mode
What is it called when a measure of center is calculated with sample data?
statistic
What is it called when a measure of center is calculated with popular (e.g. census data)?
parameter
What is the population mean, how is it denoted & what is its formula?
denoted by mu_x, it is the sum of all the population values divided by the size of the population (N) [insert image]
What is the sample mean, how is it denoted & what is its formula?
denoted by Xbar, it is the sum of all the sample values divided by the sample size (n) [insert image]