Midterm Flashcards
(37 cards)
Experiments vs observational studies
Exp. -> can put a control (témoin), researcher has control. Observational study -> we observe. We only associate and do not establish causality
Counfounding variable
Variable that can affect both the treatment and the response
Random sample
Representative of the population. Not biased
3 types of bias
Selection bias. Non-response bias. Measurement errors
Selection bias explanation
Subset of the experimental units of population is excluded/no chance of being selected for exp. = not a random sample
Non-response bias explanation
Unability to obtain data on all experimental units selected for the sample
Measurement error explanation
Inaccuracies in values recorded.
Commonly used displays for qualitative data
Pie charts, bar plots
Simpson’s paradox explanation
Third confounding variable changes the relationship between two other QUALITATIVE variables. Imbalance of the distribution of the categories of the third category with respect to the first two.
Graphical displays for quantitative data
Boxplot, dotplot, histogram
Adv/Disadv of the dotplot
A : Get to see all the data points + Easy to interpret D: Gets messy quickly if lots of data
Adv/Disadv of histograms
A: Easy to pick up on all aspects of quantitative data (centre,spread, etc.) + made by most statistical packages D: Different bin widths can give diff. intepretations
Mode
Number (or centre of the bin) that occurs most often (that has the most observations)
Frequency vs Percentage
Frequency = number of observ in this bin. Percentage = percentage of total obs. it represents
Different possibilities for the mode
Unimodal, Multimodal or no mode
Adv/Disadv. of mean
A: Good for estimating pop. mean. + good inferential properties D: Outliers and skewed data
Adv/disadv. of median
A: Easy to interpret + not infl. by outliers D: Bad inferential properties + longer to calculate
Adv/disadv of mode
A: Highest concentration of data + bimodal data D : Bin width (class definition) matters
sample vs pop mean symbols
X bar and μ
Sample var/std vs Pop. var/std
S square or S vs sigma square or sigma
Different measures of centre
Mean, median, mode
Different measures of spread
Range, IQR, Var or std
Something particular in variance
squared units
% of observations with Z-score values 1,2.3 w/ empirical rule
68, 95, 99.7