Lecture 1 Flashcards
(31 cards)
Population
Set of units we are interested in studying
Variable
Characteristic or property of an individual population unit
Census
Act of measuring characteristics on ALL UNITS IN A POPULATION
Sample
Subset (Part) of the population on which characteristics are measured
Some good statistical practices (good thins to do)
REPRESENTATIVE SAMPLE, ask good questions, make statements we have the data to support
Descriptive statistics (Def)
Used to describe observed phenomenons using quantitative tools. Summarize and describe world around us (quantitatively)
Inference (def) - (Statistical inference)
Generalizing a conclusion from a sample to a larger population
Experimental studies (data) generality (2 concepts)
One attribute of interest (THE TREATMENT) that we have control over and one attribute that we’re interested in observing (THE RESPONSE)
Key part of experimental study
We control who gets the treatment (so there is a control group or placebo)
Why control who gets treatment in experimental study and why have a control group
To see if it is really the treatment that caused the response
Observational studies (difference w/ experimental ?)
We do not control who gets the treatment
Confounding variable (def)
Variable that could be causing both the treatment and response variables
Problem w/ observational studies
We CAN NOT discount (ignorer/considerer negligeable) confounding variables or factors
Verb to use for an observational study’s results
We conclude association (not causation)
How to eliminate counfounding. What is this called
Making each subset of the population representative (equal probability of being selected for the sample) -
Random samples
Random samples ‘‘synonym’’
Mini-populations
Selection bias (def)
A subset of the experimental units in population (a subset of the population/ a subset of populaton that we could measure variable on) has no chance of being selected
Non-response bias
Inability to obtain data on all experimental units selected for the sample (some choose ‘‘don’t know’’ or ‘‘no answer’’)
Measurement error and causes
Inaccuracies in the values of data recorded (poorly calibrated lab equipement, vague questions, measuring variable that represents large concept)
Two ways of summarizing data
Numerically and graphically
Graphical displays are good for … (2)
Building and guiding intuition, provide picture of data
Numerical summaries are good for … (2)
Confirming intuition and giving concise impressions
Quantitative data are …
numerical in nature (blood pressure, temperature, …)
Qualitative data are …
Categorical in nature (Hair color, uni major, tumour vs no tumour)