Stats Year 2 Flashcards
(140 cards)
what are 4 claims of science
- rationality: rational methods
- truth:
- objectivity: can be tested and verified
- Reality
what is science
understanding/acquiring more knowledge of world through observations and experiments
what is scientific method?
- method which provide rational truth about science/world
- Inquire using PEL method: Question, -> hypothesis set -> presuppositions (parameters) + Evidence -> Logic -> conclusion
What is the PEL method
Inquire using PEL method: Question, -> hypothesis set -> presuppositions (parameters) + Evidence -> Logic -> conclusion
what is deduction and induction logic?
- deduction: given model to infer expected data. E.g every mammal has a heart, every horse is a mammal. Hence everyhorse has a heart
- Induction: have data/observed data to infer or come up with a model that represents or describes data. Eg: every observed horse has a heart, conclusion: everyhorse has a heart
what is statistics
- methods to study and measure nature of the world/universe.
- Methods to PREDICT and ESTIMATE in given/measurable parameters
what is parameter?
a quantity of interest: i.e number of viruses, volume of water…etc
what does stats allow us to measure?
- often we cant predict what we dont know even if we are given some known facts
- stats measures uncertainty in an estimate of a real value in a parameter population
- uncertainty = probability of the estimate we obtrained with data/sample which truly reflects the actual value
- probability of truth
what is data
measurement of a variable in a sample or census
what are the types of data/variables
- categorical data (not numbers): nominal variable (qualitative + no order); ordinal variable (qualitative + order)
- numerical data (numbers/quantitative): discrete variables, continuous variables
what is a variable
measurement/characteristic of interest in a population/census
what are the types of variables in experiment
explanatory (independednt - on X axis), response (dependent variable - on Y axis)
what is observational and experimental studies?
- observational studies: cause effect not yet defined. first observe and record variables of interest: then measure and correlate/associate.
- experimental studies: established treatment and control groups, measured and test hypothesis of a cause and effect. random samples
pros and cons of observational vs experimental studies
- Obersvational
- pro: reflects actual present event, measures many variables simultaneously
- con: cannot establish causation
- Experimental
- pros: controls variables of interest to establish causation; limits other variabilities controls other factors
- cons: does not reflect actual present/natural setting
what is statistical inference?
aim of stats is to use data from a subset/sample of population to infer truth (characteristics or parameters) about the population/census
population vs sample
- population: entire collection of units that we want to research a parameter(s) about
- sample: subset of units from population which estimates a population parameter
whart is sampling error?
- the deviation of an estimate from its truth in its population parameter
- the fact that the estimate is different to the truth
- estimates based on samples are rarely exactly equal to the true population values, bcuz a sample does not capture every member of the population
- precision is DIRECTLY RELATED to sampling error
- Accuracy/bias is NOT related to sampling error, it is related to systematic error (error with sampling method)
what is relationship of sampling error and standard error
- sampling error measures deviation from truth, which in related to standard error
- The standard error measures the variability (or standard deviation) of a sample statistic from sample to sample and provides an estimate of the sampling error.
- For example, the standard error of the mean estimates how much the sample mean is expected to vary from the true population mean due to sampling error.
what is precision and accuracy
- precision is related to sampling error and standard eror, and is the SPREAD of estiates from sample, DUE TO sampling error
- Accuracy: systematic error/BIAS. Something wrong w method, ccausing estimate to not reflect population. DUE TO BIAS
how to reduce bias/increase accuracy?
- random sampling
- placebo
- standardize methods
- inaccuracy can be due to chance and small sample size (not neccessarily bias, but results can seem biased), hence increasing sample size will increase accuracy BUT NOT REDUCE BIAS
how to increase precision/reduce standard error/sampling error
- larger sample size
- more sample trials
- smaller deviation and less standard error
what is random sample vs sample convenience. Why sample random?
- random sample: INDEPENDENT selection of units, each unit have equal chance of being selected
- sample convenience: volunteers, by opportunity
- random sample REDUCES bias
what is frequency distibution?
- histogram: records actual data from sample
- discrete + continuous
- different to PDF as it doesn’t predict, but simply used to RECORD
what is probability density function?
- a maths model/function which estimates/predicts probability of a random variable being a certain value in the population
- used for continuous random variables -> hence probability of a specific data point occuring is 0
- estimated data from population
- distribution of probabilities which might occur
- models e.g: uniform distibution, normal distibution…etc