HRSS SEM 2 Flashcards

(46 cards)

1
Q

Cross-sectional study design

A

Observational or descriptive
Collects data from a population at 1 specific point in time
Groups determined by existing differences, not random allocation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Advantages of cross-sectional study design (5)

A
  • Snapshot of a population at one time
  • Can draw inferences from existing relationships or differences
  • Large numbers of subjects
  • Relatively inexpensive
  • Can generate odds ratio, absolute/relative risk and prevalence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Disadvantages of cross-sectional study designs

A
  • Results are static: no sequence of events
  • Doesn’t randomly sample
  • Can’t establish cause and effect relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Pearson’s correlation co-efficient

A

Measures linear relationship between 2 variables
P=0 suggests no linear relationship
Coefficients offer crude linear association and are unable to adjust for other variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Regression modelling

A

Investigates if an association exists between variables of interest
Measures strength and direction of association between variables
Studies the form of relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How are continuous linear relationships examined

A

By linear or non-linear regression models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How are categorial outcomes examined

A

By logistic regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe Linear Regression

A

Ho = no relationship between DV and IV

IV and DV must be continuous; IV can be continuous or categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Assumptions for linear regression

A

Linear relationship between DV and iV
Observation independently and randomly selected
Effects are additive
Homogeneity of variances
Residuals are independent + normally distributed
Absence of outliers and multi-collinearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe skewness and kurtosis, mean/median/mode for a normally distributed variable

A

both = 0
- the further the value is from 0 the more likely it is that the variable isn’t normally distributed
mean median and mode should be equal for a normally distributed variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Tests of normality

A

Tests of normality (Shapiro-Wilk) compare the shape of the sample distribution to the shape of a normally distributed curve

  • non significant tests suggest distribution of sample isn’t sig different from normal distribution
  • significant tests suggest distribution in question is sig different from a normal distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Multicollinearity

A

Refers to IV’s that are correlated with other IV’s

In presence of multi-collinearity, regression models may not give valid estimates of individual predictors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Variance inflation factor (VIF)

A

Measure of how much the variance of the estimated regression coefficient is inflated by the existence of correlation among IV’s in the model
VIF = 1 : no correlation among predictors
VIF >4 : warrants further investigation
VIF > 10: signs of serious multicollinearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In a simple linear regression model if B > 0 …

A

Positive association between IV and DV

For each unit increase in IV, the DV would increase by (B) value units.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Building a regression model

A

If IV associated with outcome (DV) and no affected by multi-collinearity, then can build multivariable multiple linear regression model for DV
Fitted regression model presents regression coefficients representing adjusted associations between DV and IV; adjusted for each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Interpreting a regression model

A

For each unit increase in IV, the estimated DV unit would increase by (B) value, after adjusting for other IV’s
May change association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

R^2

A

% of variability explained by fitted model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Observational studies

A

Subjects observed in natural state; can be measure and tested by no intervention or treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Cohort longitudinal studies

A

Population of subjects identified by common link
Researcher can follow across time to see what happens
Useful for establishing natural Hx of a condition
Cohort can be divided at outset into subgroups of people whose experience is to be compared
Can identify those most likely to develop outcome

20
Q

Internal comparison

A

1 cohort involved study

Sub classified and internal comparison done

21
Q

External comparison

A

> 1 cohort in study for purpose of comparison

22
Q

Strengths of cohort studies

A

Can give incidence rate and risk
Cause-Effect
Good when exposure is rare
Minimises selection and information bias

23
Q

Weaknesses

A

Loss to follow up
Requires large sample
Ineffective for rare disease
Time-consuming and expensive

24
Q

Chi Square Test (+ assumption)

A

Determines if there is a significant relationship between 2 categorial variables
Assumption: expected frequency in each cell >5

25
Fisher's exact test
Used if Chi Square Test doesn't meet assumption (i.e. cells have frequency < 5)
26
Independent samples t-test
Used to compare means of a normally distributed continuous variable for 2 groups
27
Odds Ratio
Measure of strength of association between exposure and outcome. Represents odds an outcome will occur given a particular exposure; compared to outcome occurring in absence of exposure OR = 1: exposure doesn't affect odds of outcome OR = >1 : exposure associated with higher odds of outcome OR = <1 : exposure associated with lower odds of outcome
28
Confidence Intervals and OR
If CI of OR contains 1 (null value of OR) the relationship is likely to be insignificant If CI of OR doesn't contain 1 the relationship is likely to be significant
29
Logistic Regression
A regression w an outcome variable that is categorical and IV that can be a mix of continuous/categorical Predicts which of the possible events are going to happen given certain other information on IV Identifies factors that determine whether an individual is likely to benefit from a certain type of rehab program/outcome
30
Logistic Regression Assumptions
Ration of cases to variables (enough responses in a given category) Linearity in the logic (regression equation should have a linear relationship w the logic form of the outcome) Absence of multicollinearity and outliers Independence of results
31
Logistic Regression Models
Dichotomous outcome - Binary LR Polychromous outcome - multinomial LR Ordered outcome - ordinal logistic regression
32
What test to examine relationship between outcome variable and DV
Chi Square test Fishers test T-Test
33
LR - Omnibus Test
Asks if null model is an improvement | If p<0.05 then fitted model (new model) as a whole fits significantly better than a null model w/o a predictor
34
LR model summary
- 2 log likelihood suggests new model > null model as in a better fit - Nagelkerke R^2 value shows how much variation is explained by new model
35
Goodness of fit test
Used to examine if estimated LR model fits sample data P<0.05 = poor fit p>0.05 = good fit
36
RCT
Individuals allocated at random to receive one of number of interventions (min 2) = chance of allocation to each intervention NOT determined by researched, NOT predictable
37
Random allocation
Eliminates bias Allows researchers to make casual inferences - randomisation ensures any difference between groups is due to chance Covariates are distributed across groups at baseline = unbiased distribution of confounders
38
Sources of bias in RCT
Selection bias - inadequate concealment of allocation/incorrect generation of randomisation sequence Performance bias - inadequate blinding/masking Detection bias Attrition bias Reporting bias
39
How are sample and effect size related
Sample size inversely associated with effect size
40
Intention to treat analysis
Compares treatment groups as originally allocated, irrespective of whether patients received or adhered to treatment protocol Promotes external validity
41
Per protocol analysis
Compares treatment groups as originally allocated but includes only those patients who completed treatment protocol which compromises internal validity
42
Longitudinal Data Analysis
Assesses change in response variable over time, measures temporal patterns of response to Rx, identifies factors that influence change 1. Mixed effects model - compares individual change over time 2. Marginal mode (GEE) - compares population over time, evaluates interventions/informs public policy
43
Generalised Estimating Equations (GEE)
GEE an extension of general linear model of statistical regression for modelling clustered/correlated data Offers robust estimate of standard errors to allow for clustering of observations Produces consistent estimates of regression coefficients and their standard errors Can deal with normal and non-normal outcome data If GEE is significant, two interventions are significantly different
44
Assumptions of GEE
Assumptions of general linear model Responses from known family of distribution with specified mean and variance where variance is function of mean Mean is a linear function of predictors A correlation structure for the responses must be specified Any missing data are either missing completely at random or data is missing at random
45
Sources of variation
In LDA the unexplained variation is divided into components, making the ultimate error variance smaller LDA decreases unexplained variability in response - provides better estimates of effect
46
Post estimation distribution of residuals
Insignificant p value supports normality of residuals, no significant difference between residuals