Week 5 Flashcards
(51 cards)
Why do we need to understand advanced statistics in EBP?
- To effectively implement EBP and enhance patient care
- Statistical proficiency supports informed clinical decision-making by distinguishing between random variations and meaningful patterns
- It enables professionals to critically evaluate research, ensuring high-quality evidence informs practice
- Understanding statistics also enhances patient safety, treatment effectiveness and cost-efficiency by guiding the assessment of interventions
- It aids in quality assessment and public health planning, helping professionals generalize research findings to broad populations
- Statistical literacy is essential for integrating new technologies and analytical methods into practice
- It fosters professional growth and critical thinking, empowering healthcare professionals to communicate EBP insights to patients, enhancing shared decision-making
Advanced statistical knowledge is fundamental for delivering high-quality, effective and patient-centred health care while driving innovation and improvements
Key statistical tools
Regression analysis, Meta-Analysis, Forest Plots, Funnel Plots,
What is meta-analysis
Meta-analysis:
- A statistical method that combines results from multiple studies to produce an overall finding
- Often accompany systematic reviews to increase statistical power by pooling data
It matters as rather than relying on one study meta-analysis gives a more robust estimate of an effect
What is regression analysis
- A family of techniques for examining relationships between an outcome (dependent variable) and one or more predictors (independent variables).
- e.g. regression can tell us how strongly patients therapy intensity predicts their recovery outcomes
It matters as regression is widely used in health research to adjust for confounders and identify significant predictors
- e.g. regression can tell us how strongly patients therapy intensity predicts their recovery outcomes
What are forest plots
- A graphical representation of a meta-analysis
- Each study is shown as a point estimate (usually square) with a horizontal line for its confidence interval and an overall summary effect is shown as a diamond
It matters as they quickly let us see individual study results, their variability and the combined outcome in a single picture
- Each study is shown as a point estimate (usually square) with a horizontal line for its confidence interval and an overall summary effect is shown as a diamond
What are funnel plots
- A scatterplot used to check for publication bias in meta-analyses
- It plots each study’s effect size against its precision
It matters as an asymmetrical funnel plot may indicate smaller or negative studies are missing (possible bias) whereas a symmetrical funnel suggest a lower risk of publication bias
- It plots each study’s effect size against its precision
Statistical significance
Statistical significance:
- A result unlikely to be due to chance along, typically determined by a p-value below a threshold (e.g. <0.05)
- This tells us an effect or difference probably exists in the sample data
- e.g. if a study reports that a new speech therapy technique improved language scores p=0.03, it means there’s only a 3% probability that this improvement was due to chance
Statistical significance reflects the influence of chance on the outcome
Clinical significance
- The real-world importance or size of the effect
- It asks: is the magnitude of the chance big enough to matter for patients or practice?
- A result can be statistically significant yet so small it has little practical impact e.g. a trial finds a statistically significant difference in ROM after a new physiotherapy technique (p=0.047), but the actual improvement is only 1.1 degrees.
- Clinically 1.1 degrees might not justify changing practice or investing in the new technique
Always consider whether an effect would make a noticeable improvement in patient outcomes or decisions
What is regression analysis
- Powerful for quantifying relationships between variables, we have:
- Dependant variable (outcome): The health outcome or measure we want to predict or explain (e.g. patient’s improvement score, probability of hospital readmission)
Independent variable (s) (predictors): Factors that might influence the outcome (e.g. number of therapy sessions, patient age, baseline severity)
- Dependant variable (outcome): The health outcome or measure we want to predict or explain (e.g. patient’s improvement score, probability of hospital readmission)
What is linear regression
- Used when the outcome is continuous (e.g. pain scale, ROM)
- It fits in a straight line through the data
- Outcome = a+b*(predictor) + error
- The coefficient b tells us how much the outcome changes for a one-unit change in the predictor, holding other factors constant
e.g. linear regression might find that each additional physical therapy session per week reduces pain score by 0.5 points on a 10-point scale, if all else is equal (with a certain confidence in that estimate)
What is logistic regression
- Used when the outcome is binary (yes/no etc.)
- Instead of a straight line, it predicts the log-odds of the outcome
- Results are often expressed as odds ratios
- e.g. a logistic regression could show that using a particular splint makes patients 2x more likely to avoid surgery (odds ration = 2) after controlling for injury severity
- Tells us how predictors affect the odds of an outcome
What is multiple regression
- Thes means regression with more than one predictor
- In practice health outcomes are rarely caused by a single factor, so we include multiple variables in the model (e.g. both therapy sessions and patient age and initial status predicting recovery)
- Allows us to adjust for confounders - those extra variables that might also influence the outcome
- By adjusting for confounders, we isolate the effect of the main predictor of interest
e.g. we might find therapy intensity predicts better mobility outcomes even after adjusting for patient age and baseline mobility
What is a note for multiple regression
Note: adjusting for confounders means including factors in the model that could distort the main relationship. e.g. older patients tend to have few therapy sessions and also slower recovery. Age is a confounder for the effect of sessions on recovery. A multiple regression can adjust for age so that we can see the true contribution of therapy sessions on recovery independent of age
How is regression used in allied health
- Interested in factors that predict rehabilitation outcomes after a stroke
- Collect data from 100 patients including baseline walking speed, age, number of therapy hours
- You run a multiple linear regression with walking speed improvement (m/s) as the outcome and the three factors as predictors
- Regression results: Therapy hours per week have a positive coefficient (more hours is better improvement), meaning it’s a significant predictor of improvement
- Baseline speed has a negative coefficient (patients who started better improve slightly less) and Age’s coefficient is near zero and not significant (age didn’t effect rehab improvement)
- Interpretation: Holding age and baseline status constant , each additional hour of therapy per week is associated with 0.05 m/s increase in walking speed after rehab which is statistically significant
- Baseline speed matters (those with very low initial speed had more to gain) while age did not show an effect
Real world example of regression
- A recent study on stroke rehabilitation used regression to predict patient outcomes
They found that initial impairment level and intensity of therapy were significant predictors of recovery, even after accounting for age and comorbidities. Helps clinicians identify which factors to focus on (e.g. ensuring patients get sufficient therapy intensity could improve outcomes)
What is a forest plot
- Graph displaying the results of multiple studies side-by-side usually as part of a meta-analysis
- Lets you instantly visualise the range of findings and the overall combined effect
Each individual study in a meta-analysis is represented by a line in a plot and the pooled result is at the bottom
- Lets you instantly visualise the range of findings and the overall combined effect
Forest plot: Study names
Each row corresponds to one study (often identified by the firs author or year) e.g. Smith et al., 2018
Forest plot: Effect size
- Each study’s result is shown as a square centred on its effect estimate (e.g. an odds ration or mean difference) with a horizontal line through it representing the CI (usually 95% CI).
- If the line crosses the vertical ‘no effect’ line, that study’s results is not statistically significant (at the 95% confidence level)
Large studies typically have narrower CI lines (more precise estimates) and smaller studies have a wider CI
- If the line crosses the vertical ‘no effect’ line, that study’s results is not statistically significant (at the 95% confidence level)
Forest plots: Weights
- The size of the square often indicates the weight of the study in the meta-analysis (larger squares = study contributed more to the overall result, usually because it had more participants or less variance
Thus a big trial might have a big square, a tiny pilot study a tiny square
Forest plot: Line of no effect
- A vertical line down the plot indicates the null effect (e.g. an odds ration of 1 or a mean difference of 0)
- If a study’s CI crosses this line, its result isn’t statistically significant on its own
The overall effect is significant if its summary diamond does not touch this line
- If a study’s CI crosses this line, its result isn’t statistically significant on its own
Forest plot: Overall summary (diamond)
- At the bottom a diamond shape represents the pooled result of all studies combines
- The centre of the diamond is the combined effect estimate, and its width is the confidence interval
If the diamond sits entirely to one side of the no-effect line (not crossing it), the meta-analysis indicates a statistically significant overall effect
- The centre of the diamond is the combined effect estimate, and its width is the confidence interval
Forest plot: Heterogeneity
- Often a corner of the plot or a footnote will report an I2 statistic and possibly a chi-square (Q) test for heterogeneity
- I2 (%) quantifies how much variability in results is due to differences between studies rather than chance
- e.g. I2= 0% means all studies found essentially the same effect (no heterogeneity) whereas I2= 75%, would indicate high variability in results between studies
- We interpret I2 roughly as 0-25% = 0-25% = -0.25 low heterogeneity, 50% = 50% moderate, 75%+ high heterogeneity
High heterogeneity suggests the studies results differ substantially which can affect how confident we are in the combined result
Guided interpretation
- Overall direction: are most of the study squares on one side of the line of no effect (if yes, that tells you the general trend of results)
- Significance of each study: Do any of the horizontal lines not cross the line of no effect (those studies have statistically significant findings on their own)
- Most influential study: Which study has the largest square (this study has the greatest weight. Perhaps it had the largest sample size. Its results will pull the combined results more strongly)
- Pooled result: Look at the diamond. Is it left or right of the line, or overlapping. What does it say about the overall effect and its significance
Heterogeneity: If an I2 value is given, is it low, moderate or high? (how consistent were the study results? If high, consider what might differ between studies e.g. different patient characteristics or protocols)
What is a funnel plot and why use it?
- When we conduct a meta-analysis, we rely on having all relevant studies
- But what if some studies were never published, especially those with negative or inconclusive results
- Publication bias can skew evidence - typically studies with positive findings are more likely to be published than those with null results
A funnel plot is a tool to detect such bias