Statistics Flashcards

Question

Randomization

Answer 1

Assignment occurs by chance

Answer 2

X axis: 1 - specificity, or the false - positive rate Y axis: Sensitivity

Answer 3

Optimal Cut-off point for the respective test. In general, the point closest to the upper-left corner, where sensitivity is highest and the false-positive rate is lowest, is chosen as the cut-off.

Answer 4

To calculate the diagnostic accuracy (best sensitivity and specificity) of the test, that is the probability of correctly identifying disease based on the result of the test. The larger the area under the curve, the better the test.

Answer 5

Used for reliability studies, eg to assess inter-rater reliability or intra-eater reliability. Used in assessing the degree to which two or more raters, examine the same data, agree when it comes to assigning the data to categories.

Answer 6

Occurs when one factor modifies the effect on outcome of another.

Answer 7

Occurs when the association between two variables is distorted by the fact that both are associated with a third. Eg. The association between coffee and lung cancer is distorted by smoking

Answer 8

CV = SD/X x 100% 1. Used for compare the relative spread of data for 2 variables (eg. Height and weight) 2. Used to evaluate precision of the measurement of a single variable (x-ray film reading by two physicians)

Answer 9

For continuous variables

Answer 10

For categorical data

Answer 11

For association

Answer 12

Simple random Systematic random Stratified random Cluster random

Answer 13

Every unit in the population had the same probability of being selected, chance alone determines whether a particular unit in the population is selected for the sample

Answer 14

Every k th member is selected from the population

Answer 15

- Population is divided into heterogeneous groups (strata) (eg. black, white, Hispanic, Asia) and a random sample is taken from within each group - Ensures equal numbers of each strata in final sample.

Answer 16

Population is divided into homogenous group (cluster) and a random sample of these groups is taken. eg a school, a community, etc

Answer 17

Z = (X - U)/sigma Any normal distribution can be transformed to the standard normal to get a Z score for a given value X

Answer 18

Paired t-test

Answer 19

To compare the sample mean with the mean of the population

Answer 20

To compare the mean of two groups

Answer 21

To compare the mean of before and after

Answer 22

Used for more than two groups

Answer 23

Compare two proportions

Answer 24

Is used if expected count on a cell is less than 5

Answer 25

For paired proportions

Answer 26

Pearson's correction coefficient

Answer 27

% of variation in Y explained by X

Answer 28

Dependent variable is continuous | One independent variable

Answer 29

Dependent variable is continuous | More than one independent variables

Answer 30

Dependent variable is dichotomous | OR is used for estimation

Answer 31

Time to the event | Hazard rate is use for estimation

Answer 32

Collinearity is a linear relationship between two explanatory variables. Collinearity can result in unstable beta coefficient estimates.

Answer 33

A graph designed to check for the existence of publication bias in systematic reviews and meta-analyses

Answer 34

In general, p should be small , 15

Answer 35

Reject H0 when it is true.

Answer 36

Accept H0 when it is actually false.

Answer 37

For parametric data, using - Paired t test ( pre and post, paired), For non-parametric data, using -Wilcoxon's signed rank test

Answer 38

For parametric data, using - Student t test For non-parametric data, using -Wilcoxon's rank sum test (also termed Mann-Whitney U test.

Answer 39

For parametric data, using - Chi-square For non-parametric data, using - Fisher exact probability test - used when at least 1 cell in a contingency table has an expected count s Chi-square test for paired proportion.

Answer 40

For parametric data, using - ANOVA For non-parametric data, using - Kruskal-Wallis test

Answer 41

For parametric data, using - Pearson's correlation For non-parametric data, using - Spearman's correlation Multiple regression - more than one independent variable s

Answer 42

1. Kaplan-Meier analysis 2. Cox proportional Hazard Regression - a combination of multiple logistic regression techniques with survival methods

Answer 43

Logistic regression

Answer 44

How scattered the data is.

Answer 45

Precision of the mean. | How precise the data is.

Statistics Flashcards

(69 cards)