Biomedical Informatics Flashcards

1
Q

Q:Define electronic health record (EHR)

A

A:Digital longitudinal repository of patient clinical demographic and administrative data intended for secure real time access and decision support

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Q:List four structural models of EHRs

A

A:Integrated Source oriented Problem oriented Protocol oriented

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Q:Name two major clinical coding systems

A

A:ICD ten and SNOMED

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Q:Give three clinical advantages of moving from paper to EHRs

A

A:Simultaneous access Automated decision support Easier data audit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Q:State one major security risk introduced by EHRs

A

A:Large scale data breach potential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Q:What physiological waveform captures cardiac electrical activity

A

A:Electrocardiogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Q:Why are window functions applied before an FFT

A

A:To taper edges of a time block and reduce spectral leakage caused by discontinuities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Q:Write the logistic regression logit equation

A

A:Log odds equals beta naught plus sum beta i times x i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Q:Interpret exp beta in logistic regression

A

A:It is the odds ratio for a one unit increase in the predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Q:Define survival function S t

A

A:Probability that event time exceeds t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Q:Express hazard function h t in terms of f t and S t

A

A:h t equals f t divided by S t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Q:What assumption underlies the Cox model

A

A:Proportional hazards constant hazard ratio over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Q:Give the Kaplan Meier estimator formula

A

A:S t equals product over event times of one minus d i divided by n i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Q:Define sensitivity

A

A:True positives divided by true positives plus false negatives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Q:Define specificity

A

A:True negatives divided by true negatives plus false positives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Q:Explain AUROC

A

A:Probability that a random positive gets higher score than a random negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Q:What is the curse of dimensionality

A

A:Exponential data requirement growth as feature count increases causing sparse coverage

18
Q

Q:Describe mean imputation

A

A:Replacing missing entries with the variable mean so values become centered but variance is reduced

19
Q

Q:Give one disadvantage of mean imputation

A

A:Bias toward normal values and understatement of uncertainty

20
Q

Q:Explain k fold cross validation

A

A:Partition data into k equal folds iterate training on k minus one folds validate on the remaining fold average performance

21
Q

Q:Define autoregressive AR p model

A

A:Current value equals weighted sum of previous p values plus noise

22
Q

Q:Why is AIC used with AR models

A

A:To select order p by balancing goodness of fit and model complexity

23
Q

Q:How can AR coefficients give spectral information

A

A:Transfer function poles derived from coefficients determine power spectral density

24
Q

Q:State one limitation of AR models in ICU data

A

A:Assume stationarity within the modeled window

25
Q:What is a Gaussian process
A:Distribution over functions defined by mean and covariance kernels providing predictive mean and variance at any input
26
Q:Give two benefits of GPs for clinical time series
A:Handle irregular sampling and provide uncertainty estimates
27
Q:State one drawback of Gaussian processes
A:High computational cost scaling cubically with number of observations
28
Q:What is precision in classification
A:True positives divided by true positives plus false positives
29
Q:Why prefer precision recall curve for rare events
A:It focuses on positive class performance without being dominated by abundant negatives
30
Q:Give three variables in SAPS II score
A:Age Glasgow coma scale Systolic blood pressure
31
Q:Define qSOFA threshold for respiratory rate
A:Twenty two breaths per minute or higher
32
Q:What are type I and type II errors
A:False positive and false negative respectively
33
Q:Explain odds vs probability for small p
A:For rare events odds approximate probability while they diverge as probability increases
34
Q:State the formula for positive predictive value
A:True positives divided by true positives plus false positives
35
Q:List two wrapper feature selection strategies
A:Forward stepwise and backward elimination
36
Q:Define mutual information in context of feature relevance
A:Measure of shared uncertainty reduction between feature and target variable
37
Q:What is leave one out cross validation
A:Special case of k fold where k equals sample size with each iteration holding out one case
38
Q:Give one advantage of LSTM over AR for ICU signals
A:Can learn nonlinear long range temporal dependencies without explicit stationarity assumption
39
Q:Name two hyperparameters in a squared exponential GP kernel
A:Length scale and signal variance
40
Q:Describe spectral leakage
A:Energy spreading to unintended frequency bins because of noninteger cycles in FFT window