Topic 4: Normal Model Flashcards

1
Q

What is normal model?

A

A model used for data that is fairly symmetric and used to capture the general trend.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why do we use normal model?

A
  • Normal model represents many natural phenomenon
  • Models data caused by different factors (Central Limit Theorem)
  • Defined by mean and SD (easy to use)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How many types of normal models are there? Describe them

A
  • Standard normal model: N(0,1)
  • General normal model: N(mean, SD^2)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When is normal model used?

A

A histogram that has quite a bell shape.

Diagnostics:
- Does the histogram look normal? (no long tails or many outliers)
- Do the proportions look right? (or does it fit into the 68/95/99.7% rule?)
- Does the quantile-quantile (QQ) plot look linear?
- Does the shapiro test have low value? (<0.05: not fit)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What commands are used to determine the area under the curve or the threshold of that area?

A

pnorm(x) (cumulative distribution function): used to calculate the area(%) of the area under the curve (lower tail)
pnorm(x); pnorm(x, lower.tail=F); pnorm(x,mean,SD)
x is the threshold

qnorm(y) (quantile function): determine the threshold of the area
qnorm(y,mean,SD)
y is the area/the percentile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to rescale general normal model to standard normal model?

A

Calculate the standard units (Z= (data points - mean)/SD)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is individual measurement different from exact value?

A

Individual measurement = exact value + chance error + bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How can we estimate chance error?

A

Chance error happens because measured value turns out differently every time.
Chance error can be estimated by replicating the measurement to calculate the SD.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is bias?

A

Bias is a systamic error that is in a constant amount added or subtracted from the measurement.

Bias cannot be estimated by replicating the measurement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the area under the normal curve approximate?

A

The area under the normal curve approximates that section of the histogram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is reproducible research and why is it important?

A

Reproducible research is a research that makes data sets and software used available for verifying published findings and alternative analyse.

Without reproducible research, data versions and graphical summaries can change.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly