Representations Of Data Flashcards

(6 cards)

1
Q

Define an outlier:

A

An extreme value that lies outside the overall pattern of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Give 2 numerical definitions of an outlier:

A

Either greater than Q3 + k(Q3-Q1).

Or less than Q1 - k(Q3-Q1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define an anomaly:

A

An outlier that is clearly an error so should be removed from the data (also known as cleaning the data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do the 5 vertical lines in a box plot represent (starting from left to right)?

A

1) Lowest value that isn’t an outlier (boundary for outliers).
2) Lower quartile.
3) Median.
4) Upper quartile.
5) Highest value that isn’t an outlier.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Where do the 4 horizontal lines go on a box plot (starting from left to right)?

A

First line connects the midpoint of first 2 vertical lines.
Next 2 lines connect top and bottom of 2nd, 3rd and 4th vertical lines, creating a rectangle with a vertical line in it.
Last line connects midpoint of 4th and 5th vertical lines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What do the crosses represent on a box plot and where do they go?

A

Represent outliers.

They either go to the left of first vertical line or to the right of last vertical line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly