Unit6Vocabulary Flashcards by Norman Sooy

Skewed Left

Also known as negatively skewed, the bulk of the data items are clustered on the positive end of a graph with the long tail to the left.

How well did you know this?

Not at all

Perfectly

Mean

The average value of all the data in a dataset. Calculated by adding up the values of all data items and then dividing by the number of items in the dataset.

How well did you know this?

Not at all

Perfectly

z-score

A value indicating the number of standard deviations a data item is from the mean of its dataset.

How well did you know this?

Not at all

Perfectly

Box and Whisker Plot

A graphical representation of the five number summary.

How well did you know this?

Not at all

Perfectly

Upper Quartile

The median of the upper half of a dataset.

How well did you know this?

Not at all

Perfectly

Bivariate

Two datasets used to measure correlation.

How well did you know this?

Not at all

Perfectly

Strong Positive Correlation

Indicated by a correlation coefficient as defined below:

{ r | 0.7 < r < 1 }

How well did you know this?

Not at all

Perfectly

Weak Negative Correlation

Indicated by a correlation coefficient as defined below:

{ r | -0.1 < r < -0.3 }

How well did you know this?

Not at all

Perfectly

Weak Positive Correlation

Indicated by a correlation coefficient as defined below:

{ r | 0.1 < 0.3 }

How well did you know this?

Not at all

Perfectly

Maximum

The largest data value in a dataset.

How well did you know this?

Not at all

Perfectly

Neutral Positive Correlation

Indicated by a correlation coefficient as defined below:

{ r | 0.4 < r < 0.6 }

How well did you know this?

Not at all

Perfectly

Correlation Coefficient

A statistical measure of how linear a bivariate dataset is. Typically represented with a lowercase r:

{ r | -1 < r < 1 }

How well did you know this?

Not at all

Perfectly

Lower Quartile

The median value of the lower half of a dataset.

How well did you know this?

Not at all

Perfectly

Skewed Right

Also known as positively skewed, the bulk of the data items are clustered on the negative end of a graph with the long tail to the right.

How well did you know this?

Not at all

Perfectly

Histogram

A graphical representation of the clustering of a dataset based on a specified bin width and the number of data items within each bin.

How well did you know this?

Not at all

Perfectly

Bell Curve

Study These Flashcards

A graphical representation of the spread of a normal dataset indicating 1, 2, and 3 standard deviations from mean.

Median

Study These Flashcards

The middle data item in a dataset. When the number of items is even, the median is calculated by taking the middle 2 terms and averaging them.

Standard Deviation

Study These Flashcards

A statistical measure of the average distance the data items within a dataset are from the mean.

Strong Negative Correlation

Study These Flashcards

Indicated by a correlation coefficient as defined below:

{ r | -0.7 < r < -1 }

Causation

Study These Flashcards

In a bivariate data analysis, high correlation is often cited as an indication of a causal relationship. Causation is when it is proven that one thing causes a change in another thing. Correlation does not imply causation.

No Correlation

Study These Flashcards

Indicated by a correlation coefficient near or equal to zero.

Five Number Summary

Study These Flashcards

A measure of a dataset’s spread and distribution accomplished by partitioning the data into quarters: