Review of 2300 Flashcards

(23 cards)

1
Q

Measures of Variability

A

It is often desirable to consider measures of variability (dispersion), as well as measures of location.
For example, in choosing supplier A or supplier B we might consider not only the average delivery time for each, but also the variability in delivery time for each.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Measures of Variability

A

Range
Variance
Standard Deviation
Coefficient of the variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Range

A

The range of a data set is the difference between the largest and smallest data values

It is the simplest measure of variability.
It is very sensitive to the smallest and largest data values.
Range = largest value - smallest value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Variance

A

The variance is a measure of variability that utilizes all the data.
It is based on the difference between the value of each observation (xi) and the mean ( x bar for a sample, mu for a population).
The variance is the average of the squared
differences between each data value and the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Standard Deviation

A

The standard deviation of a data set is the positive square root of the variance.

It is measured in the same units as the data, making it more easily interpreted than the variance.
In statistics and probability theory, standard deviation (represented by the symbol sigma, σ) shows how much variation or dispersion exists from the average (mean), or expected value. A low standard deviation indicates that the data points tend to be very close to the mean; high standard deviation indicates that the data points are spread out over a large range of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Relative Frequency

Relative Frequency Distribution

A

The relative frequency of a class is the fraction or proportion of the total number of data items belonging to the class.

A relative frequency distribution is a tabular
summary of a set of data showing the relative frequency for each class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Percent Frequency

Percent Frequency Distribution

A
The percent frequency of a class is the relative
 frequency multiplied by 100

A percent frequency distribution is a tabular
summary of a set of data showing the percent
frequency for each class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Frequency Distribution

A

Guidelines for Selecting Width of Classes
Approximate Class Width =
Largest data value - Smallest data value / Number of classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Cumulative frequency distribution -

A

Cumulative frequency distribution - shows the number of items with values less than or equal to the upper limit of each class

Cumulative relative frequency distribution – shows the proportion of items with values less than or equal to the upper limit of each class

Cumulative percent frequency distribution – shows the percentage of items with values less than or equal to the upper limit of each class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Measures of Location

A

If the measures are computed for data from a sample,they are called sample statistics.

If the measures are computed for data from a population,they are called population parameters

A sample statistic is referred to as the point estimator of the corresponding population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mean

A

The mean of a data set is the average of all the data values.

The sample mean (xbar) is the point estimator of the population mean (mu).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Measures of variability

A

It is often desirable to consider measures of variability (dispersion), as well as measures of location.

For example, in choosing supplier A or supplier B we might consider not only the average delivery time for each, but also the variability in delivery time for each

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Forms of measures of variability

A
Range
Variance
Standard Deviation
Coefficient of variation
Interquartile range
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Range

A

The range of a data set is the difference between the largest and smallest data values.

It is the simplest measure of variability

It is very sensitive to the smallest and largest data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Variance

A

The variance is a measure of variability that utilizes all the data.

It is based on the difference between the value of each observation (xi) and the mean
(xbar for a sample,
mu for a population).

The variance is the average of the squared
differences between each data value and the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Standard Deviation

A

The standard deviation of a data set is the positive square root of the variance.

It is measured in the same units as the data, making it more easily interpreted than the variance.

17
Q

Coefficient of variation

A

The coefficient of variation indicates how large the standard deviation is in relation to the mean.

18
Q

Distribution Shape: Skewness

A

Symmetrical, Skewness = 0, median and mean are the same, perfect bell curve

Moderately skewed to the left
mean will be less than the median

Moderately skewed to the right
mean will be more than the median

19
Q

Z-Score

A

The z-score is often called the standardized value.

It denotes the number of standard deviations a data value xi is from the mean

An observation’s z-score is a measure of the relative location of the observation in a data set.
A data value less than the sample mean will have a z-score less than zero
A data value greater than the sample mean will have a z-score greater than zero
A data value equal to the sample mean will have a z-score of zero

20
Q

Empirical Rule

A

For data having a bell-shaped distribution:

  1. 26% of the values of a normal random variable are +/- 1 standard deviations of the mean
  2. 44T of the values of a normal random variable are +/- 2 standard deviations of the mean
  3. 72% of the values of a normal random variable are +.- 3 standard deviations of the mean
21
Q

Covariance

A

The covariance is a measure of the linear association between two variables.
Positive values indicate a positive relationship
Negative values indicate a negative relationship

22
Q

Correlation Coefficient

A

Correlation is a measure of linear association and not necessarily causation

Just because two variables are highly correlated, it does not mean that one variable is the cause of the other

The coefficient can take on values between -1 and +1
Values near -1 indicate a strong negative linear relationship
Values near +1 indicate a strong positive linear relationship

23
Q

Steps of Hypothesis Testing:

A

Step 1. Develop the null and alternative hypothesis
Step 2. specify the level of significance
Step 3. Collect the sample data and compute the value of the test statistic

p-value approach:
Step 4. Use the value of the test statistic to compute the p-value
Step 5. reject the null hypothesis if the p-value is less than or equal to sigma (the standard deviation)

Critical Value Approach:
Step 4. Use the level of significance to determine the critical value and the rejection rule
Step 5. Use the value of the test statistic and the rejection rule to determine whether to reject the null hypothesis.