Midterm 1 Material Flashcards

(32 cards)

1
Q

What is the mean?

A
  • Mean or average
  • We calculate the average by adding all the numbers together and dividing it by the number of values.
  • It can be represented as ȳ = mean of a sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does “n” signify?

A

n signifies sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does yi rep?

A

yi represents one observation in the sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the median?

A

Median is the midpoint of the ordered sample

  • Median = (N + 1) / 2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is mode?

A

Value that occurs most frequently in the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is distribution?

A

Distribution is just a way of organizing and showing data. Think of it as a list or picture that shows where all the data points are and how often they appear.
- It’s like grouping similar data points together to see patterns.

  • Unimodal (having one mode)

-Bimodal (having 2 modes)

-Multimodal (having 3 modes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a deviation?

A
  • Difference between an observation and the mean (i.e. how far an observation ‘falls’ from the mean of the population or the sample)

To calculate deviation;
(yi)= yi - ȳ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a standard deviation

A

Typical (average) deviation from the
mean for an observation in the set
(population or sample).
- σ means population standard
deviation (‘sigma’)

To calculate population standard deviation:

σ = √[Σ(yi – μ)2 / (N – 1)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Empirical Rule

A

If the distribution is normal (symmetrical) bell-shaped

  • 68% of observations fall between the mean and one standard deviation on either side.
  • 95% of observations fall between the mean and two standard deviations on either side.
  • Over 99% of observations fall between the mean and three standard deviations on either side.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Quartile

A

Measure of data dispersion breaking down the distribution in four ordered segments.

  • When ordered in ascendance, the first 25% of the data distribution comprise the lower quartile, whereas the first 75% of the distribution comprise the upper quartile. Quartiles are typically ordered as
    follows:
  • Min: 0% of the data
  • Q1: First 25% of the data
  • Med: First 50% of the data
  • Q3: First 75% of the data
  • Max: Fully 100% of the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Data

A

Factual information about things that
we observe

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Population

A

Total set of subjects of interest in a
study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sample

A

Subset of the population on which the
study collects data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Parameter

A

Numerical summary of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Statistic

A

Numerical summary of the sample data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Descriptive statistics:

A

Statistics summarizing (outlining)
sample or population data

17
Q

Inferential statistics:

A

Statistics making predictions about population parameters based on sample
data

18
Q

Variable

A

Characteristic that can vary in value
among subjects in a sample or a population.

19
Q

Discrete variable:

A

Variable taking the form of a set of
separate numbers, such as 0, 1, 2, 3.

20
Q

Continuous variable

A

Variable that can take an infinite continuum of real number values

21
Q

Random sampling

A

Drawing a sample of n subjects who each
have the same probability of being drawn

22
Q

What are the three main ways data can be biased; sampling error?

A

Sample bias
*Response bias (e.g. under /over-reporting)
*Non-response bias

23
Q

Sampling bias

A

occurs from using nonprobability samples, such as the selection
bias inherent in volunteer samples.

24
Q

Response bias

A

occurs when the subject gives an incorrect response (perhaps
lying), or the question wording or the way the interviewer asks the questions is
confusing or misleading

25
Nonresponse bias
occurs when some sampled subjects cannot be reached or refuse to participate or fail to answer some questions.
26
What is the z score?
It refers to the number of of standard deviations that an observation can fall from the mean.
27
What is Population mean?
the average of the observations for the entire population.
28
What is the population standard of deviation?
The population standard deviation describes the variability of those observations about the population mean
29
sampling distribution
enables us to predict how close a sample mean falls to the population mean
30
Probability
that outcome is the proportion of times that the outcome would occur in a very long sequence of observations.
31
The higher the confidence interval
the larger our confidence interval (margin of error) will be
32
The lower our confidence level
the smaller our confidence interval (margin of error) will be (i.e. the more precise our estimation will be).