Math IB Stats Flashcards

1
Q

Discrete data

A

Fixed to certain values; no gaps between data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Continuous data

A

Not fixed to certain values; can occupy a continuous range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Reliable data

A

If you can repeat the data and obtain similar results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sufficient data

A

When there is enough data to support your conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Population

A

The entire group that you want to draw conclusions about

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sample

A

Subset of population; group of individuals from the population that will give info about the population as a whole

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Sampling technique: convenience

A

Most easily accessible members of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sampling technique: simple random

A

Randomly choose members - equal chance for everybody

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sampling technique: systematic

A

Pick at a fixed interval — eg every 6th person

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sampling technique: stratified

A

Divide group into groups (stratas) based on shared characteristics, then sample from the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Sampling technique: quota

A

Stratified sampling, but sample from each stratum is proportional and to the size of each stratum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Bar chart

A

For discrete data; has gaps in between the bars

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Histogram

A

No gaps in between bars; for continuous data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skew

A

Where the majority of the data is located (shape)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Histogram skew

A

Left (-): most data is on right side - left tail
Normal: equally distributed
Right (+): most data is on left side - left tail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Mode

A

Value that occurs the most

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Modal

A

For grouped data — can’t find mode so we would say the modal range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Bimodal; no mode

A

Bimodal - 2 modes in set of data
No mode - all numbers appear only once

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Mean

A

The average

20
Q

Median

A

The middle data value when data set is arranged in order of size (if even data set — median is avg of two middle numbers)

21
Q

Range

22
Q

Quartiles

A

Divides data into quarters
- 1st: 25% of data below it
- 2nd: the median and has 50% of data below and above
-3rd: 75% below

23
Q

Interquartile range (IQR)

A

Difference between Q3 and Q1

24
Q

Lower quartile

25
Middle quartile
Q2
26
Upper quartile
Q3
27
Outlier for boxplot
Outliers are 1.5xIQR above Q3 or below Q1
28
Cumulative frequency
The sum of all previous frequencies up to the current point
29
Percentile
A value below which a certain percentage of observations lie
30
Percentile rank
Calculate by dividing #of values below ___ by total # of values
31
Variance
How far a data point is spread from the mean (sigma squared)
32
Standard deviation
Square root of variance
33
Bivariate data
Study of relationships between to sets of data
34
Correlation
When change in x corresponds to change in y
35
Causation
When one event is the result of the a occurrence of another event
36
Pearson product moment correlation coefficient (r)
Measure of the correlation strength between two variables. Between -1 and 1 ( can equal)
37
R value
0 is weakest. 1 is strongest. (-) values mean there is a negative correlation.
38
Line of best fit
Straight line drawn though the center of a group of points plotted on a scatter diagram
39
Interpolation
Predictions inside the domain your data points are in
40
Extrapolation
Predictions outside the domain of your data
41
Draw line of best fit
Find mean point which line will go through. Equal number of points above and below line.
42
Residual
The vertical distance between data pints and a graph of a regression line
43
Least square regression line
Has the smallest possible value for the sum of squares of the residual
44
Regression line y on x
Y=ax+b A = change on y for each change in x B = y int
45
Binomial distribution elements:
- fixed number of trials - only two outcomes, success or failure - constant probability each trial - trials are independent
46
Probability - with replacement
Elements in sample space remains unchanged (e.g if you pull a card out of a deck, you put it back)
47
Probability - without replacement
Items are not returned to the sample space (e.g if you pull a card out from a deck, you leave it out, changing probabilities for next time)