Statistics Flashcards

1
Q

Nominal Data

A

Data classified into mutually exclusive categories with no intrinsic order. E.g., phone numbers, colors, types of plants.

2
Q

Ordinal Data

A

Ordered categories that imply a ranking, without equal spacing between ranks. E.g., letter grades, race placings (1st, 2nd, 3rd), best-voted restaurants.

3
Q

Interval Data

A

Ordered numerical data in which the difference between adjacent values is equal, but there is no true zero. E.g., temperature in Celsius or Fahrenheit, time of day, marks on a 1-100 scale.

4
Q

Ratio Data

A

Numerical data with equal distances between adjacent values and a true zero, so ratios are meaningful. E.g., temperature in Kelvin, height, age in years.

5
Q

Variable

A

A quantity that varies, or is capable of varying, in value. E.g., X, which takes the value 2 in one observation (X = 2).

6
Q

Quantitative variable

A

A variable in which the actual numerical value is meaningful. Represents an interval or ratio measurement.

7
Q

Qualitative variable

A

A variable in which the numerical value is not meaningful. Represents a nominal or ordinal measurement.

8
Q

Population

A

The total of some group. E.g., people on Earth, candles in a candelabra, ducks in a pond.

9
Q

Sample

A

A subset of a population. E.g., a few ducks from a pond, the melted candles in a candelabra.

10
Q

Descriptive statistics

A

Statistics that summarize the characteristics of the values in a population or a sample. E.g., a mean, median, or mode.

11
Q

Inferential Statistics

A

Statistics that use probability to estimate population characteristics: taking a sample and making inferences about the population it came from.

12
Q

Distribution

A

The overall shape of all observed data. How it looks when put into a histogram, density plot, scatter plot, etc.

13
Q

Range

A

The difference between the largest and the smallest value in a data set.
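
A minimal Python sketch of the range, using made-up values chosen only for illustration:

```python
# Made-up data set, for illustration only.
values = [4, 8, 15, 16, 23, 42]

# Range = largest value minus smallest value.
data_range = max(values) - min(values)
print(data_range)  # 38
```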

14
Q

Normal distribution / Gaussian distribution / Bell Curve

A

A symmetrical, bell-shaped distribution centered on the mean: half of the observations fall above the mean and half below, so the mean, median, and mode coincide.
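
The symmetry can be checked with Python's standard-library `statistics.NormalDist`. The mean of 100 and standard deviation of 15 below are assumed values picked for the example, not part of the definition:

```python
from statistics import NormalDist

# Hypothetical scores: mean 100, standard deviation 15 (assumed values).
scores = NormalDist(mu=100, sigma=15)

below_mean = scores.cdf(100)                      # share of values below the mean
within_one_sd = scores.cdf(115) - scores.cdf(85)  # share within one SD of the mean

print(below_mean)               # 0.5
print(round(within_one_sd, 3))  # 0.683
```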

15
Q

Asymmetrical distribution / skewed distribution

A

More observations fall on one side of the mean than the other. The distribution skews right when a long tail of large values lies above the mean, and left when a long tail of small values lies below it.

16
Q

Central Tendency

A

A single value that attempts to describe a data set by identifying the central position within that set of data. E.g., mean, median, and mode.

17
Q

Mean

A

The average of a distribution: the sum of the values divided by the number of values. E.g., (2 + 3 + 4 + 5)/4 = 3.5.
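
The same four values, worked as a short Python sketch with the standard-library `statistics` module:

```python
from statistics import mean

values = [2, 3, 4, 5]

# Mean = sum of the values divided by how many there are.
by_hand = sum(values) / len(values)
average = mean(values)

print(by_hand, average)  # 3.5 3.5
```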

18
Q

Median

A

The middle value of a ranked distribution. If there are two middle values, it would be the average of the two values.
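
A small Python sketch of both cases (odd and even number of values), using made-up numbers:

```python
from statistics import median

# Odd count: sorted values are 1, 5, 7, so the middle one is 5.
odd_median = median([7, 1, 5])

# Even count: sorted values are 1, 3, 5, 7, so the median is (3 + 5) / 2.
even_median = median([7, 1, 5, 3])

print(odd_median, even_median)  # 5 4.0
```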

19
Q

Mode

A

The most frequent number in a distribution.
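
A brief Python sketch with made-up numbers. `multimode` is included as an aside to show what happens when two values tie for most frequent:

```python
from statistics import mode, multimode

# 3 appears three times, more than any other value.
most_common = mode([1, 2, 2, 3, 3, 3])

# 1 and 2 both appear twice, so there are two modes.
tied = multimode([1, 1, 2, 2, 3])

print(most_common, tied)  # 3 [1, 2]
```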

20
Q

Inter-quartile range (IQR)

A

The difference between the 75th and 25th percentiles of a distribution, i.e. the 3/4 and 1/4 cutoff points.
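
A hedged Python sketch of the IQR on made-up data. Percentile conventions differ between textbooks and tools, so the exact value depends on the method; this example assumes the "inclusive" interpolation method of `statistics.quantiles`:

```python
from statistics import quantiles

values = [1, 2, 3, 4, 5, 6, 7, 8]  # made-up data

# n=4 splits the data at the 25th, 50th and 75th percentiles.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")

iqr = q3 - q1
print(q1, q3, iqr)  # 2.75 6.25 3.5
```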

21
Q

Variance (the math kind)

A

A measure of how far data points spread from the mean: the average of the squared deviations from the mean.
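
A short Python sketch on made-up data. It shows both common conventions: the population variance divides the summed squared deviations by n, while the sample variance divides by n - 1:

```python
from statistics import pvariance, variance

values = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data; the mean is 5

# Squared deviations from the mean sum to 32.
population_var = pvariance(values)  # 32 / 8 = 4
sample_var = variance(values)       # 32 / 7, about 4.57
```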

22
Q

Hypothesis test

A

A procedure for testing a hypothesis: you estimate how likely results at least as extreme as yours would be if chance alone were at work, and use that to decide whether to reject the null hypothesis.

23
Q

Null hypothesis

A

The hypothesis that there is no significant difference between specified populations, any observed difference being due to sampling or experimental error.

24
Q

Standard deviation

A

A measure of the amount of variation in a set of values; the square root of the variance. Low = most values lie close to the mean. High = values are spread further from the mean.
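
To illustrate low versus high spread, here is a small Python sketch with two made-up data sets that share the same mean (10):

```python
from statistics import mean, pstdev

tight = [9, 10, 10, 11]  # values close to the mean
wide = [1, 5, 15, 19]    # values far from the mean

# Both means are 10, but the spreads differ a lot.
tight_sd = pstdev(tight)  # about 0.71
wide_sd = pstdev(wide)    # about 7.28
```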

25
Q

Test statistic

A

Number calculated from a statistical test of a hypothesis. It shows how closely your observed data match the distribution expected under the null hypothesis.

The test statistic is used to calculate the p-value of your results, helping to decide whether to reject your null hypothesis.

26
Q

Confidence interval

A

A confidence interval is a range of values, calculated from a sample, that you expect to contain the true population value a certain percentage of the time (the confidence level, e.g. 95%) if you run your experiment again or re-sample the population in the same way.
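
A rough Python sketch of a 95% confidence interval for a mean, on made-up measurements. It assumes a normal approximation (z of about 1.96); for a sample this small, a t-based interval would usually be the more appropriate choice:

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

sample = [4.8, 5.1, 5.0, 4.7, 5.3, 5.2, 4.9, 5.0]  # made-up measurements

# z-value leaving 2.5% in each tail (about 1.96 for a 95% interval).
z = NormalDist().inv_cdf(0.975)

# Margin of error = z * standard error of the mean.
margin = z * stdev(sample) / sqrt(len(sample))
low, high = mean(sample) - margin, mean(sample) + margin

print(f"95% CI: ({low:.2f}, {high:.2f})")
```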

27
Q

T-test

A

A t-test is a statistical test that is used to compare the means of two groups. It is often used in hypothesis testing to determine whether a process or treatment actually has an effect on the population of interest, or whether two groups are different from one another.
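
A minimal Python sketch of the test statistic behind a two-group t-test, on made-up data. It assumes Welch's form (unequal variances) and stops at the t statistic, because the standard library has no t distribution for turning it into a p-value:

```python
from math import sqrt
from statistics import mean, variance

group_a = [2, 4, 6]  # made-up data
group_b = [1, 2, 3]

# Standard error of the difference between the two means.
standard_error = sqrt(variance(group_a) / len(group_a)
                      + variance(group_b) / len(group_b))

# t = difference in means, measured in standard errors.
t_stat = (mean(group_a) - mean(group_b)) / standard_error
print(round(t_stat, 2))  # 1.55
```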

28
Q

ANOVA analysis

A

Analysis of variance test. Used to analyze the difference between the means of more than two groups.
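
A hedged Python sketch of the F statistic for a one-way ANOVA on three made-up groups. It compares the variation between group means with the variation inside the groups, and leaves out the p-value lookup:

```python
from statistics import mean

groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]  # made-up data
all_values = [x for g in groups for x in g]
grand_mean = mean(all_values)

# Variation of group means around the grand mean, and of values within groups.
ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

df_between = len(groups) - 1
df_within = len(all_values) - len(groups)

f_stat = (ss_between / df_between) / (ss_within / df_within)
print(round(f_stat, 2))  # 13.0
```

A large F suggests the group means differ by more than the within-group scatter would explain.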

29
Q

Chi square test

A

A chi-square test is a statistical test used to compare observed results with expected results, typically counts in categories.
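
A small Python sketch of the chi-square statistic itself, using a made-up example: 100 coin flips with 60 heads and 40 tails, against the 50/50 split expected from a fair coin:

```python
observed = [60, 40]  # made-up counts: heads, tails
expected = [50, 50]  # counts expected from a fair coin

# Sum of (observed - expected)^2 / expected over all categories.
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
print(chi_square)  # 4.0
```

The larger the statistic, the further the observed counts are from the expected ones.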