Module 1 Flashcards

Chapters 1,3,4 (50 cards)

1
Q

Define “Sample”

A

Subset of individuals from a population of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define “Estimation”

A

The ability to approximate an unknown quantity of a target population using sample data
-All estimates have a sampling distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define “Parameter”

Why is it subject to error?

A

Quantity describing a population from sample measurements/estimations.

Subject to error due to usage of incomplete data (a sample)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define” Random Sampling”

A

A sampling method that assures that the sample chosen from the population is chosen by giving everyone an equal and independent chance of being chosen.
-Minimizes bias and allows for standard error calculations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define “Sampling error”, in terms of bias and independence.

What is its relationship to precision?

A

A discrepancy that arises due to chance from sampling the population

SE = 1/precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define “Convenience Sampling”, in terms of bias and independence.

A

A sampling method that chooses the sample group from individuals/groups that are easily available.
-Introduces bias.
-The sample being unbiased & independent is not guaranteed.
-

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define “Volunteer Sampling”, in terms of bias and independence.

A

A sampling method that allows for the population of interest to give themselves up for sampling.
-Introduces bias and can’t guarantee independence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why are larger samples better?

A

They are more precise and have lower sampling error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define “Bias”

A

Discrepancy that arises due to the improper sampling of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the 2 major goals of sampling?

A

To reduce SE and bias & to allow for precision to be measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define “precision”

A

When the variables of the sampled population fall within the same range as one another (Clumped together).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define “accuracy”

A

When the variables of the sampled population fall within/on the range of the true population (On the mark).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define “Census”

A

The sampling of an entire population (rare)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define “variables”

A

Characteristics that differ amongst individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define “Categorical variables”

A

Qualitative measurement that can be sorted into groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define “Numerical variables”

A

Quantitative measurements.

Two types: discrete (integers) and continuous (any real #).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define “Nominal variable “

A

Categorical variables that have no inherent order (ex. colour of fur)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Define “Ordinal variable”

A

Categorical variable that has an order, despite no quantification (ex. small, medium, large).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Define “Interval variable”

A

A numerical variable that has an order on a numerical scale, with defined differences between points. No true 0. ex. year.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Define “Ratio Variable”

A

A numerical variable with defined ratios. True 0 (physically meaningful). ex. Mass.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Define “Observational study”

A

Nature assigns values, researches only observes activity and points to associations. No control of treatment assignment.

22
Q

Define “Experimental study”

A

Researcher assigns treatment values randomly to individual units of study (reminder: a unit of study can be a group).

23
Q

Define “ Explanatory variable”

A

Independent. Treatment being applied.

24
Q

Define “Response variable”. How is it determined?

A

Dependent on the explanatory variable. Determined by examining associations between variables in test groups.

25
Define "distributions"
The different measurements of different individuals in a sample
26
Define "Frequency distributions"
How often each value occurs in a sample
27
Define " Probability distribution"
How often each value occurs in the whole population
28
Define "Descriptive statistics". How are they described?
Quantities that capture key features of frequency distributions. -Described by location, spread, proportion.
29
Define "location"
Indicates where observations are centred in numerical data . ex. mean, average, mode
30
Define "Spread"
Indicates how dispersed observations are from the centre. ex. variance, SD, interquartile range.
31
Define "mean"
The average of the numerical data
32
Define "Median"
Middle value in a set of data (largest to smallest). | -Sensitive to extreme data
33
Define "mode"
The most frequently occurring observation
34
Define "variance"
Sum of squares of all residuals divided by dof. Remember its s^2. -Produces squared results!!! *Look at equation
35
Define "Standard Deviation"
Measures how far observations deviate from the mean. - Remember its s. - NEVER negative. - Same units as the observation being analyzed. * Look at equation
36
Define "Interquartile range"
Measurement of variability in the middle 50% of the data (1st, 2nd, 3rd). -Good indicator of SD when data is skewed/extreme.
37
What does a normal distribution on a histogram look like?
Bell shaped and centred.
38
Define "residual". What does the sum of all residuals produce?
Difference between an observation and a mean (e). Sum of residuals will always equal 0. *look at calculation formula
39
Define "Degrees of Freedom"
The number of valuables in a calculation that are free to vary (n-1)
40
Define "Skew"
A measure of asymmetry
41
Define "Positive skew"
Right skew, tail faces right (positive side of graph)
42
Define "Negative skew"
Left skew, tail faces left (negative side of graph).
43
Define "Sampling distribution"
The probability distribution of of all values of an estimate that might be obtained when a population is sampled
44
How does sample size affect sampling distribution?
Larger sample sizes produce narrower sampling distributions that fall more accurately on the true population parameters
45
Define "Standard error". What is it used for?
Standard deviation of an estimate's sampling distribution. - Measures precision of a sample estimate, acquired from the population mean. * Look at formula
46
What is the relationship of sample size and standard error?
As sample size increases, standard error decreases.
47
Define "Confidence Interval"
A range of values surrounding the sample estimate that likely contains the true population parameter.
48
What does 95% CI mean?
We are 95% confident that true population mean lies within the upper and lower limits of this interval (NOT that there's a 95% probability)
49
What is the purpose of error bars?
Demonstrate the precision of estimates (typically SE, not always).
50
Define "Coefficient of variation". What does it mean, in terms of variability?
Calculates SD as a percentage of the mean | -Low CV = less variability