Module 1 Flashcards
Statistics
Field of study involved the science of collecting, describing, and analyzing data
Population
Consists fo all individuals or objects of interest; often, the group being studied
Sample
A group selected from the population or a subset of the population; provides information used to infer information about populations
Why are samples used
Because it difficult getting access to an entire population
Question posed to entire STAT250 section:
What is your favorite streaming service?
What is the population?
What is the sample?
Can the sample data be used to make inferences about the population?
Population: All Mason students
Sample: STAT250 students in this section
Can this sample be used to make inferences about the population? Yes & No; Are all students taking a math/logic course as their mason core?
What happened when Truman was predicted to lose?
The polls were taken by people (generally wealthier and of a different voting party practice than the entire population) who did not serve as an efficient sample size of the population so the pre-mature prediction was inaccurate.
Random Sample
Names in a hat method;
When choosing a simple random sample of n units, all groups of the size n in the population have the same chance of becoming the sample
Best ways to do random sampling?
Small group: names in a hat
Large group: technology (random.org, rguroo, statkey, excel)
What things might prohibit sampling?
Inability to contact population; cost
Non-Random Sample
Targeting a group which may or may not be a conflict on obtaining data
Is it possible to successfully gather a random sample without tech?
Nope.
Explain Question Wording bias
- Question wording, or confusing or non-neutral language/leading questions;
- inaccurate responses (response bias)
- self-reporting data (usually have strong opinions one way or another)
What types of variables are considering during data collection?
Explanatory Variable & Response Variable
Explanatory Variable
The variable we think is “explaining” the change in the response variable (usually the variable scientists are manipulating)
Response Variable
The variable we think is beign impacted or changed by the explanatory variable; it “responds” to changes in the explanatory variable
What is an association relationship between variables?
Two variables are associated if values of one variable tend to be related to values of the other variable
What is a causation relationship between two variables?
Two variables are causally related (or associated) if changing the value of the explanatory variable influences the value of the response variable
Which of the following statements imply a causal relationship? Also,
think about which are the explanatory and response variables.
a) “Daily Exercise Improves Mental Performance.”
b) “Want to lose weight? Eat more fiber!”
c) “Sales stay the same no matter what medium of advertising is used.”
d) “Goldfish who live in large ponds tend to be larger than goldfish who live in
small ponds.”
a & b have causal relationships
Is the following statement a causal or association relationship?
“goldfish who live in large ponds tend to belarger than goldfish who live in small ponds”
Association: Words like “tend to” or “usually” do not imply causation
What words do not imply causation?
“tend to” or “usually”
What type of variable relationship does the following statement imply?
“Sales stay the same no matter what medium of advertising is used”
Neither causation nor association
Identify the explanatory and response variables:
a) “Daily Exercise Improves Mental Performance.”
b) “Want to lose weight? Eat more fiber!”
c) “Sales stay the same no matter what medium of advertising is used.”
d) “Goldfish who live in large ponds tend to be larger than goldfish who live in
small ponds.”
Explanatory variables; Response variables
a) daily exercise; mental performance
b) fiber consumption; weight maintenance
c) advertising; sales
d) pond size; goldfish size
T/F An associated between an explanatory and response variable, especially when very strong, always guarantees that the two variables are causally related
False; some variables are associated even when they have no cause and effect relationship
Confounding Variable
- A third variable that is associated with both the explanatory and response variable;
- Also called a confounding factor or lurking variable