Lecture 24 ʚɞ ⁺˖ ⸝⸝ Flashcards
(18 cards)
What are the two types of distributions discussed?
Univariate and Bivariate
Univariate describes a single variable, while Bivariate describes the relationship between two variables.
In a Univariate distribution, what is plotted on the y-axis?
Frequency or probability
This represents how often different values of the variable occur.
What are the three characteristics used to describe a Univariate distribution?
Shape, Center, Spread
These characteristics help summarize the distribution of a single variable.
What are the three characteristics used to describe a Bivariate distribution?
Form, Direction, Strength
These characteristics help summarize the relationship between two different variables.
What is the purpose of whiskers in a box and whisker plot?
To capture the data outside of the box
Whiskers extend to a limit defined by 1.5 x IQR.
What is labeled as a ‘suspected outlier’ in a box and whisker plot?
Any observation lying beyond the whiskers
These observations are marked with a dot.
List the four sampling methods mentioned.
- Simple Random Sample
- Cluster Sample
- Stratified Random Sample
- Systematic Sample
These methods are used to gather data in a representative manner.
What is essential for a sampling methodology to be effective?
It should be representative (random and independent)
This ensures that the sample accurately reflects the population.
What are the two probability rules mentioned?
- General Addition Rule
- General Multiplication Rule
These rules help calculate probabilities in different scenarios.
What theorem is used to calculate conditional probabilities?
Bayes’ Theorem
This theorem relates the conditional and marginal probabilities of random events.
What is the focus of HW4 in the context of statistics?
Sample Proportions and T tests
T tests are used to determine if there are significant differences between group means.
What statistical method is introduced in HW7?
Ordinary Least Squares
This method is used in regression analysis to minimize the sum of squared differences.
What is a potential issue in regression analysis mentioned?
Omitted Variable Bias
This occurs when a model leaves out one or more relevant variables.
What type of variables are discussed in the context of regression?
Categorical Variables
These are variables that can take on one of a limited, and usually fixed, number of possible values.
True or False: Correlation implies causation.
False
Correlation does not imply that one event causes the other.
Fill in the blank: The frequent consumption of fried potatoes is associated with an increased _______.
mortality risk
This statement highlights a correlation that warrants further investigation.
What is the significance of the ‘Octopus predicting the world cup winner’ example?
It demonstrates bias in sampling
This example illustrates how predictions can be influenced by non-random factors.
What is the focus of HW8?
Test 2 Makeup and Final Writeup
This indicates a review and summary phase in the coursework.