DSE1101 Flashcards
(104 cards)
What is a variable
characteristics observed in a study.
When does variable become categorical
U
observation belongs to a set of categories.
When does variable become quantitative
observations take on numerical values that represent different magnitudes
What is also called independent variable
Explanatory variable
What is also called dependent vairable
Response variable
What is mean
“average, is one way to measure the center
of a distribution.”
What is sample mean
The sample mean is a sample statistics and serve as a point estimate of the population mean.
What kind of variable does histogram show/
distribution of a continuous variable.
What is modality
associated with the numner of peaks your data have. If have one peak, only talking about a general pattern and data is called unimodal.
What is unimodal?
1 peak
What is 2 peaks
bimodal
What is more than 2 peaks
multimodal data
What is it called when all have same peask
uniform data
Where is the peak on negatively skewed data
“Long tail on left
Peak on right”
Give an example of negatively skewed data
“GPA
Age of death”
What is the peak on positively skewed data
“Longer tail on right
Peak on left”
If question ask wheterh left or right skewed, do we remove outliers first?
Yes
When you find data of some people who spend $1000 in super market, is it an error?
No, take them aside to be analysed separately
Why use median over mean?
More robust to outliers
What is the cons of using median
“MEAN IS EASIER TO COMPUTE THAN MEDIAN, REQUIRE MORE COMPUTING POWER
No need to sort”
If question ask wheterh left or right skewed, do we remove outliers first?
YES
If distribution is skewed or has some extreme values, where is the center
median
If distribution is left skewed, where is median in relation to mean
“mean smaller than median
Median is always closer to the PEAK”
What is variance?
the average squared deviation from the sample mean.