Statistik Flashcards
(40 cards)
Vad innebär negative growth rate?
Exempelvis att Working age people decreases because less childen are born, and non working people (60–>) increases because people live longer.
Vilka är the four basic steps in statistics?
- Gathering data
- Understanding data
- Modeling of data
- Conclusions from data
What does population means?
A gathering of all elements with something in common.
Ex: The total number of students in a city
What is sample?
A collection of elements drawn from the population.
A sample is a smaller, manageable version of a larger group
What is sampling frame?
A list of all the elements of population or the source material or device from which the sample is drawn.
It is a list of all those within a population who can be sampled, and may include individuals, households or institutions.
It’s a complete list of everyone or everything you want to study
What is the different between a population and a sample frame?
The population is general and the frame is specific.
For example, the POPULATION could be “People who live in Jacksonville, Florida.”
The SAMPLE FRAME would name ALL of those people, from Adrian Abba to Felicity Zappa.
What does it means that the sample frame is Over coverage?
That the sampling frame contains elements that are not a part of the population
What does it means that the sampling frame is under coverage?
That there are elements in the population that are not included in the sampling frame.
Ex. 100 homeless people, but only 90 registered.
What are the 4 levels of data?
- Nominal data
- Ordinal data
- Interval data
- Ratio data
What defines Nominal data?
Nominal data is the lowest level of data.
It can be classified into categories and they have no natural order.
Ex: gender (male/female), eye color etc.
MODE!
What defines ordinal data?
The second lowest level of data.
It can be classified into categories with natural order and the order is significant.
Measured of non-numeric concepts lie satisfaction, level of happiness etc.
Ex: order from Very satisfied to not at all satisfied.
MODE & MEDIAN!
What defines interval data?
The second highest level of data
The data is numerical but lacks an absolute zero, meaning that when the measure is zero, there is nothing at all.
Interval scale tells us about the order and also about the value between each item.
Ex: Temperature, time. (Kan inte va = 0)
MODE, MEAN & MEDIAN!
What defines ratio data?
The highest level of data.
Has an absolute zero.
Ex: Lenght, weight, temperature in Kelvin scale.
MODE, MEAN & MEDIAN!
Which measurement is best to use when the data has outliers or extreme values?
Median.
Mean kan be effected by outliers/extreme valuer, true or false?
True. And therefore mean is not a good option to use when we have outliers or extreme values.
What does it mean if the modal percentage is close to 100?
That the spread is small.
What defines the upper quartile (Q3) ?
That it has 25% of the observations above it and 75% below.
What is the IQR, Inter quarter range?
The difference between Q3-Q1.
What is an outlier?
An observation that is distance from the other observations.
Outliers can upcome because of variability in the measurement or because of a experimental error (datafel).
They are sometimes excluded from the data set.
If a datapoint is below: Q1 -(1.5xIQR)
OR above: Q3+(1.5xIQR)
What is an extreme value?
When a data point is below: Q1-(3xIQR)
OR above: Q3+(3xIQR)
What does cause and effect mean?
That one variable is the cause of the other ones effect. Ex:
x –> y = outdoor temp bestämmer indoor temp
x y = de påverkar varandra, ex vid income and consumption.
x y. En variabel påverkar två andra.
x, y —> ingen relation mellan dem.
What are the important points in correlation?
It is a numerical measure of relationship between two variables.
The sign of correlation relates to the slope of best fitting line through x and y.
Correlation does not imply causality (orsakssamband)
“r” measures something, what?
What does the value lie in-between ?
That there is no LINEAR relationship between two variables.
-1 < r > 1
What is interpolation?
Interpolation is an estimation of a value within two known values in a sequence of values.