Applied Economics & Statistics 1: An Introduction to Statistics, Measurement, and Presentation of Data* Flashcards
(37 cards)
Explain why Statistics is important
- Statistics is not just used in economics.
- Required for many business subjects, social sciences, and hard
sciences. - Needed to process and analyse data, and data available increases
constantly. - Google tracks internet usage, supermarkets purchases, police crime
data… - Statistical techniques required to make sense of the data and enable
personal, business, and scientific decision making.
What’s a ‘statistic’?
A statistic is a number used to communicate a piece of information
The inflation rate is 5.2%.
The average mark on a module is 60%.
The price of a new Toyota Supra is £50,545.
What’s the name for ‘a number used to communicate a piece of information’?
A statistic
Define ‘statistics’
The science of collecting, organizing, presenting, analysing, and interpreting data to assist in making more effective decisions.
- How does this year’s rate of inflation compare with last year’s? Is
there a trend of increasing or decreasing inflation? Is there a
relationship between inflation and interest rates?
- How does the module mark vary compared to previous years, and
other modules? Does changing the lecturer teaching the module
affect average marks?
- The price of a new Toyota Supra is £50,545. The Mazda MX-5 is
cheaper, costing £25,825. What are the differences in the cars’ specs,
and how are they related to price? What other information would you
need for a purchase decision
What are the different types of statistics?
Descriptive and Inferential Statistics
What’s ‘Descriptive Statistics’?
- Methods of organizing, summarizing, and presenting data in an
informative way - Organize and summarize data with graphs and tables.
- Statistical measures describe the characteristics of a distribution
What’s the name for ‘methods of organizing, summarizing, and presenting data in an
informative way’?
Descriptive Statistics
Define ‘Inferential Statistics’
The methods used to estimate a property of a population on the basis of a
sample
What’s the name for ‘the methods used to estimate a property of a population on the basis of a
sample’?
Inferential Statistics
Define ‘population’ in statistics
The entire set of individuals / objects of interest or measurements
obtained from all individuals / objects of interest
Define ‘sample’ in statistics
A portion, or part, of the population of interest
Describe & explain the types of statistical variables
- Qualitative variable - The characteristic being studied is non-numeric. E.g.: gender, religion, eye
colour - Quantitative variable - The characteristic is numerical and the numbers have a meaning. E.g.
number of children in a family, hourly wage, minutes remaining in the
lecture.. It can be further divided into:
1. Discrete - These can assume only certain discrete values. There are usually “gaps” in
between the values. E.g. children in a family —this variable can only take
on a discrete set of values, 0, 1, 2, 3 etc.
2. Continuous - These can assume any value within a range and can be measured to any
required degree of precision. E.g.: weights, heights and time. The time it
takes for each student to finish an exam can be measured anywhere along
the real positive line, i.e. from 0 to infinity and could be measured to the
millisecond.
What are the levels of measurement?
Why are they different?
- Data can be further classified into 4 levels of measurement:
1. Nominal data.
2. Ordinal data.
3. Interval data.
4. Ratio data.
Each require different methods for summarizing and presenting, and a
different type of statistical analysis.
Describe & explain ‘nominal data’
- Nominal Data - Data represented as labels
or names. They have no order. They can only be classified and counted.
E.g., hair colour, religion, sexual orientation, gender. - No other mathematical operations permitted.
E.g., even if we assign numerical values like heterosexual=1, gay=2,
bisexual=3, it makes no sense to say 1+2=3, therefore
heterosexual+gay=bisexual. - Labels are mutually exclusive, e.g., can’t be both Christian and Muslim.
- Labels are exhaustive: every individual observation must belong to a
category (even if it’s ‘Other!’)
What’s the name for ‘data recorded at the nominal level of measurement represented as labels
or names; they have no order; they can only be classified and counted’?
Nominal Data
Describe & explain ‘ordinal data’
- Ordinal Data - Data based on a relative
ranking or rating of items based on a defined attribute or qualitative
variable. Variables based on this level of measurement are only ranked or
counted.
E.g., university league table position, educational attainment, level of
satisfaction with your lecturer. - Differences between data values are meaningless: university A being
above B in the leave table only says A is better than B, not by how much. - Can be compared and ranked as labels have relative values.
- Can also
find absolute and relative size of each label
What’s the name for ‘data recorded that’s based on a relative ranking or rating of items based on a defined attribute or qualitative
variable; variables based on this level of measurement are only ranked or
counted’?
Ordinal Data
Describe & explain ‘interval data’
- Data where the interval or the
distance between values is meaningful. The interval level of measurement is based on a scale with a known unit of measurement - Equal differences in the values are represented by equal differences in
the measurements. - Known units of measurement e.g. degrees Celcius, or shoe size. Difference between 10◦C and 15◦C is the same as that between 20◦C
and 25◦C. - Zero is a point on the scale, not the absence of a condition.
- Ratios don’t make sense with interval data: a size 28 dress is not
twice as large as a size 14
What’s the name for ‘data recorded where the interval or them distance between values is meaningful and based on a scale with a known unit of measurement’
Interval data
Describe & explain ‘ratio data’
- Data that’s based on a scale with
a known unit of measurement and a meaningful interpretation of zero on
the scale - Zero means an absence of the characteristic.
- Most quantitative data is recorded at this level.
E.g.: weight, age, number of family members, income, population,
investment, distance travelled, etc.
A pint takes twice as much beer as a half pint.
What’s the name for ‘Data recorded that are based on a scale with a known unit of measurement and a meaningful interpretation of zero on
the scale’?
Ratio data
State the ways data can be presented
- Frequency Distribution Table.
- Histogram, Frequency Polygon and Cumulative Frequency Distribution
- Bar Chart, Line Chart, Pie Chart, Scatter Plot
Describe a frequency distribution and what the point of it is
Include definition
- Suppose we had a set of data and the data are organised in tabular form. The table doesn’t give us much of an idea of how the sales are
distributed. - That’s because it only presents the raw data.
- A table that’s organized in some way would be much more useful.
- A way of achieving this is with a frequency distribution, or frequency
table - Frequency Distribution - Grouping of qualitative data into mutually exclusive and collectively
exhaustive classes showing the number of observations in each class.
What’s the name for ‘grouping of qualitative data into mutually exclusive and collectively
exhaustive classes showing the number of observations in each class’?
Frequency Distribution