Summarising Data Flashcards
(38 cards)
Types of data
Numerical data
Categorical data
Numerical data
Discrete data
Continuous data
Categorical data
Attribute/dichotomous data
Nominal data
Ordinal data
Descriptive statistics
The methodology for describing or summarising a set of data using tables, diagrams and numerical measures.
Batch data
Are a set of related observations, such as the current inflation rates of EU countries.
Sample data
Are a set of observations selected from a population and designed to be representative of that population.
Discrete data
Can only take one of a set of particular values.
Discrete data arise from counting.
Continuous data
Can take any value within a specified range.
Continuous data arise from measuring.
Attribute/dichotomous data
Have only two categories.
E.g. yes/no, male/female
Nominal data
Have several unordered categories.
E.g. type of policy, nature of claim
Ordinal data
Have several ordered categories.
E.g. strongly in favour / … / strongly against
Frequency distribution
Lists data values along with their corresponding frequencies.
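As a minimal sketch of this card, the snippet below builds a frequency distribution for a small set of discrete values. The `claims` data are hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical sample: number of claims per policy (discrete data).
claims = [0, 1, 0, 2, 1, 0, 3, 1, 0, 2]

# Counter maps each distinct value to the number of times it occurs.
frequency = Counter(claims)

# Present the distribution in ascending order of data value.
distribution = sorted(frequency.items())

for value, count in distribution:
    print(value, count)
```

The frequencies always add up to the number of observations, which is a quick check that no value was missed.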
Frequency
The number of times something occurs.
Types of frequency distribution
Standard frequency distribution
Cumulative frequency distribution
Grouped frequency distribution
Relative frequency distribution
Percentage frequency distribution
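The cumulative, relative and percentage forms can all be derived from a standard frequency distribution. The sketch below assumes a hypothetical set of values and frequencies; the grouped form is not shown here.

```python
from itertools import accumulate

# Hypothetical standard frequency distribution: values and their frequencies.
values = [0, 1, 2, 3]
frequencies = [4, 3, 2, 1]
n = sum(frequencies)  # total number of observations

# Cumulative frequency: running total of the frequencies.
cumulative = list(accumulate(frequencies))

# Relative frequency: each frequency as a proportion of the total.
relative = [f / n for f in frequencies]

# Percentage frequency: relative frequency expressed out of 100.
percentage = [100 * f / n for f in frequencies]

for row in zip(values, frequencies, cumulative, relative, percentage):
    print(row)
```

The last cumulative frequency equals the total number of observations, and the relative frequencies sum to 1.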
Number of classes in a frequency distribution
Choose the smallest k such that 2^k >= n, where:
k = number of classes
n = number of observations
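The rule above can be sketched as a small search for the smallest k satisfying 2^k >= n. The function name `number_of_classes` is hypothetical, and starting the search at k = 1 (so that there is always at least one class) is an assumption not stated on the card.

```python
def number_of_classes(n: int) -> int:
    """Return the smallest k >= 1 with 2**k >= n."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    k = 1  # assumption: use at least one class
    while 2 ** k < n:
        k += 1
    return k


for n in (50, 64, 100):
    print(n, number_of_classes(n))
```

For example, 50 observations give k = 6, because 2^5 = 32 is too small and 2^6 = 64 is large enough.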
Class interval
Each category (range of values) into which the data sample is grouped.
Class interval formula
Max value - min value, i.e. the range that the class intervals must span.
Class width
(Max value - min value) / number of classes
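A minimal sketch of the width calculation described above: max value minus min value, divided by the number of classes. The `ages` data and the function name `class_width` are hypothetical. Rounding the width up to a whole number is a common convenience and an assumption here, not something the cards state.

```python
import math


def class_width(data, k):
    """Return (max - min) / k for a data set split into k classes."""
    data_range = max(data) - min(data)
    return data_range / k


# Hypothetical sample of 8 observations; 2**3 = 8 >= 8, so use k = 3 classes.
ages = [23, 35, 47, 12, 30, 41, 19, 28]
width = class_width(ages, 3)

# Assumption: round up so the classes cover the whole range with tidy limits.
rounded_width = math.ceil(width)

print(width, rounded_width)
```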
Bar Chart
Is a chart or graph that represents categorical data with rectangular bars whose heights are proportional to the values that they represent.
A bar graph shows comparisons among discrete categories.
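To illustrate the idea without a plotting library, the sketch below prints a text bar chart for a hypothetical list of policy types. The bars are drawn horizontally, so it is their lengths rather than their heights that are proportional to the category counts.

```python
from collections import Counter

# Hypothetical nominal data: type of policy for six customers.
policy_types = ["motor", "home", "motor", "travel", "motor", "home"]
counts = Counter(policy_types)

# One line per category: a left-aligned label, then one '#' per observation.
lines = [
    f"{category:<8}{'#' * count} {count}"
    for category, count in sorted(counts.items())
]

print("\n".join(lines))
```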
Types of bar chart
Standard bar chart
Grouped bar chart
Stacked bar chart
Grouped bar chart
Is used to compare the same categories within different groups.
Stacked Bar Chart
Highlights the part-to-whole relationship of categories and allows comparison across various groups.
Histogram
Is an accurate representation of the distribution of numerical data; an estimate of the probability distribution of a continuous variable.
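The counts behind a histogram can be sketched by grouping continuous data into equal-width classes. The `heights` data and the function name `histogram_counts` are hypothetical, and placing the maximum value in the last class is one possible convention, assumed here.

```python
def histogram_counts(data, k):
    """Count observations in k equal-width classes spanning min..max."""
    low, high = min(data), max(data)
    if high == low:
        raise ValueError("data must contain at least two distinct values")
    width = (high - low) / k
    counts = [0] * k
    for x in data:
        # Index of the class containing x; the maximum goes in the last class.
        index = min(int((x - low) / width), k - 1)
        counts[index] += 1
    # Class boundaries: k + 1 edges from the minimum to the maximum.
    edges = [low + i * width for i in range(k + 1)]
    return edges, counts


# Hypothetical continuous data (heights in cm), grouped into 3 classes.
heights = [150.0, 152.5, 158.0, 160.0, 161.5, 165.0, 170.0, 171.0, 178.0, 180.0]
edges, counts = histogram_counts(heights, 3)

print(edges)
print(counts)
```

Each count would be drawn as a bar over its class, with adjacent bars touching because the underlying variable is continuous.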
Measures of location are used to
Estimate the central point of a sample; they are different ways of calculating an average value for the data set.
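The card does not name specific measures, so the sketch below assumes the three most common ones (mean, median and mode), computed with Python's standard `statistics` module on hypothetical data.

```python
import statistics

# Hypothetical sample: number of claims per policy.
claims = [0, 1, 0, 2, 1, 0, 3, 1, 0, 2]

mean = statistics.mean(claims)      # sum of the values divided by their count
median = statistics.median(claims)  # middle value of the sorted data
mode = statistics.mode(claims)      # most frequently occurring value

print(mean, median, mode)
```

The three measures can disagree: here the mean and median are both 1, while the mode is 0 because zero claims is the most frequent outcome.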