Statistics Midterm Flashcards
(41 cards)
Descriptive statistics
Consists of collection, organisation, summarisation and presentation of data.
Inferential statistics
Consists of generalising from samples to population, performing estimations, hypothesis testing, determining relationships among variables, and making predictions.
Population
Consists of all subjects that are being studied.
Sample
Is a group of subjects selected from the population.
Parameter
A measure that describes a characteristic of a population.
Statistic
A measure that describes a characteristic of a sample.
Selection bias
A distortion of evidence or data that arises from the way that the data are collected.
Variable
A characteristic or attribute that can assume different values.
Data
The values that variables can assume.
Qualitative variable
The characteristic being studied is non numeric.
Quantitative variable
Information is reported numerically.
Discrete variable
Can only take on a finite number of values. (ex people)
Continuous variables
Can only take on an infinite number of values. (ex hight/ time)
Nominal level of measurement
Classifies data into mutually exclusive (no overlapping), exhaustive categories in which no order or ranking can be imposed.
Ordinal level of measurement
Classifies data into categories that can be ranked. (precise differences between the ranks do not exist).
Interval level of measurement
Ranks data, and precise differences between units of measurement do exist. However, there is no meaningful zero. (ex temperature)
Ratio level of measurement
Ranks data, and precise differences between units of measurement do exist. A true zero exists.
Frequency table (categorical frequency distribution)
The organisation of qualitative data into table form, using mutually exclusive classes and showing the number of observations in each class.
Relative frequency
Class frequency / total frequency
Pie chart
degrees = class relative frequency * 360
Building frequency tables
You should have: class limit, class boundaries, class midpoint, frequency, cumulative frequency, relative frequency + sometimes cumulative relative frequency.
Rules: between 5 to 20 classes, the classes must be mutually exclusive, the classes must be continuous, the classes must be exhaustive (cover full data range), the classes must be equal width.
Procedure: determine class range, decide number of classes, decide the width, set class limits and boundaries, count the number of occurrences in each class.
Cumulative frequency
The total number of values that are less than a given upper class boundary.
Class midpoint
Lower + upper class limit / 2
Histogram
Display the data by using vertical bars of various heights to represent the frequencies. Vertical line: frequencies Horizontal line: class boundaries