STA8170 Flashcards
(115 cards)
Data
systematically recorded values (numbers or labels) together with their context
Categorical/qualitative variable
variable that names categories with words or numbers
Context (info required for?) (x6)
who was measured what was measured how data was collected where data was collected when and why study was done
Rows in a data table hold…
individual cases, eg respondents, participants, subjects, units, records
Columns in a data table hold…
variables that give info about each individual case
Quantitative variable
an amount or degree, measured in meaningful numbers eg scale
Identifiers
variable that assigns unique value to each individual/case - cannot be analysed
Relational database
large data bases that link data tables together by matching identifiers
Ordinal variable
categorical variable with ordering of values
Data table
an arrangement of data in which each row represents a case, and each column a variable
Case
individual about whom/which we have data
Record
info about an individual/case in a database
Sample (x2)
representative subset of population
analysed to estimate/learn about the population
Population
the collection of all individuals or
items or objects of interest
Nominal variable
variable whose values are only names of categories
Units
quantity or amount used as standard of measurement
Parameter (and greek letter)
any numerical characteristic of a population - μ (meuw)
Distribution (x2)
description of all the values a variable can take, and how often those values occur
Three important things pictures can do in data analysis?
reveal things not able to be seen in data tables, helping to think about patterns/relationships
show important features in the data
tell others about the data
Area principle (for graphing data)
the area occupied by a part of the graph should correspond to the magnitude of value it represents
Frequency table (x3)
organises the cases according to their variable
rows are category names
also records totals
describes the distribution of a categorical variable
Relative frequency table (x2)
displays percentages, rather than counts, of values in each category
describes the distribution of a categorical variable
Bar chart (x3)
Display distribution of a categorical variable
Categories on the x, counts on the 7
spaces between the bars indicate that freestanding bars can be placed in any order
Relative frequency bar chart
shows the percentage/proportion of values (y) falling under each category (x)