Module 1 (Ch. 1-3) Flashcards
(45 cards)
Population
All of the entities of interest in a study (ppl, households, machines, etc.)
Sample
A subset of the population, often randomly chosen and representative of the pop as a whole
Data Set
A rectangular array of data
Variable
Aka Field or Attribute A characteristic of members of a population (height, gender, salary) ROW (left to right)
Observation
Aka Case or Record A list of all variable values for a single member of a population COLUMN (up and down)
Numerical Variable
A variable where meaningful arithmetic can be performed on (age, children, salary)
Categorical Variable
A variable where NO meaningful arithmetic can be performed. (gender or state) Can either be ORDINAL or NOMINAL. Can be coded numerically or left uncoded. Opinion Variables - “strongly disagree”
Date Variable
Treated differently from typical numbers
Ordinal (Categorical Variable)
There is a natural ordering of its possible values
Nominal (Categorical Variable)
No natural ordering of its possible values
Dummy Variable
0 - 1 coded variable for a specific category (1 for all in the category, 0 for all not in the category)
Binned or Discretized Numerical Variable
Categorizing a NUMERICAL variable by putting data into discrete categories (called BINS)
Discrete Numerical Variable
If it results from a count (number of children)
Continuous Variable
Essentially continuous measurement (weight or height)
Cross-Sectional Data
Data on a cross sections of a population at a distinct point in time
Time Series
Data collected over time.
Count of Categories
Count the number of observations (columns) in each category
Mean
The average of all values. In Excel…Average Function
Sample Mean
A sample from some larger pop. Denoted by a X with a line above.
Population Mean
Represents the entire population. Denoted by a “U”
Median
Middle observation when data is sorted from smallest to largest. In Excel…Median Function
Mode
The value that appears most often. In Excel….Mode Function
Range
A measure of Variability (flexibility).
Maximum value minus minimum value.
Very sensitive to extreme values
Interquartile Range (IQR)
3rd quartile minus 1st quartile.
Less sensitive to extreme values
