Week 1 Flashcards
(17 cards)
Types of Data Analytics
- Descriptive - Insight into the past
- Predictive - Look into the future
- Prescriptive - Data-driven decisions about what to do
What is data vs. information?
Data - Raw numbers, facts, etc…
Information - Structured, meaningful, and useful numbers and facts.
What is categorical data?
Data that has no intrinsic numerical value.
1. Nominal - Two or more outcomes that have no natural order (e.g., movie genres and hair color)
2. Ordinal - Two or more outcomes that have a natural order (e.g., movie ratings and education levels)
What is numerical data (quantitative)?
Data that has an intrinsic numerical value.
1. Continuous - Data that can attain any value on a given measurement scale.
- Interval data: Equal intervals represent equal differences, but there is no fixed “zero point” (e.g., temperature in Celsius, clock time, birth year)
- Ratio data: Both differences and ratios make sense; there is a fixed “zero point” (e.g., movie budget, temperature in Kelvin, distance, time duration)
2. Discrete - Data that can only attain certain values (typically integers) (e.g., the number of days with sunshine in a certain year, the number of traffic incidents)
What is logarithmic scale data?
Logarithmic scale data is numerical, but it is neither ratio nor truly interval!
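One way to read this card: equal steps on a log scale correspond to equal ratios of the underlying quantity, not equal differences. The sketch below assumes a base-10 scale and uses made-up values; it is an illustration, not part of the original deck.

```python
import math

# Hypothetical raw measurements (made up for illustration).
raw = [10.0, 100.0, 1000.0]

# The same measurements expressed on a base-10 logarithmic scale.
log_values = [math.log10(x) for x in raw]

# Steps between neighbouring values on the log scale.
log_steps = [b - a for a, b in zip(log_values, log_values[1:])]

# Ratios and differences between neighbouring raw values.
raw_ratios = [b / a for a, b in zip(raw, raw[1:])]
raw_differences = [b - a for a, b in zip(raw, raw[1:])]

print("log steps:      ", log_steps)
print("raw ratios:     ", raw_ratios)
print("raw differences:", raw_differences)
```

The log steps come out equal and match equal raw ratios, while the raw differences are not equal.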
Reference table
Used to store all data in a table so that it can be looked up easily
Demonstration table
Used to illustrate a point (with just enough data, or with a specific summary)
What are the 3 types of summary statistics?
There are three types of summary statistics:
- Level: Location summary statistics → What are “typical” values?
- Spread: Scale summary statistics → How much do values vary?
- Relation: Association summary statistics → How do values of different quantities vary simultaneously?
What is the mean?
The arithmetic average: the sum of all values divided by the number of values
What is the median?
The value separating the higher half from the lower half of a data set
- Median = 50th percentile = 2nd quartile
What is the mode?
The most frequently occurring value; it may be non-unique
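The three location statistics (mean, median, mode) can be checked with Python's standard `statistics` module. This is a minimal sketch on a made-up data set, chosen so that the mode is non-unique.

```python
import statistics

# Hypothetical data set (made up for illustration).
values = [2, 3, 3, 5, 7, 7, 9]

# Mean: sum of the values divided by how many there are.
mean = statistics.mean(values)

# Median: middle value of the sorted data (50th percentile).
median = statistics.median(values)

# Mode: multimode returns every most-frequent value, so ties are visible.
modes = statistics.multimode(values)

print("mean:  ", mean)
print("median:", median)
print("modes: ", modes)
```

Here 3 and 7 each occur twice, so `multimode` returns both of them.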
What is the range?
Max value - Min value
What is the interquartile range?
Q3 – Q1 (3rd quartile - 1st quartile)
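A small sketch of the range and interquartile range on made-up data. Quartiles can be computed under several conventions; the `method="inclusive"` setting of `statistics.quantiles` used here is just one of them, and other conventions can give slightly different Q1 and Q3.

```python
import statistics

# Hypothetical data set (made up for illustration).
values = [1, 3, 5, 7, 9, 11, 13, 15, 17]

# Range: largest value minus smallest value.
value_range = max(values) - min(values)

# Quartiles: n=4 splits the data into four parts and returns the three cut points.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Interquartile range: spread of the middle half of the data.
iqr = q3 - q1

print("range:", value_range)
print("Q1, Q2, Q3:", q1, q2, q3)
print("IQR:", iqr)
```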
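What is sample variance?
The sum of squared deviations from the sample mean, divided by n - 1: $s^2 = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2$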
What is sample standard deviation?
The square root of the sample variance: $s = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2}$
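As a sketch, the sample variance (sum of squared deviations from the mean, divided by n - 1) and the sample standard deviation (its square root) can be computed by hand and compared with the standard library. The data set is made up for illustration.

```python
import math
import statistics

# Hypothetical data set (made up for illustration).
values = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(values)

# Sample mean.
mean = sum(values) / n

# Sample variance: squared deviations from the mean, summed and divided by n - 1.
squared_deviations = [(x - mean) ** 2 for x in values]
sample_variance = sum(squared_deviations) / (n - 1)

# Sample standard deviation: square root of the sample variance.
sample_std = math.sqrt(sample_variance)

# The standard library functions use the same n - 1 convention.
print(sample_variance, statistics.variance(values))
print(sample_std, statistics.stdev(values))
```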
What is median absolute deviation (MAD)?
The median of the absolute deviations from the median
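The definition above can be followed step by step in a short sketch. The data set is made up and includes one extreme value on purpose.

```python
import statistics

# Hypothetical data set with one extreme value (made up for illustration).
values = [1, 2, 3, 4, 100]

# Step 1: median of the data.
center = statistics.median(values)

# Step 2: absolute deviation of each value from that median.
absolute_deviations = [abs(x - center) for x in values]

# Step 3: median of those absolute deviations.
mad = statistics.median(absolute_deviations)

print("median:", center)
print("absolute deviations:", absolute_deviations)
print("MAD:", mad)
```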