What is the PPDAC process?
A framework for doing data analysis
- Problem
- Plan (What to measure and how)
- Data (collect, manage, process)
- Analysis (reorganise, run analysis, identify patterns)
- Conclusion (Interpret the results)
What is structured data?
What are the two general goals and learning paradigms?
What is supervised learning?
What is unsupervised learning?
What is the objective for interpretative /analytical data?
Understand, summarise, segment & support decisions
What are typical sources of interpretative /analytical data?
What do the variables for interpretative /analytical data usually look like?
What are typical results when using interpretative /analytical data?
What is the objective for Opportunistic / (pure) predictive data?
predict outcomes
What are typical sources of Opportunistic / (pure) predictive data?
What do the variables for Opportunistic / (pure) predictive data usually look like?
What are typical results when using Opportunistic / (pure) predictive data?
What is dimensional reduction?
Combining many starting variables into a final factor
What are the two command types in R-Studio?
What are expressions?
Something that R can evaluate to produce a value
- E.g. 3+5
What are Assignments?
Assignments store the result of an expression in a variable
- E.g. x <- expression
How are expressions separated?
How can you group expressions?
What are the assignment operators? (How is a value assigned?)
Either = or <-
How can we allow multiple expressions treated as one unit?
What is a vector?
What are the basic vector types?
How can you compute the variance of a vector ?