Introduction Flashcards
(25 cards)
What is data analysis used for?
- Answer questions
- Suggest conclusions
- Support decision making
What is the process of Data Analysis?
Converting raw data into useful information via statistical and logical methods
What are independent vs dependent variables?
- Independent variables are model inputs
- Dependent variables are model outputs
How are outputs derived?
From their potential relationships with the inputs (independent variables)
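A minimal sketch of the input/output idea, written in Python using only the standard library. The data, the variable names (`hours`, `score`) and the helper `fit_line` are hypothetical and chosen purely for illustration; a straight-line fit is just one possible way to relate an output to an input.

```python
from statistics import mean

def fit_line(x, y):
    """Least-squares slope and intercept for y ~ slope * x + intercept."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

# Hypothetical illustrative data (not from any real study).
hours = [1, 2, 3, 4, 5]        # independent variable: model input
score = [52, 54, 56, 58, 60]   # dependent variable: model output

slope, intercept = fit_line(hours, score)

# The output for a new input is derived from the fitted relationship.
predicted = slope * 6 + intercept
print(slope, intercept, predicted)
```

Here the dependent variable is predicted from the independent one; swapping the two would answer a different question.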
What tasks fall under a Data Analyst's role? ( 3 )
- KPI tracking and Performance Benchmarking
- Reporting Automation and Dashboard Creation
- Business, market and industry analysis
What tasks fall under Data Science? ( 3 )
- Machine learning and predictive modeling
- Statistical Analysis
- Algorithm Optimisation
What are the components of the data ecosystem? ( 5 )
- Sources
- Identification
- Transformation
- Analysis and visualization
- Governance and security
List the steps of the Data Lifecycle ( 8 )
- Generation
- Collection
- Processing
- Storage
- Management
- Analysis
- Visualization
- Interpretation
What does Processing in the Data Lifecycle ensure?
- Raw data is processed and manipulated to be usable and consistent
Where is data stored?
- Databases
- Data Warehouses
What does Visualization in the Data Lifecycle ensure?
- Insights are presented in graphical or visual formats for easier interpretation
How is data interpreted in the Data Lifecycle?
- Results are interpreted to inform decision-making and drive actions
What does Management in the Data Lifecycle ensure?
- Data is organized, maintained, and governed to ensure quality and accessibility
What is a hypothesis?
A statement predicting a relationship between two or more variables for scientific testing
What is the null hypothesis?
A statement that there is no effect or relationship, e.g. that elevated AI usage does not affect critical thinking
What is the alternative hypothesis?
A statement that an effect or relationship exists, e.g. that elevated AI usage negatively affects critical thinking
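One way such a pair of hypotheses could be tested is a one-sided permutation test, sketched below in Python with the standard library only. The scores are made-up illustrative numbers, not real results; the function name `permutation_p_value`, the group names and the 0.05 threshold are assumptions for this sketch (0.05 is a common convention, not something the cards specify).

```python
import random
from statistics import mean

def permutation_p_value(group_a, group_b, n_permutations=2000, seed=0):
    """One-sided permutation test.

    Estimates how often a random relabelling of the pooled scores gives
    a mean difference (a - b) at least as negative as the observed one.
    """
    rng = random.Random(seed)
    observed = mean(group_a) - mean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    count = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        if diff <= observed:
            count += 1
    return count / n_permutations

# Hypothetical critical-thinking scores (fabricated for illustration only).
high_ai_usage = [61, 58, 65, 55, 60, 57]
low_ai_usage = [70, 68, 74, 66, 72, 69]

p_value = permutation_p_value(high_ai_usage, low_ai_usage)

# Assumed significance threshold; reject the null hypothesis if p is below it.
reject_null = p_value < 0.05
print(p_value, reject_null)
```

A small p-value means the observed difference would be rare if the null hypothesis were true, which counts as evidence for the alternative hypothesis.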
What is the difference between qualitative and quantitative variables?
Qualitative variables describe categories; quantitative variables measure numeric values
What are the four variable types?
- Continuous
- Discrete
- Ordinal
- Nominal
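The four types can be illustrated with a small Python sketch. The record, its field names and the satisfaction levels are hypothetical examples; the grouping of continuous and discrete as quantitative, and ordinal and nominal as qualitative, follows the usual textbook convention.

```python
# Hypothetical survey record; field names are illustrative only.
record = {
    "height_cm": 172.4,      # continuous: any value within a range
    "num_siblings": 2,       # discrete: countable whole numbers
    "satisfaction": "high",  # ordinal: categories with a natural order
    "eye_colour": "brown",   # nominal: categories with no order
}

variable_types = {
    "height_cm": "continuous",
    "num_siblings": "discrete",
    "satisfaction": "ordinal",
    "eye_colour": "nominal",
}

QUANTITATIVE = {"continuous", "discrete"}

def is_quantitative(name):
    """True for numeric (quantitative) variables, False for categorical ones."""
    return variable_types[name] in QUANTITATIVE

# Ordinal values can be ranked; nominal values cannot.
SATISFACTION_ORDER = ["low", "medium", "high"]
responses = ["high", "low", "medium", "low"]
sorted_responses = sorted(responses, key=SATISFACTION_ORDER.index)
print(sorted_responses)
```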
What are the three main analytics types? ( 3 )
- Descriptive
- Predictive
- Diagnostic
List the steps of the Analytical Process ( 5 )
- Data Import and Exploration
- Preliminary Data Analysis
- Data Exploration and Visualisation
- Data Wrangling
- Predictive Analytics
What are key preliminary steps in R?
Import data
What is exploratory data analysis (EDA)?
Creating multivariate plots to explore interactions and generate insights from information
What is data wrangling? ( 2 )
Cleaning data and feature engineering
1. Handling unusual or anomalous information
2. Creating new features from existing information
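The two wrangling steps above can be sketched in Python with the standard library. The rows, the validity rule and the choice of median imputation are assumptions made for illustration; other strategies (dropping rows, flagging values) are equally valid, and the same steps could be written with Tidyverse packages in R.

```python
from statistics import median

# Hypothetical raw rows; the values are made up for illustration.
rows = [
    {"height_m": 1.70, "weight_kg": 65.0},
    {"height_m": 1.80, "weight_kg": -1.0},  # anomalous: negative weight
    {"height_m": 1.60, "weight_kg": 55.0},
    {"height_m": 1.75, "weight_kg": None},  # missing value
]

def is_valid_weight(weight):
    """Assumed rule: a weight must be present and positive."""
    return weight is not None and weight > 0

# Step 1: handle unusual or anomalous information.
# Here invalid weights are replaced by the median of the valid ones.
valid_weights = [r["weight_kg"] for r in rows if is_valid_weight(r["weight_kg"])]
fill_value = median(valid_weights)

cleaned = []
for r in rows:
    weight = r["weight_kg"] if is_valid_weight(r["weight_kg"]) else fill_value
    # Step 2: create a new feature (body mass index) from existing columns.
    bmi = round(weight / r["height_m"] ** 2, 1)
    cleaned.append({"height_m": r["height_m"], "weight_kg": weight, "bmi": bmi})

for row in cleaned:
    print(row)
```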
What is the Tidyverse?
A collection of R packages for data science tasks