Chapter 5 Flashcards
(14 cards)
Big data
Refers to datasets that are too large and complex for businesses´existing systems to handle and utilise their traditional capabilities to capture, store, manage and analyse these datasets
Characteristics of Big Data
- volume
- velocity (how fast data is generated)
- variety
- veracity (how accurate is the data)
Data analytics
is the process of evaluating data with the purpose of drawing conclusions, making predictions and driving informed decision-making to address business problems.
An analystics mindset is…
… a way of thinking that centres on the correct use of data and analysis for decisions-making. Includes the ability to:
1. ask the right questions
2. extract, transform and load relevant data
3. apply appropriate data analytic techniques
4. interpret and share the results with stakeholders
Analytic mindset - (1) Ask the right question, SMART objectives
A good analytic question helps establish “SMART” objectives:
S - specific
M - measureable
A - achievable
R - relevant
T - timely
Analytic mindset - (2) Extract, transform and load relevant data (ETL)
The ETL process is often the most time-consuming part
- extracting data
- transforming data
- loading data
ETL-process - Extracting data
Three steps:
- understand data needs and the data available
- perform the data extraction
- verify the data extraction quality and document what you have done
Data lake
a collection of structured (accounting data), semi-structured and unstructured data stored in a single location
ETL-process - Transforming data
Four steps:
1. Understand the data and the desired outcome
2. standardise, structure and clean the data
3. validate data quality and verify the data meets data requirements
4. document the transformation process
ETL-process - Loading data, important considerations
- the transformed data must be stored in a format and structure acceptable to the receiving software
- programs used for analysis may treat some data formats differently than expected. it is important to understnad how the new program will interpret data formats
- once loaded, it is important to update or create a new data dictionary
Analytic mindset - (3) Apply appropriate data analytic techniques, 4 categories
- Descriptive analytic - what happened?
- Diagnostic analytic - why did this happen?
- Predicitve analytic - what might happen in the future?
- prescriptive analytic - what should be done?
Analytic mindset - (4) Interpreting results
- common incorrect interprention is: correlation and causation
- or misinterpretation because of psychology
Data storytelling
the process of often translating complex data analyses into easier to understand terms to enable better decision-making
Data visualisation
the use of graphical representation of data to convey meaning