Big Data Analytics (I) Flashcards
(12 cards)
Multiple of bytes of Big Data
Exabyte (1000^6): Enough rice to fill the West Coast states
Zettabyte (1000^7): Enough rice to fill the pacific ocean
Challenges of Data Explosion
- processing capabilities
- storage
- networking and architecture
What is Big Data?
The data is characterised by 3 attributes: Volume, Variety, Velocity
Characteristics of Big Data 1
Volume: Reflects the size of the data
-> Leveraging large data sets to enable informed decision-making effectively
Characteristics of Big Data 2
Variety: represent the diversity of the data
Structured data + Semi-structured data + Unstructured data (80%)
Characteristics of Big Data 3
Velocity: The speed at which data is generated and used
New age communication channels such as mobile phones, emails, social networking has increased the rate of information flows
NoSQL
-Non-relational database for massive unstructured data
-Scale well horizontally
Hadoop
Open-source data storage and processing platform
What is Hadoop good for?
- Large data sets & cheap scaling
- Fast parallel data processing
- Data from multiple sources/formats
Advantages of Hadoop
- Flexibility
- Scalability
- Cost effectiveness
- Fault tolerance
Data Visualisation
A quick, easy way to convey concepts in a universal manner -> enable decision makers to see analytics presented visually, so they ca grasp difficult concepts or identify new patterns
Big Data and Business
- Accessibility to Data
- Decision Making
- Marketing Trends
- Performance Improvement
- New business models/services