Big Data Flashcards
What is big data
Unstructured data that is so large that it cannot be easily analysed
Characteristics of big data
Volume
Velocity
Variety
What does parallelism mean
As processing big data requires multiple servers multi-processing is used
Why is parallelism made possible
Statelessness meaning the program does not depend on the order functions are called
Immutable data structures, so the function cannot change
Higher-order functions allow functions to be inputted as an argument
What is volume in regards to big data
Big data is too large to be handled
What is velocity in regards to big data
Time it takes to gather a response
What is variety in regards to big data
The fact big data can come in many different forms