Introductory Concepts Flashcards
(9 cards)
What are the so-called ‘four Vs of data’?
Volume, variety, velocity and veracity
One of the four Vs of data, volume, means…
The amount of data
One of the four Vs of data, variety, means…
The forms of data
One of the four Vs of data, velocity, means…
The speed at which data is generated and processed
One of the four Vs of data, veracity, means…
The uncertainty of data
What is unstructured data?
Data that is hard to read using autonomous methods, having no structure and being in a non-traditional format - i.e. an email
What is semi-structured data?
Data that is not in a traditional format, but has some structure to it that may make mining easier - i.e. a website’s database
What is structured data?
Data that adheres to a data model, conforming to a tabular format with a relationship between the different rows and columns
Why do we hope for data to be structured when dealing with data?
It makes it easier to contextualise and understand the data