2.5 Databases and distributed systems Flashcards
(33 cards)
What is normalisation?
A staged process which removes repeated groups of data and inconsistencies.
What is the difference between a primary and a foreign key?
A primary key is unique and used to identify a record whereas a foreign key is a field of one table which is also the primary key of another table (links tables together).
What is the difference between a flat file and a relational database?
A flat file database is one table which holds all data however a relational database is a large, organised collection of data items stored in tables which link to other tables and can be accessed in many ways throughout the program.
What are 5 advantages of relational databases?
Data Consistency
No Data Redundancy
Data Integrity
Data Independence
Better Security
What is Data Consistency?
Data Consistency is the relationship between the input data, the processed data and the output data as well as other related data.
Changes made to data is controlled so that the specific data value is consistent throughout the whole database.
What is Data Redundancy?
Occurs when duplicate data is available in multiple tables. This wastes storage and could potentially compromise data consistency.
What is Data Integrity?
It is the correctness of data e.g. the extent to which it truthfully represents the original information.
What is Data Independence?
Refers to the separation of information from the applications and programs that use it. New systems/programs can still develop/use this data without having to change it or modify the source code.
What are three examples of security in a relational database?
Hierarchy of passwords - Limits users to certain parts of the program.
Access rights to parts of the program - Only certain users can access and change data.
Data is stored separately from the program.
How can products in a supermarket have a One to One entity relationship to a barcode?
Every product in the supermarket will have a unique barcode number.
How can a member of a library have a One to Many entity relationship with the books?
A member of a library can take home multiple books therefore, all the books will have the same, temporary owner.
How can teachers have a Many to Many entity relationship with pupils in a school?
Teachers can teach many pupils across the day however, the pupil will also be taught by many teachers throughout their day.
How can many-to-many relationships be improved?
They can be replaced by a one-many-one relationship.
What is Big Data?
Data which involves very large, complex datasets therefore traditional databases are unable to process them under acceptable time frames.
What are four examples of Big Data?
Transactional data from ecommerce.
Machine data from GPS.
RFID Readers.
Social Media interactions.
What is a Data Warehouse?
A large collection of archived data used for decision making.
Name six advantages of Data Warehousing which will benefit an organisation.
Allows an organisation to:
- Store information about every sale
- See who has bought items and when
- Use data mining
- Find the most popular product
- Target customers with special offers
- Plan future changes/developments within the business
What is Data Mining?
The retrieval and analysis of large sets of data in data warehouses to identify trends and patterns e.g. market opportunities and health trends.
What are 5 advantages of Data Mining which benefit an organisation?
Allows an organisation to create:
- A list of customers who are likely to buy a certain product
- Comparisons with competitors
- Predictions for future sales
- Analysis of best sites for shops
- Analysis of sales patterns
What is Predictive Analysis?
A sub-set of data mining which is used to make predictions about future events, based on individual historic behaviour e.g. weather, insurance risk assessments and targeted marketing.
How is predictive analysis implemented?
Predictive analysis assigns a probability for the likelihood that something, such as a customer, will behave in a certain way.
How could a travel company receive data to undergo predictive analysis?
Comments on social media pages.
Sensors in smart devices such as location services.
Online questionnaires/satisfaction surveys.
What is a distributed database?
Data is not stored in one location, but spread out across a network. The user would not be aware of this, it would still just appear as a large database.
What are 6 advantages of distributed databases?
- Non-dependent on a central store of data.
- Reduces network traffic.
- If one server fails, the entire system does not go down.
- Easy to backup and copy from one server to another.
- If data is lost on the central site, it can be recovered from another local site.
- New locations can be added to a database without the need to rewrite the entire database.