Chapter 5 Flashcards
Data and Knowledge management (39 cards)
Database
a collection of related data files or tables that contain data.
What are the difficulties in managing data?
1) The amount of data is increasing exponentially
2) Data is scattered throughout organizations
3) Data is generated from multiple sources
4) New data from social media sites has to be managed
5) Data is subject to data rot
6) Data security, quality and integrity are critical
7) Government regulations must be followed
Data governance
an approach to managing information across an entire organization
Master data management
process that spans all of an organization’s business processes and applications.
Master data
set of core data, such as customer, product, employee, vendor, geographic location, and so on.
Transactional data
generated and captured by operational systems, describe the business’s activities, or transactions
data file
collection of logically related records
What problems do database systems minimize?
Data Redundancy
Data isolation
Data inconsistency
What benefits do database systems maximize?
Data security
Data integrity
Data independence
Database management system (DBMS)
set of programs that provide users with tools to create and manage a database.
Data model
diagram that represents entities in the database and their relationships
Entity
person, place, thing, or event, such as a customer, an employee, or a product
Instance
refers to each row in a relational table.
Attributes
Each characteristic or quality of a particular entity. For example, if the entities were a customer, an employee, and a product, the entity attributes would include customer name, employee number, and product colour.
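The entity, instance, and attribute cards above can be sketched in Python. This is only an illustration: the `Customer` class, its field names, and the sample values are assumptions, not part of the chapter.

```python
from dataclasses import dataclass, fields


@dataclass
class Customer:
    # Each field is one attribute of the Customer entity (hypothetical names).
    customer_id: int
    customer_name: str


# Each object is one instance, which corresponds to one row in a relational table.
customers = [
    Customer(1, "Ada"),
    Customer(2, "Grace"),
]

# The attribute names play the role of column names in the table.
attribute_names = [f.name for f in fields(Customer)]
print(attribute_names)
```

Here the class stands in for the entity, each field for an attribute, and each object in the list for an instance.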
Primary key
the identifier field or attribute
secondary key
another field that has some identifying information but typically does not identify the record with complete accuracy
foreign key
a field or group of fields in one table that uniquely identifies a row of another table. A foreign key is used to establish and enforce a link between two tables.
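A minimal sketch of how a foreign key links two tables, using Python's built-in sqlite3 module. The `customer` and `purchase` tables and their columns are hypothetical examples, and the PRAGMA line is specific to SQLite, which leaves foreign key enforcement off by default.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this setting is switched on.
conn.execute("PRAGMA foreign_keys = ON")

conn.execute(
    "CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT NOT NULL)"
)
conn.execute(
    """
    CREATE TABLE purchase (
        purchase_id INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customer(customer_id),
        item TEXT NOT NULL
    )
    """
)

conn.execute("INSERT INTO customer VALUES (1, 'Ada')")
conn.execute("INSERT INTO purchase VALUES (100, 1, 'laptop')")

# A purchase that points at a customer who does not exist should be rejected.
try:
    conn.execute("INSERT INTO purchase VALUES (101, 999, 'phone')")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

# The foreign key is also what lets the two tables be joined.
rows = conn.execute(
    "SELECT c.name, p.item FROM purchase p "
    "JOIN customer c ON c.customer_id = p.customer_id"
).fetchall()
print(orphan_rejected, rows)
```

The rejected insert shows the "enforce" part of the definition, and the join shows the "establish a link" part.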
Structured data
highly organized data stored in fixed fields in a data repository such as a relational database.
Unstructured data collection
Data that do not reside in a traditional relational database.
What is the purpose of a primary key in a relational database?
To uniquely identify a record in the database.
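As a rough illustration of unique identification, the sketch below uses Python's built-in sqlite3 module. The `employee` table, its columns, and the sample rows are assumptions made for the example. It also contrasts the primary key with a name field acting as a secondary key, which can match more than one record.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee (employee_number INTEGER PRIMARY KEY, last_name TEXT)"
)
conn.execute("INSERT INTO employee VALUES (7, 'Lee')")
conn.execute("INSERT INTO employee VALUES (8, 'Lee')")

# Reusing an existing primary key value should be rejected by the database.
try:
    conn.execute("INSERT INTO employee VALUES (7, 'Park')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

# A lookup by primary key matches at most one record.
by_primary = conn.execute(
    "SELECT employee_number FROM employee WHERE employee_number = ?", (7,)
).fetchall()

# A lookup by a secondary key (last name) may match several records.
by_secondary = conn.execute(
    "SELECT employee_number FROM employee WHERE last_name = ? "
    "ORDER BY employee_number",
    ("Lee",),
).fetchall()
print(duplicate_rejected, by_primary, by_secondary)
```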
Big data
collection of data that is so large and complex that it is difficult to manage using traditional database management systems.
What does Big Data generally consist of?
Traditional enterprise data
machine-generated/sensor data
social data
images captured by billions of devices
What are the characteristics of Big Data?
Volume
Velocity
Variety
What is the primary feature of NoSQL databases that makes them particularly useful when working with Big Data?
They can process both structured and unstructured data.
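To make the idea concrete, here is a loose sketch of records with different shapes sitting in one collection. It uses plain Python dictionaries as a stand-in for a document store. The `collection` list and the `find` helper are hypothetical and do not correspond to the API of any real NoSQL product.

```python
import json

# Hypothetical in-memory "collection": a stand-in for a document store.
collection = [
    # Structured record with fixed, typed fields.
    {"id": 1, "type": "order", "customer": "Ada", "total": 120.50},
    # Free-text social data with no fixed schema.
    {"id": 2, "type": "post", "text": "Loving the new laptop!", "tags": ["review"]},
    # Machine-generated sensor data.
    {"id": 3, "type": "sensor", "readings": [21.4, 21.9]},
]


def find(docs, **criteria):
    """Return the documents whose fields match every given criterion."""
    return [d for d in docs if all(d.get(k) == v for k, v in criteria.items())]


posts = find(collection, type="post")
serialized = json.dumps(collection)  # all three shapes serialize the same way
print(posts)
```

No table definition is needed before storing a record, which is the contrast with the fixed fields of a relational table.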