lecture 8 - data ethics and intro to big data Flashcards
locked inπ (33 cards)
data privacy definition:
The rights of individuals and organisations to determine access to data about themselves.
what is data privacy protected by:
- privacy act 1989
- ACS privacy policy
Data Governance Definition
Data governance is the overall framework outlining roles, processes and policies for data management.
- the creation, collection, storage, use, protection and disposal of data
Benefits of data governance
- enhanced operational efficiency
- enhanced decision making
- increased competitive advantage ( transparency + public trust )
- Supports regulatory and compliance management
- fosters culture of accountability around data management
data ethics
Data ethics applies those moral principles specifically to the use, sharing, and management of data.
data ethics governs how data is:
- collected
- stored
- analysed
- shared
principles of data ethics:
- privacy
- ownership
- transparency
- intention
- outcomes
professional ethics
- Govern behavior in professional roles.
- Examples: ACM Code of Ethics, BCS Code of Conduct.
consequentialism
Focuses on outcomes; an action is right if it leads to good consequences.
deontology
Emphasises duties and rules regardless of outcomes.
care ethics
Will it create good relationships?
virtue ethics
Centres on character and moral virtues rather than rules or outcomes.
DATA CONTROL LANGUAGE commands for managing access
- grant
- revoke
- deny
grant command
is used to give user
access privileges to a
database
revoke comand
is used to revoke
authorisation / take back permissions
deny
NOT IMPLEMENTED IN ORACLE SQL
explicitly prevents a
user from receiving a
particular permission
common privileges in dcl
- select
- insert
- update
- delete
- all
- execute ( for pl/sql procedural language )
DCL command - Revoke:
Used to revoke authorisation, i.e., take
back permissions from the user
Big Data
Refers to large and diverse collection of unstructured, structured and semi-structured data that continues to grow exponentially over time.
relational model vs big data model
relational scehema: data structure is defines before data is written
big data schema: data structure is applied when data is read / queried
3Vs MUST be present in big data are:
Volume
Velocity
Variety
Volume
The quantity of data to be stored
Velocity
Speed at which data is entering the system
Variety
Variations in structure of the data to be stored.