Beyond Relational Databases Flashcards

(22 cards)

1
Q

What is an ETL process?

A

Extract -> Transform -> Load

Obtain data dump from the source system

Add value, initial data modelling, reconciliation, (pseud/an)onymisation

Import data into warehouse database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a data warehouse?

A

A data management system consisting of a series of hard drives that store, and enable analysis and reporting of, large amounts of data

OLAP database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an example of a data warehouse?

A

Accumulation of data from multiple departments or hospitals to be able to analyse and find trends, e.g. PACS, dose management software

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two types of databases?

A
  1. OLTP
  2. OLAP
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is OLTP?

A

On-Line Transaction Processing

Normalised

  • Database used for ‘line of business’ applications
  • Standard relational database
  • Normalised
  • ACID compliant
  • Minimise redundancy
  • Each piece of information only stored once
  • Quick to work on and update multiple pieces of information simultaneoulsy
  • General structure of operating systems
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is OLAP?

A

On-Line Analytical Processing

Denormalised

  • Database used for ‘data warehousing’
  • Not necessarily relational database
  • De-normalised
  • Not necessarily ACID compliant
  • Optimised for reporting
  • Orientated around FACTS of interest
  • Designed so FACTS can be updated quickly
  • DIMENSIONS change only slowly
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is ACID?

A

Atomicity
Consistency
Isolation
Durability

Updates to databases based around concept of transactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are FACTS?

A
  • Something we can measure e.g. 1cGy delivered to target
  • Something that happened e.g. patient died
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are DIMENSIONS?

A
  • Information about the fact
    e.g. patient information: ID, DOB, Sex
    planning protocol, doctor, date of treatment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is granularity?

A

When something should be recorded e.g. in radiotherapy treatment, each episode, each day, each control point, each plan, each beam

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the data types?

A

Structured
Unstructured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a structured data type?

A
  • Numbers, results of calculations
  • Codified text
  • Anything standardised
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an unstructured data type?

A
  • Free text
  • Hand written/dictated documents
  • Images
  • Tumour/OAR outlines
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a database?

A

A large amount of information stored in a computer system in such a way that it can be easily looked at or changed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a flat file database?

A

A straightforward list (single table) containing all the data in one place

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the disadvantages of a flat file database?

A

Increased chance of duplication
Potential for non unique records
Harder to update
Inefficient use of memory
Poor for complex enquiries
Poor at limiting access

17
Q

What is a relational database?

A

Data is organised into multiple tables with the records within the tables linked (related) to records in other tables by way of common fields

18
Q

What is the advantage of a relational database?

A

Data only stored once
Easier modification
Efficient storage
Capable of complex enquiries
More secure - can limit access to certain parts of data

19
Q

What is a database engine? Give an example of a database engine software

A

Underlying piece of software that performs all the reading/creating/updating/deleting of the database

Microsoft access

20
Q

What is a primary key?

A

One field needs to be selected to identify the unique nature of each record in each table - use of an ID number for each entry

Unique identifier acting as a reference when relating any of the record’s fields to another table/query

21
Q

What is a foreign key?

A

A primary key from one table which is also in a related table

22
Q

What is a common field in a database?

A

A field that appears in more than one table in a database