Beyond Relational Databases Flashcards

1
Q

What is a database?

A

A large amount of information stored in a computer system in such a way that it can be easily looked at or changed.

2
Q

What are the two types of ‘traditional’ database?

A

OLTP: on-line transaction processing
OLAP: on-line analytical processing

3
Q

What is the purpose of a database?

A

To organise and store information in a structured way, making it easily accessible for retrieval and analysis.

4
Q

What are OLTP databases used for?

A

Line of business applications

5
Q

What are OLAP databases used for?

A

Data warehousing

6
Q

Are OLTP and OLAP standard relational databases?

A

OLTP: yes
OLAP: sometimes

7
Q

Are OLTP and OLAP databases normalised?

A

OLTP: normalised
OLAP: denormalised

8
Q

Are OLTP and OLAP databases ACID compliant?

A

OLTP: yes
OLAP: not necessarily

9
Q

What does ACID mean in relation to databases?

A

Atomicity: each transaction is treated as a single, indivisible unit.
Consistency: data must adhere to all defined constraints and rules, ensuring data accuracy and integrity.
Isolation: concurrent transactions do not interfere with each other.
Durability: once a transaction is committed, the changes are permanently stored and survive system failures or crashes.

These properties ensure the reliable processing of transactions.
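
As a rough illustration of atomicity and consistency, the sketch below uses Python's built-in sqlite3 module. The account table, the CHECK constraint, and the transfer function are hypothetical names invented for this example, not part of the original card.

```python
import sqlite3

# Hypothetical table for illustration only; the CHECK constraint is the
# "defined rule" that the Consistency property must protect.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account ("
    "id INTEGER PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO account VALUES (?, ?)", [(1, 100), (2, 50)])
conn.commit()


def transfer(conn, src, dst, amount):
    """Move amount from src to dst as a single, indivisible transaction."""
    try:
        # "with conn" commits if the block succeeds and rolls back if it raises.
        with conn:
            # Credit first, so a failed debit leaves a partial change to undo.
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE id = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The debit broke the CHECK constraint, so the credit was rolled back too.
        return False


ok = transfer(conn, 1, 2, 30)       # valid transfer
failed = transfer(conn, 1, 2, 500)  # would make account 1 negative
balances = dict(conn.execute("SELECT id, balance FROM account"))
```

After both calls, only the valid transfer has taken effect: the rejected one leaves no partial credit behind.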

10
Q

How do OLTP databases operate?

A

Redundancy is minimised because each piece of information is stored only once (normalised), which makes it quick to work on and update multiple pieces of information simultaneously. This is the general structure of an operational system.
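
A minimal sketch of why storing each item once helps, using Python's built-in sqlite3 module. The patient and appointment tables and their contents are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Hypothetical normalised schema: the patient's name lives in one row only.
CREATE TABLE patient (
    patient_id INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);
CREATE TABLE appointment (
    appointment_id INTEGER PRIMARY KEY,
    patient_id INTEGER NOT NULL REFERENCES patient(patient_id),
    slot TEXT NOT NULL
);
INSERT INTO patient VALUES (1, 'A. Smith');
INSERT INTO appointment VALUES
    (10, 1, '2024-01-08 09:00'),
    (11, 1, '2024-01-15 09:00');
""")

# One UPDATE corrects the name everywhere, because appointments only
# reference the patient row rather than copying the name.
conn.execute("UPDATE patient SET name = 'A. Smythe' WHERE patient_id = 1")

rows = conn.execute("""
    SELECT a.appointment_id, p.name
    FROM appointment AS a
    JOIN patient AS p USING (patient_id)
    ORDER BY a.appointment_id
""").fetchall()
```

In a denormalised layout the name would be repeated on every appointment row, and each copy would need updating.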

11
Q

How do OLAP databases operate?

A

They are optimised for reporting and are oriented around FACTS of interest. These FACTS can be updated quickly, but the DIMENSIONS of the database change slowly.

12
Q

What is a FACT with regard to databases?

A

Something we can measure (e.g. 1 cGy was delivered to a target) or something that happened (e.g. a patient died).

13
Q

What is a DIMENSION with regard to databases?

A

Information about a FACT (e.g. patient information, planning protocol, date of treatment).
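
One common way to arrange FACTS and DIMENSIONS is a star schema: a central fact table whose rows point at dimension tables. The sketch below, using Python's built-in sqlite3 module, assumes a hypothetical schema (dim_patient, dim_technique, fact_session) and made-up values; it is not taken from any real radiotherapy system.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: descriptive context about each fact.
CREATE TABLE dim_patient (patient_key INTEGER PRIMARY KEY, patient_id TEXT);
CREATE TABLE dim_technique (technique_key INTEGER PRIMARY KEY, name TEXT);

-- Fact table: one row per delivered session, holding the measured value.
CREATE TABLE fact_session (
    patient_key INTEGER REFERENCES dim_patient(patient_key),
    technique_key INTEGER REFERENCES dim_technique(technique_key),
    treatment_date TEXT,
    dose_cgy REAL
);

INSERT INTO dim_patient VALUES (1, 'P001'), (2, 'P002');
INSERT INTO dim_technique VALUES (1, 'VMAT'), (2, 'Conformal');
INSERT INTO fact_session VALUES
    (1, 1, '2024-01-08', 200.0),
    (1, 1, '2024-01-09', 200.0),
    (2, 2, '2024-01-08', 300.0);
""")

# A typical reporting query: aggregate the facts, grouped by a dimension.
dose_by_technique = conn.execute("""
    SELECT t.name, COUNT(*), SUM(f.dose_cgy)
    FROM fact_session AS f
    JOIN dim_technique AS t USING (technique_key)
    GROUP BY t.name
    ORDER BY t.name
""").fetchall()
```

Here dose_by_technique holds one row per technique with the number of sessions and the total dose delivered.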

14
Q

What are the FACTS in radiotherapy databases?

A
  • CT series dose report generated
  • CT slice acquired
  • Plan prepared
  • Treatment history recorded
  • Treatment sessions delivered
  • Images acquired
15
Q

What are the DIMENSIONS in radiotherapy databases?

A
  • Patient
  • Date
  • Time of day
  • Equipment
  • Operator
  • Modality
  • Scan protocol
  • Slice details
  • Site
  • Planning technique
  • Mould room technique
  • Treatment technique
  • Course
  • Patient ID
16
Q

Describe the general structure of a database

A

ETL = extract, transform, load

17
Q

Describe the structure of an imaging dose database

18
Q

What is ETL?

A

Extract, transform, load. It is the process used to collect, clean, and prepare data from multiple sources for analysis and storage in a data warehouse or other target system.

19
Q

How often is the ETL process run?

A

Nightly (typically)

20
Q

Describe each stage of the ETL process

A

Extract: data dump is obtained from the source system
Transform: value is added; the initial data is modelled and reconciled, and pseudonymised or anonymised
Load: import data into warehouse database
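
The three stages can be sketched in a few lines of Python using only the standard library. Everything here is an assumption made for illustration: the inline CSV dump, the column names, the fact_session table, and the salted-hash pseudonymisation, which is a simplification and not a vetted scheme.

```python
import csv
import hashlib
import io
import sqlite3

# Hypothetical data dump from a source system, inlined to keep the sketch
# self-contained. The last row has a missing dose.
SOURCE_DUMP = """patient_id,treatment_date,dose_cgy
P001,2024-01-08,200
P001,2024-01-09,200
P002,2024-01-08,
"""


def extract(dump):
    """Extract: read the raw dump into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(dump)))


def transform(rows, salt="demo-salt"):
    """Transform: drop incomplete rows, convert types, pseudonymise IDs."""
    records = []
    for row in rows:
        if not row["dose_cgy"]:
            continue  # reconciliation reduced here to skipping missing doses
        digest = hashlib.sha256((salt + row["patient_id"]).encode()).hexdigest()
        records.append((digest[:12], row["treatment_date"], float(row["dose_cgy"])))
    return records


def load(conn, records):
    """Load: import the cleaned records into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_session ("
        "patient_pseudo_id TEXT, treatment_date TEXT, dose_cgy REAL)"
    )
    with conn:  # commit all rows together, or none if an insert fails
        conn.executemany("INSERT INTO fact_session VALUES (?, ?, ?)", records)


warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(SOURCE_DUMP)))
loaded = warehouse.execute(
    "SELECT COUNT(*), SUM(dose_cgy) FROM fact_session"
).fetchone()
```

In practice a job like this would be scheduled, and the extract step would read from the source system rather than from an inline string.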

21
Q

What are the two data types?

A

Structured data (anything standardised)
Unstructured data

22
Q

Give 3 examples of structured data

A
  • Numbers
  • Results of calculations
  • Codified text
23
Q

Give 4 examples of unstructured data

A
  • Free text
  • Handwritten/dictated documents
  • Images
  • Tumour/OAR outlines
24
Q

Give 5 examples of data analysis tools

A
  • Straight SQL queries (Excel, Access, MATLAB)
  • Business reporting tools
  • Data exploration tools
  • ‘Big Data’ tools
  • Data mining tools
25
Q

What is data virtualisation?

A

A technology that provides a unified view of data from various sources without physically moving or replicating it, offering real-time access to data as if it were in a single location.
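
As a toy stand-in for the idea, the sketch below attaches two separate SQLite databases to one connection and defines a view across them, so a single query reads both sources in place. Real data virtualisation products connect to heterogeneous remote systems; the planning and imaging databases, their tables, and their contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Two independent in-memory databases stand in for two source systems.
conn.execute("ATTACH DATABASE ':memory:' AS planning")
conn.execute("ATTACH DATABASE ':memory:' AS imaging")

conn.executescript("""
CREATE TABLE planning.plan (patient_id TEXT, technique TEXT);
CREATE TABLE imaging.scan (patient_id TEXT, modality TEXT);
INSERT INTO planning.plan VALUES ('P001', 'VMAT');
INSERT INTO imaging.scan VALUES ('P001', 'CT');

-- A TEMP view may span attached databases. It stores no data itself:
-- each query against it reads the underlying tables where they are.
CREATE TEMP VIEW patient_overview AS
    SELECT p.patient_id, p.technique, s.modality
    FROM planning.plan AS p
    JOIN imaging.scan AS s USING (patient_id);
""")

overview = conn.execute("SELECT * FROM patient_overview").fetchall()

# A later change in a source is visible through the view immediately.
conn.execute("INSERT INTO imaging.scan VALUES ('P001', 'MR')")
updated = conn.execute(
    "SELECT modality FROM patient_overview ORDER BY modality"
).fetchall()
```
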
26
Q

Give an example of ‘Big Data’

A

The NHS cancer registry data collection