Mass Storage Structure Flashcards

1
Q

how is a disk structured?

A
  • a 1d array of logical blocks (a block is the smallest unit of transfer)
  • data is addressed with logical block number
2
Q

What is CLV and CAV?

A

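Constant linear velocity (CLV):
- bit density is the same on every track, so the rotation speed changes with the head's location on the disk
Constant angular velocity (CAV):
- rotation speed stays constant, so the data density varies between inner and outer tracks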

3
Q

what makes a good disk?

A
  • bandwidth
  • access time
  • reliability
  • power
  • cost
  • capacity
4
Q

what is FCFS scheduling in storage? pros and cons

A
  • first come, first served: requests are serviced in the order they arrive
  • simple, fair (no starvation)
  • may involve excessive head movement because we don't take the positions of other pending requests into account
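
A minimal sketch of how FCFS head movement can be counted, assuming requests are plain cylinder numbers. The function name `fcfs`, the queue values and the starting head position are illustrative and not part of the card.

```python
def fcfs(requests, head):
    """Service requests in arrival order; return (service order, total head movement)."""
    total = 0
    for cylinder in requests:
        total += abs(cylinder - head)  # distance the head travels for this request
        head = cylinder
    return list(requests), total


queue = [98, 183, 37, 122, 14, 124, 65, 67]  # illustrative cylinder numbers
order, moved = fcfs(queue, head=53)
print(order, moved)  # total movement here is 640 cylinders
```

The head swings back and forth (for example 183 to 37 and then back to 122), which is the excessive movement the card mentions.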
5
Q

what is SSTF scheduling in storage? pros and cons

A
  • shortest seek time first
  • the next request serviced is the one with the shortest seek time
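  • we try to service all requests in a local area before moving
  • we know the pending requests, so their seek times can be compared directly (unlike SJF process scheduling, where job lengths have to be estimated)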
  • can cause starvation
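
A small sketch of SSTF under the same simplification (seek time taken as cylinder distance). The name `sstf` and the sample queue are assumptions for illustration.

```python
def sstf(requests, head):
    """Always service the pending request closest to the current head position."""
    pending = list(requests)
    order, total = [], 0
    while pending:
        nearest = min(pending, key=lambda c: abs(c - head))
        pending.remove(nearest)
        total += abs(nearest - head)
        head = nearest
        order.append(nearest)
    return order, total


queue = [98, 183, 37, 122, 14, 124, 65, 67]  # illustrative cylinder numbers
order, moved = sstf(queue, head=53)
print(order, moved)  # [65, 67, 37, 14, 98, 122, 124, 183] 236
```

Starvation shows up if new requests keep arriving near the head: a far-away request such as 183 is never the nearest, so it keeps getting pushed back.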
6
Q

what is SCAN scheduling in storage? pros and cons

A
  • head continually scans the disk from one end to the other and back (elevator algorithm)
  • requests are serviced as the head passes that area
  • fair
  • no starvation
  • the delays are non-uniform (requests may arrive just in time or just too late)
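
A rough sketch of one SCAN pass, assuming cylinders are numbered 0 to `max_cylinder` and that the head always travels to the edge of the disk before reversing. The function name, parameters and queue are hypothetical.

```python
def scan(requests, head, max_cylinder, direction="down"):
    """Return (service order, total head movement) for one sweep and its return sweep."""
    below = sorted((c for c in requests if c < head), reverse=True)
    above = sorted(c for c in requests if c >= head)
    if direction == "down":
        # Sweep to cylinder 0, then reverse and go up as far as the last request.
        order = below + above
        total = head + (above[-1] if above else 0)
    else:
        # Sweep to the top edge, then reverse and go down as far as the last request.
        order = above + below
        total = (max_cylinder - head) + (max_cylinder - below[-1] if below else 0)
    return order, total


queue = [98, 183, 37, 122, 14, 124, 65, 67]  # illustrative cylinder numbers
print(scan(queue, head=53, max_cylinder=199, direction="down"))
# ([37, 14, 65, 67, 98, 122, 124, 183], 236)
```

A request for cylinder 60 that arrives just after the head has passed it on the way down has to wait for the whole turn-around, which is the non-uniform delay noted above.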
7
Q

what is C-SCAN scheduling in storage? pros and cons

A
  • a variant on SCAN
  • return to the start immediately on reaching the end rather than sweeping back like the elevator algorithm, so the head moves more like a circle
  • better than SCAN because waiting times are more uniform: there’s no favouring of blocks in the middle
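
A minimal sketch of the C-SCAN service order, assuming the head sweeps upward and then jumps back to the lowest cylinder. Only the order is computed, because whether the return jump counts as head movement is a matter of convention. The name `c_scan` and the queue are illustrative.

```python
def c_scan(requests, head):
    """Service everything at or above the head going up, then wrap around to the start."""
    ahead = sorted(c for c in requests if c >= head)
    wrapped = sorted(c for c in requests if c < head)  # served after the jump back
    return ahead + wrapped


queue = [98, 183, 37, 122, 14, 124, 65, 67]  # illustrative cylinder numbers
print(c_scan(queue, head=53))  # [65, 67, 98, 122, 124, 183, 14, 37]
```

Every request is served in a single direction, so a block near the edge waits about as long as a block in the middle.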
8
Q

what is LOOK/C-LOOK scheduling? pros and cons

A
  • like SCAN / C-SCAN, but the head only goes as far as the final request in each direction (LOOK reverses there, C-LOOK jumps back to the first request at the other end)
  • better because there’s no reason to go to areas where there are no requests
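
A sketch of LOOK and C-LOOK, assuming the head starts by moving upward. The helper `travel` simply sums the distances between consecutive stops, so for C-LOOK it counts the jump back as movement, which is one convention among several. All names and values are illustrative.

```python
def travel(order, head):
    """Total distance covered when visiting the cylinders in the given order."""
    total = 0
    for cylinder in order:
        total += abs(cylinder - head)
        head = cylinder
    return total


def look(requests, head):
    """Go up to the highest request, then reverse and go down to the lowest."""
    up = sorted(c for c in requests if c >= head)
    down = sorted((c for c in requests if c < head), reverse=True)
    return up + down


def c_look(requests, head):
    """Go up to the highest request, jump to the lowest request, then go up again."""
    up = sorted(c for c in requests if c >= head)
    wrapped = sorted(c for c in requests if c < head)
    return up + wrapped


queue = [98, 183, 37, 122, 14, 124, 65, 67]  # illustrative cylinder numbers
print(look(queue, 53), travel(look(queue, 53), 53))      # ends ... 183, 37, 14; travel 299
print(c_look(queue, 53), travel(c_look(queue, 53), 53))  # ends ... 183, 14, 37; travel 322
```

Neither variant visits cylinder 199 or cylinder 0, because no request lives there.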
9
Q

what scheduling do SSDs commonly use? why?

A

FCFS because they don’t have a head so they don’t need to minimise head movement

10
Q

how do we get deduplication?

A

using cryptographic hashing. compare the hashes of two files to see if they’re the same. if they’re the same, only store the data once
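
A minimal sketch of hash-based deduplication, assuming whole files are hashed with SHA-256 and that equal digests are treated as equal content. The `DedupStore` class and the file names are hypothetical.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store: identical contents are kept only once."""

    def __init__(self):
        self.blobs = {}  # digest -> contents (stored once)
        self.names = {}  # file name -> digest

    def put(self, name, data):
        digest = hashlib.sha256(data).hexdigest()
        self.blobs.setdefault(digest, data)  # no new copy if the digest is already known
        self.names[name] = digest
        return digest

    def get(self, name):
        return self.blobs[self.names[name]]


store = DedupStore()
store.put("a.txt", b"same contents")
store.put("b.txt", b"same contents")
store.put("c.txt", b"different")
print(len(store.names), len(store.blobs))  # 3 2
```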

11
Q

how do SSDs prevent wear?

A

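SSDs have a microcontroller that keeps its own logical-to-physical address mapping. logical addr. 1 isn’t always the same physical location, so repeated writes get spread across the flash instead of wearing out one spot (wear levelling)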
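
A toy sketch of the remapping idea, not how any particular SSD controller works. It assumes every write to a logical address goes to the least-erased free physical block and that the old copy is erased straight away. The `ToyFTL` class and its fields are hypothetical.

```python
class ToyFTL:
    """Toy logical-to-physical mapping that spreads writes across blocks."""

    def __init__(self, n_physical):
        self.erase_counts = [0] * n_physical
        self.data = [None] * n_physical
        self.free = set(range(n_physical))
        self.mapping = {}  # logical address -> physical block

    def write(self, logical, value):
        # Pick the free block with the fewest erases (lowest index breaks ties).
        target = min(self.free, key=lambda p: (self.erase_counts[p], p))
        self.free.remove(target)
        old = self.mapping.get(logical)
        if old is not None:
            # Simplification: erase the stale copy immediately and reuse it later.
            self.erase_counts[old] += 1
            self.data[old] = None
            self.free.add(old)
        self.data[target] = value
        self.mapping[logical] = target
        return target

    def read(self, logical):
        return self.data[self.mapping[logical]]


ftl = ToyFTL(4)
targets = [ftl.write(1, value) for value in "abcd"]
print(targets, ftl.read(1))  # [0, 1, 2, 3] d
```

Four writes to logical address 1 land on four different physical blocks, so no single block takes all the wear.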

12
Q

how do we prevent unnecessary writing when hashing? (snapshotting)

A

adding anything to a file will drastically change the resulting hash, so with a whole-file hash the whole file looks new

can use blocking: split the file into fixed-length blocks and hash those individually. requires more computational power and doesn’t deal with all cases (adding to the beginning of the file shifts every block boundary, so every block hash changes)

or use variable-length blocks, with a rolling hash deciding where each block ends. this is difficult to parallelise
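
A rough sketch comparing the two blocking schemes after one byte is added to the start of a file. The rolling hash here is just the sum of the last `window` bytes, a toy stand-in for a real rolling hash such as a Rabin fingerprint, and there is no minimum block size. Function names, parameters and test data are all assumptions.

```python
import hashlib
import random


def fixed_blocks(data, size=8):
    """Cut every `size` bytes regardless of content."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def content_defined_blocks(data, window=4, mask=0x0F):
    """Cut after any byte where the sum of the last `window` bytes has its low bits all zero."""
    blocks, start, rolling = [], 0, 0
    for i, byte in enumerate(data):
        rolling += byte
        if i >= window:
            rolling -= data[i - window]  # slide the window forward by one byte
        if i >= window - 1 and (rolling & mask) == 0:
            blocks.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        blocks.append(data[start:])
    return blocks


def digests(blocks):
    return {hashlib.sha256(block).hexdigest() for block in blocks}


rng = random.Random(0)
original = bytes(rng.randrange(256) for _ in range(512))
edited = b"X" + original  # add one byte at the beginning

for name, split in [("fixed", fixed_blocks), ("content-defined", content_defined_blocks)]:
    before, after = digests(split(original)), digests(split(edited))
    print(name, "block hashes reused:", len(before & after), "of", len(before))
```

With content-defined cuts the boundaries depend only on nearby bytes, so every block after the first one keeps its hash and does not need to be written again.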

13
Q

What is the RAID structure?

A

Redundant Array of Inexpensive (or Independent) Disks
many disks attached to a computer system.
improves reliability through redundancy (mirrored copies or parity) and performance through striping

14
Q

what is mirroring?

A

a single logical disk consists of two mirrored physical disks, with every write carried out on both disks. this can up to double read throughput because reads can be served from either copy.

15
Q

what is data striping?

A

the data is fragmented and each fragment is written to a different disk

16
Q

what is RAID 0?

A

block level striping
allows reading in parallel
if one disk dies then we’ve lost everything
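
A minimal sketch of block-level striping, assuming logical blocks are dealt out round-robin across the disks. The names `stripe` and `locate` are illustrative.

```python
def stripe(blocks, n_disks):
    """Distribute logical blocks round-robin across n_disks."""
    disks = [[] for _ in range(n_disks)]
    for index, block in enumerate(blocks):
        disks[index % n_disks].append(block)
    return disks


def locate(logical_block, n_disks):
    """Return (disk number, offset on that disk) for a logical block number."""
    return logical_block % n_disks, logical_block // n_disks


print(stripe(list("ABCDEF"), 3))  # [['A', 'D'], ['B', 'E'], ['C', 'F']]
print(locate(4, 3))               # (1, 1): block 'E' is the second block on disk 1
```

Blocks A, B and C sit on different disks and can be read in parallel, but losing any one disk removes a third of every file.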

17
Q

what is RAID 1?

A

disk mirroring

uses at least 2 disks, each holding a full copy of the data

18
Q

what is RAID 5?

A

block-level striping with parity. the parity blocks are distributed (interleaved) across all the disks so that no single parity disk becomes a bottleneck. survives one disk failure
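
A small sketch of XOR parity, assuming equal-sized blocks within a stripe. `parity_disk_for` shows one simple way to rotate the parity position; real layouts vary, and both function names are hypothetical.

```python
from functools import reduce
from operator import xor


def xor_blocks(blocks):
    """Byte-wise XOR of equal-length blocks."""
    return bytes(reduce(xor, column) for column in zip(*blocks))


def parity_disk_for(stripe_index, n_disks):
    """Illustrative rotation: the parity block moves to the next disk on each stripe."""
    return stripe_index % n_disks


data = [b"disk", b"zero", b"onee"]  # three data blocks in one stripe
parity = xor_blocks(data)

# Pretend the disk holding data[1] died: XOR the survivors with the parity block.
rebuilt = xor_blocks([data[0], data[2], parity])
print(rebuilt)  # b'zero'
print([parity_disk_for(s, 4) for s in range(6)])  # [0, 1, 2, 3, 0, 1]
```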

19
Q

what is RAID 6?

A

like RAID 5 but with two independent parity calculations per stripe, so it survives two disk failures