Data Storage Flashcards

1
Q

Explain the difference between block storage and file storage.

A
  • Block storage divides storage devices into fixed-size blocks and accesses them directly
  • File storage uses a hierarchical structure of directories and files
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the key use cases for block storage and file storage?

A
  • Block storage: for low-level data manipulation, such as databases, where performance and precise control are critical
  • File storage: for general-purpose file sharing and applications where data is organized in a familiar way
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What factors should be considered when deciding between block and file storage

A
  • depends on the specific needs of the application
  • block storage preferred for performance-sensitive workloads
  • file storage for ease of management and sharing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe the concept of RAID (Redundant Array of Independent Disks)

A

RAID is a technique that combines multiple physical disks into a single logical unit for improved performance and redundancy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the different RAID levels

A

RAID 0, RAID 1, RAID 5, RAID 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does each RAID level provide redundancy and performance benefits

A
  • Some RAID configurations read and write across multiple disks in order to boost system performance and maximize throughput
  • Others mirror data sets across multiple disks (builds in data redundency)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you choose RAID level

A

depends on the trade-off between performance, redundancy, and cost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Discuss the pros and cons of SSDs (Solid State Drives) compared to traditional HDDs (Hard Disk Drives) for data storage

A
  • SSDs: faster read/write speeds, lower latency, and shock resistance
  • HDDs: greater capacity at a lower cost
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When would you recommend using Solid State Drives over Hard Disk Drives?

A

SSDs: performance-intensive applications like databases and virtualization BUT less economical for large-scale storage
HDDs: applications where performance isn’t critical (bulk storage, archive)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Explain the difference between object storage and block storage

A
  • object storage: treats data as objects with unique identifiers, metadata, and data payloads
  • Block storage: divides storage devices into fixed-size blocks and accesses them directly
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does object storage facilitate scalability and data management in distributed environments?

A
  • designed for scalability and simplified data management
  • eliminates the complexities of file systems
  • easier to manage large volumes of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are typical use cases for object storage

A
  • well-suited for cloud storage, content delivery networks (CDNs), and backup/archiving
  • web applications, IoT data, and unstructured data repositories
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe the principle of data redundancy

A

Data redundancy involves duplicating data to prevent data loss in case of hardware failures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe the principle of data replication

A

Data replication copies data to multiple locations, ensuring high availability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does data redundency and replication enhance data availability and fault tolerance?

A

Ensures data remains accessible even if certain components fail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the trade-offs to consider when implementing data redundency and/or replication?

A
  • increased storage costs
  • potential synchronization challenges between replicas
  • possible data consistency issues in distributed environments