Chapter 3 - Software architectures and their trade-offs Flashcards

Question

Central DW Arch

Answer 1

Simple, all data is in one central DW

Answer 2

Data is stored in data marts, each holding the part of the data relevant to, e.g., one department. The main DW is logical (a.k.a. virtual). Distribution allows better performance, but the setup is more complex to maintain.

Answer 3

The central DW is materialized; data is then distributed to data marts, which can be further divided into more data marts, forming a kind of redundant tree. It offers great read performance because the data is redundant and well distributed, but it is hard to manage and complex to implement.

Answer 4

Metadata is needed because in OLAP the information about how the data was created is relevant. Creating a DW takes a long time and many resources, and often fails due to lack of knowledge, costs, ethical issues etc.

Answer 5

It relates to datasets so large that managing them in the usual way becomes awkward and problematic.

Answer 6

Memory, Storage, Network

Answer 7

Most important: scalability! What data is important to collect (in times of AI)? Definitely not what we already know. Creating a system with little overhead.

Answer 8

Lower / higher capacity of a single unit, e.g. 2 GB -> 4 GB RAM

Answer 9

More / fewer units (usually on demand), e.g. 1 EC2 instance -> 2 EC2 instances

Answer 10

Horizontal scaling is usually less expensive, instantly available, can be automated, and is not limited by hardware capacity. Vertical scaling is more expensive (specialized servers), usually requires additional setup, and may be limited by hardware capacity (a machine can only handle so much RAM).

Answer 11

!= data pool. Used to partition data without replicating it. Used in NoSQL databases such as MongoDB. Each shard represents a single node in a cluster. Requires a central lookup, e.g. a hash table, to know where an element is located.

Answer 12

Lookup, Range, Hash

Answer 13

Map of all shards

Answer 14

Every shard is responsible for a different range of values, ordered by a shard key.
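
A minimal sketch of range-based shard selection, assuming string shard keys and three shards. The boundaries in `RANGE_BOUNDS` and the function name `range_shard` are hypothetical and chosen only for illustration.

```python
from bisect import bisect_right

# Hypothetical split points between shards, sorted by shard key.
# Keys below "h" go to shard 0, keys from "h" up to (not including) "p"
# go to shard 1, and everything from "p" onwards goes to shard 2.
RANGE_BOUNDS = ["h", "p"]


def range_shard(shard_key: str) -> int:
    """Return the index of the shard whose range contains shard_key."""
    return bisect_right(RANGE_BOUNDS, shard_key)


print(range_shard("alice"), range_shard("henry"), range_shard("zoe"))
```

Because neighbouring keys land on the same shard, range scans stay local, but a burst of keys from one range loads a single shard.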

Answer 15

A hash function maps each key to a shard. The idea is to counteract the hotspots that can arise with range-based sharding.
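
A minimal sketch of hash-based shard selection, assuming a fixed cluster of four shards. `NUM_SHARDS`, `hash_shard`, and the sample keys are hypothetical; real systems often use consistent hashing instead of a plain modulo so that adding a shard moves less data.

```python
import hashlib
from collections import Counter

NUM_SHARDS = 4  # hypothetical cluster size


def hash_shard(shard_key: str, num_shards: int = NUM_SHARDS) -> int:
    """Map a key to a shard index using a stable hash of the key."""
    digest = hashlib.sha256(shard_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


# Sequential keys, which would all hit one shard under range sharding,
# are spread across the shards by the hash.
keys = [f"user{i:04d}" for i in range(1000)]
distribution = Counter(hash_shard(k) for k in keys)
print(sorted(distribution.items()))
```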

Answer 16

Schema free (non-relational), horizontally scalable

Answer 17

Example of a fast key-value store. Used e.g. in real-time analytics, stock prices and many more. Used by Twitter, GitHub, Stack Overflow.

Answer 18

A Bigtable map is indexed as follows: (row key, column key, timestamp). Good for additions, but not good for modifications.
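
A toy in-memory sketch of the (row key, column key, timestamp) indexing, not Bigtable's actual API. The class `TinyBigtable` and its methods are hypothetical. It illustrates why additions are cheap: a change is stored as a new timestamped cell instead of overwriting the old one.

```python
from typing import Dict, Optional, Tuple

Cell = Tuple[str, str, int]  # (row key, column key, timestamp)


class TinyBigtable:
    """Hypothetical stand-in for a wide-column map; for illustration only."""

    def __init__(self) -> None:
        self._cells: Dict[Cell, str] = {}

    def put(self, row: str, column: str, timestamp: int, value: str) -> None:
        # Each write adds a new version; existing versions are left untouched.
        self._cells[(row, column, timestamp)] = value

    def get_latest(self, row: str, column: str) -> Optional[str]:
        versions = [
            (ts, value)
            for (r, c, ts), value in self._cells.items()
            if r == row and c == column
        ]
        if not versions:
            return None
        return max(versions, key=lambda pair: pair[0])[1]


table = TinyBigtable()
table.put("com.example", "contents:html", 1, "<html>v1</html>")
table.put("com.example", "contents:html", 2, "<html>v2</html>")
print(table.get_latest("com.example", "contents:html"))
```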

Answer 19

JSON-like storage, e.g. MongoDB

Answer 20

Documents are independent, meaning their structure can be changed as we go. Application logic is easy, as entities are transformed from code directly into documents in the database and vice versa (no mapping needed). Semi-structured data allows more intercompatibility in case of a migration: there is no need to know its information schema.

Answer 21

The idea is to create free-floating objects that are interconnected by meaning. Because relationships are materialized when the data is created, there is no penalty for traversing them later, which allows constant access time. E.g. Neo4j.

Answer 22

1. Iterate over the data (a large number of records)
2. Extract something of interest from each record
3. Shuffle and sort the intermediate results
4. Aggregate them back for a full view
5. Generate the final output
=> MapReduce
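
The steps above can be sketched as a single-process word count. This is only an illustration of the data flow, not a distributed implementation; the function names and the sample records are assumptions.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def map_phase(record: str) -> List[Tuple[str, int]]:
    """Steps 1-2: visit a record and extract (word, 1) pairs from it."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Step 3: group intermediate values by key and sort the keys."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(sorted(groups.items()))


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Step 4: aggregate all values that belong to one key."""
    return key, sum(values)


def map_reduce(records: Iterable[str]) -> Dict[str, int]:
    intermediate = [pair for record in records for pair in map_phase(record)]
    grouped = shuffle(intermediate)
    # Step 5: the final output is one aggregated value per key.
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = map_reduce(["big data", "Big clusters", "data data"])
print(counts)
```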

Answer 23

Component wrapped behind a standardized interface e.g REST / SOAP

Answer 24

Can be called across platforms and operating systems regardless of programming language, at the same time allowing for cross use from different applications.

Answer 25

Consists of an Envelope that contains a Header and a Body. The Body then contains the information in the form of XML. It is sent over HTTP / SMTP. Hint: used in critical services.
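
A minimal sketch of the Envelope / Header / Body layout, built with Python's standard `xml.etree.ElementTree` and the SOAP 1.1 envelope namespace. The operation `GetStockPrice` and its `Symbol` parameter are hypothetical; a real service defines its own payload in its WSDL.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace


def build_envelope(operation: str, params: dict) -> str:
    """Return a SOAP envelope with an empty Header and one operation in the Body."""
    ET.register_namespace("soap", SOAP_NS)
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    ET.SubElement(envelope, f"{{{SOAP_NS}}}Header")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    op = ET.SubElement(body, operation)
    for name, value in params.items():
        ET.SubElement(op, name).text = str(value)
    return ET.tostring(envelope, encoding="unicode")


# Hypothetical request payload, for illustration only.
message = build_envelope("GetStockPrice", {"Symbol": "ACME"})
print(message)
```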

Answer 26

Service-oriented architecture. It is the design and its scale that matter here, e.g. event-based interaction, language independence.

Answer 27

Standardized contract - default way of accessing all services
Abstraction - hide as much as possible
Reusability - services should be reusable resources
Loose coupling - few dependencies
Stateless - stateful only when needed
Discoverability

Answer 28

When it launches it registers itself with a registry / load balancer and can be utilized from there.
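
A minimal in-memory sketch of this registration step, assuming a registry that hands out instances round-robin. The class `ServiceRegistry`, the service name, and the addresses are hypothetical.

```python
from typing import Dict, List


class ServiceRegistry:
    """Hypothetical registry: services register on launch, clients resolve by name."""

    def __init__(self) -> None:
        self._instances: Dict[str, List[str]] = {}
        self._cursor: Dict[str, int] = {}

    def register(self, service: str, address: str) -> None:
        # Called by a service instance when it starts up.
        self._instances.setdefault(service, []).append(address)

    def resolve(self, service: str) -> str:
        # Called by a client; rotates through the registered instances.
        instances = self._instances.get(service)
        if not instances:
            raise LookupError(f"no instance registered for {service!r}")
        index = self._cursor.get(service, 0) % len(instances)
        self._cursor[service] = index + 1
        return instances[index]


registry = ServiceRegistry()
registry.register("billing", "10.0.0.1:8080")
registry.register("billing", "10.0.0.2:8080")
```

Clients only need the service name; the registry decides which instance answers.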

Answer 29

Pros: multi-language, loosely coupled, independent of vendor or technology. Cons: no service ecosystem, complex, not easily scalable, less agile through hardcoding and dependencies.

Answer 30

An architectural STYLE that exposes resources on a networked system. It is not a protocol or a specification.

Answer 31

A thing that is unique, is more than just an ID and provides context, and is reachable within an addressable universe (URL / URN), e.g. a website, resume, aircraft, employee, application, printer, song etc.

Answer 32

The URI (URL / URN) needs to contain the state within it, or be given it, e.g. https://www.google.de/search?q=cloud&ie=utf-8&oe=utf-8&client=firefox-b-ab&... No client application state should be stored by the server. Important: resource state != application state.
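
A small sketch of how the state travels inside the URI itself, using Python's standard `urllib.parse`. The URL reuses only the query parameters visible in the example; the elided remainder is left out.

```python
from urllib.parse import parse_qs, urlsplit

url = "https://www.google.de/search?q=cloud&ie=utf-8&oe=utf-8&client=firefox-b-ab"

parts = urlsplit(url)
state = parse_qs(parts.query)  # each parameter maps to a list of values

# Everything the server needs for this request is in the URI,
# so no per-client session has to be kept on the server.
print(parts.path, state["q"], state["client"])
```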

Answer 33

The current state of a resource, e.g. a list of open tickets in XML / JSON / HTML / CSV etc., plus metadata of the resource: cover image, reviews, stock price etc.

Answer 34

Read-only: nothing will change as a result of the operation. E.g. GET

Answer 35

The operation has the same effect no matter how many times it is executed. E.g. PUT, DELETE
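
A toy sketch of this property, assuming an in-memory `TicketStore` (a hypothetical class, not an HTTP server). POST is added here only as a contrast: repeating it typically creates another resource, so it is generally not idempotent.

```python
from typing import Dict


class TicketStore:
    """Hypothetical resource store that mimics PUT / DELETE / POST semantics."""

    def __init__(self) -> None:
        self.tickets: Dict[int, str] = {}
        self._next_id = 1

    def put(self, ticket_id: int, body: str) -> None:
        # Sets the resource to a given state; repeating it changes nothing.
        self.tickets[ticket_id] = body

    def delete(self, ticket_id: int) -> None:
        # Removing an already removed resource leaves the same end state.
        self.tickets.pop(ticket_id, None)

    def post(self, body: str) -> int:
        # Creates a new resource on every call, so repeats are visible.
        ticket_id = self._next_id
        self._next_id += 1
        self.tickets[ticket_id] = body
        return ticket_id


store = TicketStore()
store.put(42, "printer broken")
store.put(42, "printer broken")  # same state as after the first PUT
print(store.tickets)
```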

Answer 36

“DevOps is a set of practices intended to reduce the time between committing a change to a system and the change being placed into normal production, while ensuring high quality.”

Answer 37

Software suites that can be independently deployed, with certain common characteristics such as organization around business capability, automated deployment, or decentralized control of data.

Answer 38

Each user request is satisfied by some sequence of services. Most services are private and not visible from the outside. All services are independently deployable and updatable. Services are organized around business capabilities instead of resources etc.

Answer 39

Monoliths put all functionality into one big process. Microservices split it across different services. This makes microservices easier to scale: a monolith must be replicated as a whole, while with microservices individual services can be duplicated.

Answer 40

All Services communicate via interfaces (REST, or lightweight "dumb" pipe messaging e.g RabbitMQ / ZeroMQ)

Answer 41

To address the common problems, some patterns have developed, e.g. service-per-container, service-discovery, db-per-service or shared db.

Answer 42

auditor, broker, carrier, consumer, provider

Answer 43

Manages the use, performance and delivery of cloud services. Also negotiates relationships between cloud providers and consumers.

Answer 44

Conducts assessments of cloud services, IT systems, performance and security

Answer 45

Provides connectivity and transport to and from the cloud

Answer 46

On-demand self-service, broad network access, resource pooling, rapid elasticity, measured service

Answer 47

Computing capabilities are easily provisioned as needed, without human interaction with the service provider.

Answer 48

The provider's resources serve multiple consumers using a multi-tenant model.

Answer 49

Quick scaling in/out is possible, even automatically, e.g. via DNS / load balancing.

Answer 50

Automatic control and optimization of resources. The consumer should have a process in place for detecting idle resources.

Answer 51

A system with predefined scaling conditions that trigger the allocation of IT resources from the resource pools.
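
A minimal sketch of such a predefined scaling condition, assuming CPU utilization as the trigger metric. The thresholds, the instance limits, and the names `ScalingRule` and `desired_instances` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ScalingRule:
    """Hypothetical scaling condition; all values are illustrative."""

    scale_out_above: float = 0.80  # add an instance above 80% CPU
    scale_in_below: float = 0.30   # remove an instance below 30% CPU
    min_instances: int = 1
    max_instances: int = 10


def desired_instances(current: int, cpu_utilization: float, rule: ScalingRule) -> int:
    """Return the instance count the rule asks for, clamped to the pool limits."""
    if cpu_utilization > rule.scale_out_above:
        target = current + 1
    elif cpu_utilization < rule.scale_in_below:
        target = current - 1
    else:
        target = current
    return max(rule.min_instances, min(rule.max_instances, target))


rule = ScalingRule()
print(desired_instances(2, 0.90, rule), desired_instances(2, 0.10, rule))
```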

Answer 52

Horizontal - e.g. more instances; vertical - e.g. more RAM; relocation - e.g. a move to a different device that has, for example, more I/O capacity


(76 cards)