Chapter 11: Large Scale Systems and Overlay Routing Flashcards

1
Q

Large‐scale storage applications (Web indexing at Google, Web archives): motivation and goals

A

Web indexing:

  • Goal: Index the entire Web
  • Estimate: Google has a 250,000‐node cluster!
    • Worldwide & massively distributed
    • Organized as datacenters of clusters (racks upon racks)

Web archives:

  • Goal: Make and archive a daily checkpoint of the Web
  • Estimates
    • Web is about 57 Tbyte, compressed HTML+img
    • New data per day: 580 Gbyte
    • ~1000 Tbyte per year with 5 replicas (just for new data)
  • Design
    • 10,000 nodes: 100 Gbyte disk each (today: Maybe ~4 TB each)
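A quick back-of-the-envelope check of the yearly estimate above (a minimal Python sketch; the 580 Gbyte/day and 5-replica figures are the slide's own estimates):

```python
# Back-of-the-envelope check of the slide's yearly-volume estimate.
new_data_per_day_gb = 580     # new Web data per day (slide estimate)
replicas = 5                  # replication factor assumed on the slide

new_data_per_year_tb = new_data_per_day_gb * 365 / 1000    # ~212 Tbyte/year of new data
with_replicas_tb = new_data_per_year_tb * replicas         # ~1059 Tbyte/year, i.e. ~1000 Tbyte

print(new_data_per_year_tb, with_replicas_tb)
```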
2
Q

Client server limitations

A
  • Scalability is expensive
  • Presents a single point of failure
  • Requires administration
  • Unused resources at the network edge
  • P2P systems try to address these limitations and leverage (otherwise) unused resources
3
Q

P2P computing

A
  • P2P computing is the sharing of computer resources and services by direct exchange between systems.
  • These resources and services include the exchange of data, processing cycles, cache storage, and disk storage for files.
  • P2P computing takes advantage of existing computing power, computer storage and networking connectivity, allowing users to leverage their collective power to the ‘benefit’ of all.
4
Q

What is a P2P system?

A
  • A distributed system architecture
    • No centralized control
    • Nodes are symmetric in function
  • Larger number of unreliable nodes
  • Enabled by technology improvements
5
Q

P2P architecture

A
  • All nodes are both clients and servers
    • Provide and consume
    • Any node can initiate a connection
  • No centralized data source
    • “The ultimate form of democracy on the Internet”
    • “The ultimate threat to copyright protection on the Internet”
  • In practice, hybrid models are popular
    • Combination of client-server & peer‐to‐peer
    • E.g., Skype (in its early days; its current architecture is unknown) and Spotify
6
Q

P2P benefits

A
  • Efficient use of resources
    • Unused bandwidth, storage, processing power at the edge of the network
  • Scalability
    • Consumers of resources also donate resources
    • Aggregate resources grow naturally with utilization
      • Organic scaling
      • Infrastructureless scaling
  • Caveat: it is not one-size-fits-all
    • Large companies are not switching to P2P
  • Reliability (in aggregate)
    • Replicas
    • Redundancy
    • Geographic distribution
    • No single point of failure
  • Ease of administration
    • Nodes self-organize
    • No need to deploy servers to satisfy demand
    • Built‐in fault‐tolerance, replication, and load balancing
7
Q

Popular P2P systems (first generation)

A
  • Unstructured p2p systems: Napster, Gnutella, FastTrack, Freenet, eDonkey, BitTorrent
  • Large‐scale sharing of files
    • User A makes files (music, video, etc.) on their computer available to others
    • User B connects to the network, searches for files and downloads files directly from User A
  • Issues of copyright infringement
8
Q

Napster: June 1999 – July 2001

A
  • A way to share (music) files with others (maybe the first)
  • Users upload their list of files to Napster server
  • Users send queries to Napster server for files of interest
    • Keyword search (artist, song, album, bit rate, etc.)
  • Napster server replies with IP address of users with matching files
  • Querying users connect directly to file providing user for download
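A minimal sketch of the centralized-index idea described above, assuming a toy in-memory index; the class name, method names, and sample data are illustrative, not Napster's actual protocol:

```python
from collections import defaultdict

class NapsterIndex:
    """Toy central index: maps keywords to (filename, peer IP) entries."""

    def __init__(self):
        self.index = defaultdict(set)

    def publish(self, peer_ip, filenames):
        # A peer uploads its file list; the server indexes every word of every name.
        for name in filenames:
            for word in name.lower().split():
                self.index[word].add((name, peer_ip))

    def search(self, keyword):
        # The server answers keyword queries with the peers holding matching files;
        # the download itself then happens directly between the peers.
        return sorted(self.index[keyword.lower()])

server = NapsterIndex()
server.publish("10.0.0.5", ["pink floyd - time.mp3"])
print(server.search("floyd"))   # [('pink floyd - time.mp3', '10.0.0.5')]
```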
9
Q

Gnutella: 2000 – today

A
  • Share any type of files (not just music)
  • Decentralized search, unlike Napster
  • Ask neighbors for files of interest
  • Neighbors ask their neighbors, and so on
    • TTL field quenches messages after a number of hops
  • Users with matching files reply to you
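A minimal sketch of TTL-limited query flooding as described above, assuming the overlay is given as a static neighbor dictionary; the node names, the graph, and `files_at` are made-up illustrations:

```python
def flood_query(graph, files_at, start, keyword, ttl):
    """Flood a query from `start`; the TTL drops by one per hop, duplicates are ignored."""
    hits, seen, frontier = [], {start}, [start]
    while frontier and ttl > 0:
        ttl -= 1
        next_frontier = []
        for node in frontier:
            for neighbor in graph[node]:
                if neighbor in seen:
                    continue                      # already visited: drop duplicate query
                seen.add(neighbor)
                next_frontier.append(neighbor)
                if keyword in files_at.get(neighbor, set()):
                    hits.append(neighbor)         # matching peer would reply to the originator
        frontier = next_frontier
    return hits

graph = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A"], "D": ["B"]}
files_at = {"D": {"song.mp3"}}
print(flood_query(graph, files_at, "A", "song.mp3", ttl=2))   # ['D']
```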
10
Q

Freenet (freenetproject.org), since 2000

A
  • Goals by founder: “Providing freedom of speech with strong anonymity protection.”
  • Protects anonymity of participants
  • Platform for censorship‐resistant communication
  • Decentralized, highly survivable, distributed cache (blogs, pages, files, etc.)
  • Fully peer-to-peer: no dedicated clients or servers
  • Only enables access to information previously inserted into the network (it is not a Web proxy)
  • Every node contributes a configurable amount of storage
  • Not possible for a node to rate another node (except on insert/retrieve capacity)
11
Q

Freenet Anonymity requirement & implications

A
  • Anonymity for information upload & download
  • Source does not remain on the network after upload
  • Files are broken into encrypted blocks and are redundantly stored across network
  • For download, blocks are found and reassembled
  • Node requesting a datum does not connect directly to node that has datum
  • Datum routed across intermediaries, none of which know request originator or location
  • Higher bandwidth use required, slower transfers
12
Q

Freenet Key disadvantage of storage model

A
  • No one node is responsible for any block of data
  • If data is not retrieved for some time, it may be dropped when newly arriving data exceeds the available space
  • Therefore, Freenet tends to “forget” data that is not retrieved regularly
  • There is no way to delete data (unless it is “forgotten”)
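A minimal sketch of this “forgetting” behaviour, modelling a node's store as a capacity-bounded cache that evicts the least recently retrieved entry; the class name and capacity are illustrative assumptions, not Freenet's actual data store:

```python
from collections import OrderedDict

class ForgetfulStore:
    """Bounded store: when full, inserting evicts the least recently retrieved entry."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = OrderedDict()

    def insert(self, key, value):
        self.items[key] = value
        self.items.move_to_end(key)
        if len(self.items) > self.capacity:
            self.items.popitem(last=False)    # silently "forget" the coldest entry

    def retrieve(self, key):
        if key in self.items:
            self.items.move_to_end(key)       # retrieval keeps an entry warm
            return self.items[key]
        return None                           # never stored, or already forgotten

store = ForgetfulStore(capacity=2)
store.insert("a", "old block")
store.insert("b", "block b")
store.insert("c", "block c")                  # evicts "a", which was never retrieved
print(store.retrieve("a"))                    # None
```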
13
Q

Comparison of file sharing networks

A
  • Napster (centralized)
    • Bottleneck (scalability, failure, denial of service)
    • Correct search results (centralized search)
  • Gnutella (distributed)
    • No central bottleneck, but large cost due to flooding query
    • No guarantee on search results
  • Freenet (distributed)
    • Anonymity
    • Less efficient data transfer
    • No guarantee on search result
14
Q

Structured peer‐to‐peer systems

A
  • Second generation peer‐to‐peer overlay networks
  • Self-organizing, load-balanced, fault-tolerant
  • Guarantees on numbers of hops to answer a query
  • Based on a (distributed) hash table interface
    • Put(Key, Data)
    • Get(Key)
  • Systems: Chord, CAN, Pastry, etc.
15
Q

Distributed hash tables (DHT)

A
  • Distributed version of a hash table data structure
  • Store and retrieve (key, value)‐pairs
    • Key is, for example, a filename, a hash of the name, or a hash of the content (since the name could change)
    • Value is file content
16
Q

A DHT has a simple interface

A
  • put(key, value) and get(key) → value
    • Simple interface!
  • API supports a wide range of applications
    • DHT imposes neither structure nor meaning on keys
  • Key-value pairs are persisted and globally available
    • Can store keys in other DHT values
    • Thus, build complex data structures
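A minimal sketch of the interface above, assuming a single-process dict stands in for the distributed table and that keys are content hashes; the helper names are illustrative:

```python
import hashlib

dht = {}   # stand-in for the distributed table: key -> value

def put(key, value):
    dht[key] = value

def get(key):
    return dht.get(key)

# Key the value by a hash of its content, as suggested above.
content = b"file contents"
key = hashlib.sha1(content).hexdigest()
put(key, content)
assert get(key) == content

# Keys can be stored inside other values, building larger structures:
directory = {"readme.txt": key}
dir_key = hashlib.sha1(repr(directory).encode()).hexdigest()
put(dir_key, directory)
print(get(dir_key)["readme.txt"] == key)   # True
```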
17
Q

A DHT makes a good shared infrastructure

A
  • Many applications can share single DHT service
  • Eases deployment of new applications
  • Pools resources from many participants
  • Essentially, a middleware service
18
Q

DHT‐based projects

A
  • File sharing [CFS, OceanStore, PAST, Ivy, …]
  • Web cache [Squirrel, ..]
  • Archival/backup store [HiveNet, Mojo, Pastiche]
  • Censor‐resistant stores [Eternity, FreeNet, ..]
  • DB query and indexing [PIER, …]
  • Event notification [Scribe]
  • Naming systems [ChordDNS, Twine, ..]
  • Communication primitives [I3, …]
  • Plethora of key‐value stores [BigTable, Dynamo, PNUTS, …]
  • Common denominator:
    • Data is location‐independent
    • All leverage the DHT abstraction
19
Q

CFS: Cooperative file sharing

A
  • DHT is a robust block store
  • Client of DHT implements file system
    • Read-only: CFS, PAST
    • Read-write: OceanStore, Ivy
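A minimal sketch of the “file system client on top of a DHT block store” idea, assuming content-hashed blocks plus a root block that lists their keys; the block size and helper functions are illustrative, not CFS's actual layout:

```python
import hashlib, json

blocks = {}   # stand-in for the DHT block store

def put_block(data):
    key = hashlib.sha1(data).hexdigest()
    blocks[key] = data
    return key

def store_file(data, block_size=4):
    # Split the file into blocks, store each one, then store a root block listing the keys.
    keys = [put_block(data[i:i + block_size]) for i in range(0, len(data), block_size)]
    return put_block(json.dumps(keys).encode())

def read_file(root_key):
    keys = json.loads(blocks[root_key])
    return b"".join(blocks[k] for k in keys)

root = store_file(b"hello block store")
print(read_file(root))   # b'hello block store'
```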
20
Q

DHT desirable properties

A
  • Keys mapped evenly to all nodes in the network
  • Node arrival & departures only affect a few nodes
  • Each node maintains information about only a few other nodes
  • Messages can be routed to a node efficiently
21
Q

Chord identifier circle

A
  • Nodes organized in an identifier circle based on node identifiers
  • Keys assigned to their successor node in the identifier circle
  • Hash function ensures even distribution of nodes and keys on the circle
  • Cf. consistent hashing
  • With N nodes and K keys, each node is responsible for roughly K/N keys
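A minimal sketch of successor-based key assignment on the identifier circle, assuming SHA-1 identifiers truncated to m bits; the node names and the small m are illustrative:

```python
import hashlib
from bisect import bisect_left

M = 8                                       # identifier bits (illustrative; Chord uses 160)

def ident(name):
    return int(hashlib.sha1(name.encode()).hexdigest(), 16) % (2 ** M)

def successor(node_ids, key_id):
    """First node identifier >= key_id, wrapping around the circle at 2^M."""
    node_ids = sorted(node_ids)
    i = bisect_left(node_ids, key_id)
    return node_ids[i % len(node_ids)]

nodes = [ident(f"node-{i}") for i in range(4)]
key = ident("some-file.txt")
print(f"key {key} is stored at node {successor(nodes, key)}")
```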
22
Q

CHORD Node joins & leaves

A
  • Successor (or predecessor) node may disappear from the network (e.g., failure, departure)
  • Each node records a whole segment of the circle adjacent to it, i.e., nodes preceding and following it
  • With high probability a node is able to correctly locate its successor or predecessor (even under high node failure)
  • When a new node joins or leaves the network, responsibility for O(K/N) keys changes hands.
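A minimal sketch of how responsibility for keys changes when a node joins, reusing the successor rule above; identifiers are small integers for readability, and the actual join protocol and successor lists are not modelled:

```python
from bisect import bisect_left

M = 6                                           # identifier bits (illustrative)

def successor(node_ids, key_id):
    node_ids = sorted(node_ids)
    i = bisect_left(node_ids, key_id)
    return node_ids[i % len(node_ids)]

def assignment(node_ids, key_ids):
    return {k: successor(node_ids, k) for k in key_ids}

nodes = [5, 20, 40, 55]
keys = range(2 ** M)

before = assignment(nodes, keys)
after = assignment(nodes + [30], keys)          # node 30 joins the ring

moved = [k for k in keys if before[k] != after[k]]
print(moved)      # only keys 21..30 change hands, from node 40 to the new node 30
```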
23
Q

Searching in Chord

A
  • With only successor knowledge at each node, a linear search over the network could locate a particular key (naïve search)
  • Any given message may potentially have to be relayed through most of the network
  • Faster search method requires each node to keep a “finger table” containing up to m entries
    • The i-th entry of node n contains the address of successor((n + 2^(i−1)) mod 2^m)
    • The number of nodes that must be contacted to find a successor in an N‐node network is O(log N)
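A minimal sketch of finger-table construction using the formula above, on an illustrative ten-node ring with m = 6:

```python
from bisect import bisect_left

M = 6                                            # identifier bits (illustrative)

def successor(node_ids, key_id):
    node_ids = sorted(node_ids)
    i = bisect_left(node_ids, key_id)
    return node_ids[i % len(node_ids)]

def finger_table(n, node_ids):
    """The i-th entry points at successor((n + 2^(i-1)) mod 2^M), for i = 1..M."""
    return [successor(node_ids, (n + 2 ** (i - 1)) % 2 ** M) for i in range(1, M + 1)]

nodes = [1, 8, 14, 21, 32, 38, 42, 48, 51, 56]
print(finger_table(8, nodes))   # targets 9,10,12,16,24,40 -> nodes [14, 14, 14, 21, 32, 42]
```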
24
Q

Chord key location

A
  • Look up in the finger table the furthest node that precedes the key
  • Query homes in on target in O(log n) hops
  • Each hop at least halves distance to destination
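A minimal sketch of the lookup loop: repeatedly forward to the furthest finger that precedes the key until the key falls between a node and its successor; the ring, m, and helper names are illustrative:

```python
from bisect import bisect_left, bisect_right

M = 6
NODES = sorted([1, 8, 14, 21, 32, 38, 42, 48, 51, 56])   # illustrative ring

def key_successor(key_id):                 # node responsible for a key
    i = bisect_left(NODES, key_id)
    return NODES[i % len(NODES)]

def node_successor(n):                     # next node after n on the circle
    i = bisect_right(NODES, n)
    return NODES[i % len(NODES)]

def fingers(n):
    return [key_successor((n + 2 ** (i - 1)) % 2 ** M) for i in range(1, M + 1)]

def in_open(x, a, b):                      # x in the circular open interval (a, b)
    return (a < x < b) if a < b else (x > a or x < b)

def in_half_open(x, a, b):                 # x in the circular interval (a, b]
    return (a < x <= b) if a < b else (x > a or x <= b)

def lookup(start, key_id):
    """Forward to the furthest finger preceding the key until the key maps to our successor."""
    node, hops = start, 0
    while not in_half_open(key_id, node, node_successor(node)):
        preceding = [f for f in fingers(node) if in_open(f, node, key_id)]
        if not preceding:
            break
        node, hops = preceding[-1], hops + 1      # furthest finger still preceding the key
    return node_successor(node), hops

print(lookup(8, 54))   # (56, 2): key 54 is owned by node 56, reached in 2 hops
```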
25
Q

CAN: Content addressable network

A
  • CAN is designed to be scalable, fault-tolerant, and self-organizing
  • Design is based on a virtual multidimensional Cartesian coordinate space to organize the overlay
  • Nodes are overlaid on a multi-torus (i.e., coordinates wrap around at the edges)
  • d‐dimensional coordinate space is a virtual logical address space
  • Nodes map to points in space
  • Address space is independent of physical location and physical connectivity of nodes
  • Keys map to points in space
  • Points in the space are identified with coordinates
  • A hash function is used for this mapping
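A minimal sketch of hashing a key to a point in the coordinate space, assuming d = 2 and that two bytes of a SHA-1 digest are used per coordinate; both choices are illustrative:

```python
import hashlib

D = 2                                   # dimensions of the coordinate space (illustrative)
SIDE = 2 ** 16                          # coordinate range per dimension; edges wrap (torus)

def key_to_point(key):
    """Hash a key to a point in the d-dimensional coordinate space."""
    digest = hashlib.sha1(key.encode()).digest()
    # Take two digest bytes per dimension (an illustrative choice of hash-to-point mapping).
    return tuple(int.from_bytes(digest[2 * i:2 * i + 2], "big") % SIDE for i in range(D))

print(key_to_point("some-file.txt"))    # e.g. (x, y) with 0 <= x, y < 65536
```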
26
Q

CAN principles

A
  • Entire coordinate space is dynamically partitioned among all nodes in system
  • Each node owns a distinct zone in the space
  • Each key hashes to a point in the space
27
Q

CAN routing

A
  • Put(key, data), get(key)
  • Greedily forward message to neighbor closest to destination in Cartesian coordinate space
  • Each node maintains a routing table that holds the IP addresses and zones of its neighbours
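A minimal sketch of greedy forwarding toward the key's point, modelling each node by the centre of its zone on a small 2-d torus with a static neighbour table; this toy model ignores CAN's actual zone bookkeeping:

```python
def torus_distance(p, q, side):
    """Euclidean distance on a 2-d torus with the given side length."""
    total = 0
    for a, b in zip(p, q):
        d = abs(a - b)
        total += min(d, side - d) ** 2
    return total ** 0.5

def greedy_route(neighbors, start, target, side):
    """Forward to the neighbour closest to the target until no neighbour is closer."""
    path, node = [start], start
    while True:
        closest = min(neighbors[node], key=lambda n: torus_distance(n, target, side))
        if torus_distance(closest, target, side) >= torus_distance(node, target, side):
            return path            # no neighbour is closer: this node's zone holds the point
        node = closest
        path.append(node)

# Toy 4x4 grid of zone centres; each node knows its grid neighbours (with wrap-around).
side = 16
centres = [(x * 4 + 2, y * 4 + 2) for x in range(4) for y in range(4)]
neighbors = {c: [((c[0] + dx) % side, (c[1] + dy) % side)
                 for dx, dy in ((4, 0), (-4, 0), (0, 4), (0, -4))] for c in centres}
print(greedy_route(neighbors, (2, 2), (13, 9), side))   # [(2, 2), (2, 6), (14, 6), (14, 10)]
```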
28
Q

Node joining a CAN

A
  • Find a node already in the overlay network
  • Identify a zone that can be split
    • Pick random point
    • Route join request to node managing the point’s zone
    • Initiate split of zone at that node
  • Update routing tables of nodes neighbouring the newly split zone
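A minimal sketch of the zone split performed at a join, assuming rectangular zones given as (low-corner, high-corner) pairs; splitting along the longest dimension is an illustrative policy (CAN follows a fixed ordering of dimensions):

```python
import random

def point_in(zone, point):
    lo, hi = zone
    return all(lo[i] <= point[i] < hi[i] for i in range(len(point)))

def split_zone(zone):
    """Halve a rectangular zone along its longest dimension (illustrative policy)."""
    lo, hi = zone
    dim = max(range(len(lo)), key=lambda i: hi[i] - lo[i])
    mid = (lo[dim] + hi[dim]) / 2
    keep = (lo, tuple(mid if i == dim else hi[i] for i in range(len(hi))))
    give = (tuple(mid if i == dim else lo[i] for i in range(len(lo))), hi)
    return keep, give

# The joining node picks a random point, routes to the zone owning it, and splits that zone.
zones = [((0.0, 0.0), (1.0, 0.5)), ((0.0, 0.5), (1.0, 1.0))]
p = (random.random(), random.random())
owner = next(z for z in zones if point_in(z, p))
keep, give = split_zone(owner)
zones.remove(owner)
zones += [keep, give]
print(f"point {p} fell into {owner}; old node keeps {keep}, new node takes {give}")
```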
29
Q

DHT routing summary

A
  • Chord
    • Finger table routing
      • Each hop at least halves distance (in identifier circle) to destination
  • CAN
    • Neighbour routing
      • Forward to neighbor that is closest (in Cartesian coordinate space) to destination
  • Pastry
    • Prefix routing
      • Each hop matches the destination identifier in at least one more digit
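A minimal sketch of the prefix-matching rule behind Pastry-style routing, assuming hexadecimal node identifiers; the IDs are made up and the actual Pastry routing table is not modelled:

```python
def shared_prefix_len(a, b):
    """Number of leading hex digits two identifiers have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

def next_hop(current, neighbors, target):
    """Prefer a neighbour that matches the target in at least one more digit than we do."""
    ours = shared_prefix_len(current, target)
    better = [n for n in neighbors if shared_prefix_len(n, target) > ours]
    return max(better, key=lambda n: shared_prefix_len(n, target)) if better else current

current, target = "65a1fc", "d46a1c"
neighbors = ["d13da3", "d4213f", "d462ba"]
print(next_hop(current, neighbors, target))   # 'd462ba' shares the prefix "d46" with the target
```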
30
Q

Peer‐to‐peer systems review

A
  • Two key functions of P2P systems
    • Sharing content
    • Finding content
  • Sharing content
    • Direct transfer between peers
    • Structured vs. unstructured placement of data
    • Automatic replication of data
  • Finding content
    • Centralized (Napster)
    • Decentralized (Gnutella)
    • Guaranteed bounds (DHTs)