Test 3 Traffic Engineering Flashcards

Question

Data Center Topologies - Pods

Answer 1

The scale problem arises because we have tens of thousands of servers on a flat layer 2 topology, where every server has a topology-independent MAC (hardware) address. In the default case, every switch in the topology therefore has to store a forwarding table entry for every single MAC address.
Solution: pods
- Assign a pseudo-MAC address to each server, corresponding to the pod in which it is located in the topology.
- Each server has both a real MAC address and a pseudo-MAC address.
- Switches in the data center topology no longer need to maintain forwarding table entries for every host.
- They only need to maintain entries for reaching other pods in the topology.
- Once a frame enters a pod, the switch of course has entries for all of the servers inside that pod, but it does not need to maintain entries for the MAC addresses of servers outside its own pod.

Answer 2

Hosts are unmodified, so they still respond to things like ARP queries with their real MAC addresses. We also need a way of mapping pseudo-MAC addresses to real MAC addresses. The Fabric Manager achieves hierarchical forwarding as follows:
- When Host A issues an ARP query, the switch intercepts the query and forwards it to the Fabric Manager.
- The Fabric Manager responds with the pseudo-MAC corresponding to that IP address.
- Host A then sends the frame with the destination pseudo-MAC address, and switches in the topology can forward that frame to the pod corresponding to the pseudo-MAC address of the destination server.
- Once the frame reaches the destination pod (say, pod 3), the switch at the top of that pod maps the pseudo-MAC address back to the real MAC address.
- The server that receives the frame sees an Ethernet frame with its real destination MAC address, so it knows the frame was intended for it.
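The ARP interception steps above can be sketched as a pair of lookup tables. This is a minimal illustration, not PortLand's implementation: the class names, method names, and the example addresses are all hypothetical.

```python
# Hypothetical sketch of ARP proxying through a fabric manager.
# All names and addresses are illustrative assumptions.

class FabricManager:
    """Keeps soft state mapping IP addresses to pseudo-MACs (PMACs)."""

    def __init__(self):
        self.ip_to_pmac = {}

    def register(self, ip, pmac):
        self.ip_to_pmac[ip] = pmac

    def resolve(self, ip):
        # Returns None when no mapping is known for this IP.
        return self.ip_to_pmac.get(ip)


class EdgeSwitch:
    """Intercepts ARP queries and rewrites PMACs back to real MACs."""

    def __init__(self, fabric_manager):
        self.fm = fabric_manager
        self.pmac_to_real = {}

    def attach_host(self, ip, real_mac, pmac):
        # The switch remembers the real MAC; the fabric manager learns the PMAC.
        self.pmac_to_real[pmac] = real_mac
        self.fm.register(ip, pmac)

    def handle_arp_query(self, target_ip):
        # Instead of flooding, ask the fabric manager for the PMAC.
        return self.fm.resolve(target_ip)

    def rewrite_on_delivery(self, dst_pmac):
        # In the destination pod, map the PMAC back to the real MAC.
        return self.pmac_to_real[dst_pmac]


fm = FabricManager()
switch_a = EdgeSwitch(fm)   # edge switch of Host A
switch_b = EdgeSwitch(fm)   # edge switch of Host B, in another pod
switch_b.attach_host("10.3.1.2", "aa:bb:cc:dd:ee:ff", "00:03:01:02:00:01")

pmac = switch_a.handle_arp_query("10.3.1.2")   # Host A's ARP is answered with a PMAC
real = switch_b.rewrite_on_delivery(pmac)      # Host B receives its real MAC
```

Host A never learns the real MAC of Host B; only the destination edge switch holds that mapping.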

Answer 3

- Reduce link utilization.
- Reduce the number of hops to reach the edge of the data center.
- Make the data center network easier to maintain.

Answer 4

VL2: creates virtual layer 2 networks that allow application addresses to be separated from network devices (http://blog.moertel.com/posts/2011-03-17-a-quick-summary-of-the-vl2-data-center-network-scheme.html).
VL2 has two main objectives:
- Achieve layer 2 semantics across the entire data center topology. This is done with name-location separation and a resolution service that resembles the Fabric Manager.
- Achieve uniform high capacity between the servers and balance load across links in the topology. For this, VL2 relies on flow-based random traffic indirection using Valiant Load Balancing.

Answer 5

Goals of Valiant Load Balancing in the VL2 network:
- Spread traffic evenly across the servers.
- Ensure that traffic load is balanced independently of the destinations of the traffic flows.
This is achieved by inserting an indirection level into the switching hierarchy:
- When a switch at the access layer wants to send traffic to a destination, it first selects, at random, a switch at the indirection level to send the traffic to.
- This intermediate switch then forwards the traffic to the ultimate destination, depending on the destination MAC address of the traffic.
- Subsequent flows might pick different indirection points, again at random.
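The indirection step can be sketched in a few lines. This is a simplified model under stated assumptions: the function names, the flow identifier, and the switch labels are hypothetical, and the per-flow table is just one way to keep every packet of a flow on the same intermediate switch.

```python
import random

def pick_intermediate(flow_id, intermediates, rng, flow_table):
    """Pick a random intermediate switch per flow and remember the choice."""
    if flow_id not in flow_table:
        flow_table[flow_id] = rng.choice(intermediates)
    return flow_table[flow_id]

def vlb_path(src_switch, dst_switch, flow_id, intermediates, rng, flow_table):
    """Path is source -> random intermediate -> destination."""
    mid = pick_intermediate(flow_id, intermediates, rng, flow_table)
    return [src_switch, mid, dst_switch]

rng = random.Random(0)          # seeded only to make the sketch repeatable
flow_table = {}
intermediates = ["I1", "I2", "I3", "I4"]

flow = ("10.0.0.1", "10.0.1.1", 5000, 80)   # hypothetical flow identifier
path = vlb_path("tor-a", "tor-b", flow, intermediates, rng, flow_table)
```

Because the intermediate is chosen without looking at the destination, the load on the indirection level does not depend on where the flows are going.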

Answer 6

Jellyfish is a technique for networking data centers randomly.
Goals:
- Achieve high throughput to support, for example, big data analytics or agile placement of virtual machines.
- Incremental expandability, so that network operators can easily add or replace servers and switches.
For example, large companies like Facebook are adding capacity on a daily basis. Commercial products make it easy to expand or provision servers in response to changing traffic load, but not the network. Unfortunately, the structure of data center networks constrains expansion. A structure such as a hypercube requires 2^k switches, where k is the switch degree (port count). Even more efficient topologies, like a fat tree, still require a number of switches that is quadratic in the port count.

Answer 7

Pods, VL2, Jellyfish

Answer 8

Most of the congestion occurs at the top level. Jellyfish's answer to how data center structure constrains expansion is simply to have no structure at all.

Answer 9

Jellyfish's topology is what is called a random regular graph.
- It is random because each graph is selected uniformly at random from the set of all regular graphs.
- A regular graph is simply one where each node has the same degree.
- A graph in Jellyfish is one where the switches in the topology are the nodes. Every node in this graph has a fixed degree of 12.
Jellyfish's approach is to construct a random graph at the Top of Rack (ToR) switch layer. Every ToR switch i has some total number of ports k_i, of which it uses r_i to connect to other ToR switches. The remaining k_i - r_i ports are used to connect servers. If every switch has the same k and r, then with N racks the network supports N(k - r) servers, and the network is a random regular graph denoted RRG(N, k, r). Formally, random regular graphs are sampled uniformly from the space of all r-regular graphs. Achieving this property exactly is a complex graph theory problem, but there is a simple procedure that produces a sufficiently uniform random graph that empirically has the desired properties.
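As a worked instance of the server count, assume every ToR switch has the same k ports and uses r of them for switch-to-switch links. The values N = 100, k = 24, r = 12 below are illustrative assumptions, not figures from the source.

```latex
\[
  \text{servers} \;=\; N\,(k - r) \;=\; 100 \times (24 - 12) \;=\; 1200
\]
```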

Answer 10

Construction procedure:
- Pick a random pair of switches that both have free ports and are not already neighbors.
- Join them with a link.
- Repeat this process until no further links can be added.
- If a switch remains with two or more free ports (which might happen during incremental expansion when a new switch is added), incorporate it into the topology by removing a uniformly random existing link and adding links from that link's endpoints to the switch.
For a particular equipment cost, using identical equipment, the Jellyfish topology can achieve increased capacity by supporting 25 percent more servers. This higher capacity is achieved because the paths through the topology are shorter than they would be in a fat-tree topology.
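The procedure above can be sketched as follows. This is a minimal sketch, assuming every switch has the same number r of inter-switch ports; the function name `build_jellyfish` and the adjacency-set representation are illustrative choices, and the result is only approximately uniform, as the text notes.

```python
import random

def build_jellyfish(n_switches, r, seed=None):
    """Return {switch: set(neighbours)}, each switch using at most r ports."""
    rng = random.Random(seed)
    adj = {s: set() for s in range(n_switches)}

    def free(s):
        return r - len(adj[s])

    def link(a, b):
        adj[a].add(b)
        adj[b].add(a)

    while True:
        # Pairs of non-adjacent switches that both still have free ports.
        open_sw = [s for s in adj if free(s) > 0]
        pairs = [(a, b) for i, a in enumerate(open_sw)
                 for b in open_sw[i + 1:] if b not in adj[a]]
        if pairs:
            link(*rng.choice(pairs))
            continue
        # No pair can be joined. Fix up a switch left with >= 2 free ports.
        stuck = [s for s in adj if free(s) >= 2]
        if not stuck:
            break
        s = stuck[0]
        # Existing links whose endpoints are not s and not yet adjacent to s.
        edges = [(a, b) for a in adj for b in adj[a]
                 if a < b and s not in (a, b)
                 and a not in adj[s] and b not in adj[s]]
        if not edges:
            break
        a, b = rng.choice(edges)
        adj[a].discard(b)       # remove the chosen link ...
        adj[b].discard(a)
        link(s, a)              # ... and connect both endpoints to s
        link(s, b)
    return adj

topology = build_jellyfish(n_switches=20, r=4, seed=1)
```

Each loop iteration increases the total number of links by one, so the loop terminates once the port budget is exhausted. The candidate lists are rebuilt on every pass, which is quadratic and only suitable for small illustrations.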

Answer 11

Consider a topology with 16 servers, 20 switches, and a fixed degree of four, for both the fat-tree topology and the Jellyfish random graph. In the fat-tree topology, only 4 of the 16 servers are reachable in fewer than five hops. In contrast, in the Jellyfish random graph, 12 servers are reachable. By making more servers reachable along shorter paths, Jellyfish can increase capacity over a conventional fat-tree topology.

Answer 12

- How close are these random graphs to optimal, in terms of the best throughput that could be achieved with a particular set of equipment?
- What about topologies where switches are heterogeneous, with different numbers of ports or link speeds?
- From a system design perspective, the random topology model could create problems with physically cabling the data center network, and with performing routing or congestion control without the structure of a conventional data center network like a fat tree.

Answer 13

We need to measure the topology, including not only the connectivity but also the capacity of each link and router. This could be done by routers self-reporting, similar to how they exchange information in a Link State protocol, but in practice is probably more often simply entered as data by a network engineer. We also need to measure the traffic, or offered load. This can be done using the “simple counters” measurement technique that we learned about earlier, since we want to know how much traffic is on each part of the network but don't necessarily need the details of specific flows

Answer 14

The “traditional” way to implement control is by adjusting link weights, which indirectly affects the routes calculated by the routing protocol. In practice, link weights are more often used this way than to represent any “real” property of the network, like bandwidth or link latency. Another way to implement control is by using SDN to directly control the routes that are used on the network.
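To make the link-weight mechanism concrete, the sketch below runs a shortest-path computation before and after an operator raises one weight. The four-node topology, its weights, and the helper names are assumptions chosen for illustration; the shortest-path routine is plain Dijkstra, standing in for whatever the routing protocol computes.

```python
import heapq

def shortest_path(graph, src, dst):
    """Dijkstra over graph[u][v] = weight. Assumes dst is reachable from src."""
    dist = {src: 0}
    prev = {}
    heap = [(0, src)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == dst:
            break
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    path = [dst]
    while path[-1] != src:
        path.append(prev[path[-1]])
    return path[::-1], dist[dst]

def set_weight(graph, u, v, w):
    """Set the weight of the bidirectional link u <-> v."""
    graph.setdefault(u, {})[v] = w
    graph.setdefault(v, {})[u] = w

graph = {}
set_weight(graph, "A", "B", 1)
set_weight(graph, "B", "D", 1)
set_weight(graph, "A", "C", 2)
set_weight(graph, "C", "D", 2)

before, cost_before = shortest_path(graph, "A", "D")   # goes via B
set_weight(graph, "B", "D", 5)                         # operator raises one weight
after, cost_after = shortest_path(graph, "A", "D")     # now goes via C
```

Nothing about the physical link B-D changed; only its configured weight did, which is why weights end up reflecting traffic engineering intent rather than bandwidth or latency.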

Answer 15

- LOCAL_PREF, the local preference parameter
- AS_PATH length, as determined by counting the number of ASes in the AS_PATH
- MULTI_EXIT_DISC, the MED value
- IGP metric to the NEXT_HOP, i.e., equal “hot potato” routing distance
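Applied in this order, the attributes act as successive tie-breakers. The sketch below models that as a sort key; it is a simplification under stated assumptions. The `Route` class and the sample routes are hypothetical, real BGP has further tie-breaking steps, and MED is normally only comparable between routes learned from the same neighboring AS, which this sketch ignores.

```python
from dataclasses import dataclass

@dataclass
class Route:
    name: str
    local_pref: int      # higher is preferred
    as_path: tuple       # shorter is preferred
    med: int             # lower is preferred
    igp_metric: int      # lower is preferred (hot potato)

def best_route(routes):
    """Compare attributes in order; later ones only break ties in earlier ones."""
    return min(
        routes,
        key=lambda r: (-r.local_pref, len(r.as_path), r.med, r.igp_metric),
    )

routes = [
    Route("via-A", 100, (65001, 65002), 10, 5),
    Route("via-B", 200, (65003, 65004, 65005), 50, 20),
    Route("via-C", 200, (65006, 65007), 50, 30),
    Route("via-D", 200, (65008, 65009), 50, 10),
]
chosen = best_route(routes)
```

Here via-A loses on LOCAL_PREF, via-B loses on AS_PATH length, and via-C and via-D tie on MED, so the IGP metric decides.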

Answer 16

This changes the flat layer 2 addressing (MAC addresses) into hierarchical addressing (pseudo-MAC addresses). This means that switches only need to store a forwarding entry for each host in the same pod plus one for each other pod, rather than needing an entry for each host on the entire network. (Notice that hierarchical addressing is the same thing that allows IP to be scalable at layer 3, so the idea is to push that concept down into layer 2.)
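A quick count shows how large the saving is. The pod and host counts below are illustrative assumptions, and the function names are hypothetical; the formulas follow the description above (one entry per local host plus one per other pod, versus one per host everywhere).

```python
def flat_entries(pods, hosts_per_pod):
    """Flat layer 2: one forwarding entry per host in the whole network."""
    return pods * hosts_per_pod

def hierarchical_entries(pods, hosts_per_pod):
    """Pseudo-MAC scheme: one entry per local host plus one per other pod."""
    return hosts_per_pod + (pods - 1)

# Assumed sizes: 48 pods of 576 hosts each.
flat = flat_entries(48, 576)
hier = hierarchical_entries(48, 576)
```

With these assumed sizes the table shrinks from 27,648 entries to 623.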

Answer 17

- Network load balancing: prevents bottleneck links and heavily loaded aggregation or core switches.
- Higher capacity: since the network is balanced, more hosts can reasonably be hosted on a network with the same number of switches.
- Shorter paths: a shorter average number of hops between any two hosts results in faster network performance.
- Incremental expansion: allows adding switches to the network without reconfiguring the existing network infrastructure or adding additional “higher-level” switches.

Answer 18

- Does not handle heterogeneous switch devices well.
- Long cable runs between random switch pairs may be necessary, but are inconvenient and difficult to install.

Answer 19

The Fabric Manager is primarily responsible for maintaining network configuration soft state. Using this soft state, the Fabric Manager performs ARP resolution, provides multicast capability to the network, and achieves fault tolerance goals

Answer 20

The Fabric Manager is a user process, running on a dedicated machine. This machine may be located on the network itself, or it can reside on a separate control network.

Answer 21

A PMAC encodes the position of an end host in a fat-tree network. This encoding consists of four components in the format pod.position.port.vmid:
- pod component: encodes the number of the pod in which the end host and the edge switch reside.
- position component: encodes the end host’s position in the pod.
- port component: encodes the switch’s physical port number the end host is attached to.
- vmid component: encodes a unique ID for each virtual machine that is present on the end host.
The edge switch maintains a mapping for each VM, which uses its own AMAC (actual MAC) address. This permits multiplexing of virtual hosts resident on a single physical host.
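The pod.position.port.vmid layout can be packed into a 48-bit MAC-sized value. The sketch below assumes the field widths given in the PortLand paper (16, 8, 8, and 16 bits), which the text above does not state; the function names are hypothetical.

```python
# Assumed field widths in bits: pod=16, position=8, port=8, vmid=16 (48 total).
FIELDS = (("pod", 16), ("position", 8), ("port", 8), ("vmid", 16))

def encode_pmac(pod, position, port, vmid):
    """Pack the four components into a colon-separated 48-bit address."""
    values = {"pod": pod, "position": position, "port": port, "vmid": vmid}
    for name, bits in FIELDS:
        if not 0 <= values[name] < (1 << bits):
            raise ValueError(f"{name} does not fit in {bits} bits")
    packed = (pod << 32) | (position << 24) | (port << 16) | vmid
    return ":".join(f"{byte:02x}" for byte in packed.to_bytes(6, "big"))

def decode_pmac(pmac):
    """Unpack a PMAC string back into (pod, position, port, vmid)."""
    packed = int(pmac.replace(":", ""), 16)
    return (
        (packed >> 32) & 0xFFFF,
        (packed >> 24) & 0xFF,
        (packed >> 16) & 0xFF,
        packed & 0xFFFF,
    )

pmac = encode_pmac(pod=3, position=1, port=2, vmid=1)
fields = decode_pmac(pmac)
```

Because the pod number occupies the leading bits, a switch can forward on a prefix of the address, which is what makes the hierarchy useful.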

Answer 22

The use of PMACs greatly simplifies layer 2 forwarding due to their hierarchical nature. Switches no longer need a forwarding table entry per virtual host. A single forwarding table entry can be used to aggregate hosts, enabling forwarding behavior that exploits longest prefix match. Using AMACs, switch state size is O(n), where n is the number of virtual hosts in the data center, whereas state size is O(k) for PMACs, where k is the number of ports on the switches used to construct the fat-tree network.

Answer 23

To create a Jellyfish topology, we need to know three values:
- N, the number of racks / switches,
- k, the number of ports per switch,
- r, the number of ports to be used to connect to other switches.
Next, an approximation algorithm is used to generate an RRG (Random Regular Graph) using N, k, and r as input. The result is a blueprint for the Jellyfish topology that can be used to physically cable the switches and servers.

Answer 24

To incrementally add a new server rack, it is not necessary to generate a new RRG with N+1, k, and r. At a high level, we can add the new rack by iteratively selecting a connection between two other ToR switches (neither of which is already connected to the new ToR switch) and replacing that connection with two new connections, one from each of those switches to the new switch. This maintains the previous connectivity of the topology, and also consumes two of the r ports on the new ToR switch that are dedicated to connecting to other ToR switches. This process is repeated until one or zero of the r ports remain. It is important to note that after expansion, the new topology cannot be expected to be uniformly random, as it would be if a new RRG were created and the entire data center re-cabled appropriately.
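The expansion step can be sketched as an edge swap on an adjacency map. This is a minimal sketch under stated assumptions: `add_switch` and the starting 8-switch graph are hypothetical, and switches are labelled with integers so that each link is listed once.

```python
import random

def add_switch(adj, new, r, rng):
    """Splice switch `new` into adj ({switch: set(neighbours)}) using r ports."""
    adj[new] = set()
    free = r
    while free >= 2:
        # Existing links whose endpoints are not yet connected to the new switch.
        edges = [(a, b) for a in adj for b in adj[a]
                 if a < b and new not in (a, b)
                 and a not in adj[new] and b not in adj[new]]
        if not edges:
            break
        a, b = rng.choice(edges)
        adj[a].discard(b)            # remove the old link a - b
        adj[b].discard(a)
        for x in (a, b):             # add links a - new and b - new
            adj[x].add(new)
            adj[new].add(x)
        free -= 2
    return adj

# Assumed starting point: 8 switches, each linked to its 1st and 2nd ring neighbours.
adj = {i: {(i + d) % 8 for d in (-2, -1, 1, 2)} for i in range(8)}
add_switch(adj, new=8, r=4, rng=random.Random(0))
```

Each swap leaves the degree of the two existing switches unchanged, so only the cables touching those two switches and the new rack need to be moved.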

Answer 25

Incremental network expansion: adding servers and network capacity incrementally to the data center.
Solutions:
- Planned overprovisioning of space and power, or upgrading old servers to a larger number of more powerful but energy-efficient new servers.
- Replace a switch with one of larger port count, or oversubscribe certain switches; this makes capacity distribution constrained and uneven across the servers.
- Leave free ports for future network connections [14, 20]; this wastes investment until actual expansion.

Answer 26

Jellyfish is a degree-bounded random graph topology among top-of-rack (ToR) switches. The inherently sloppy nature of this design has the potential to be significantly more flexible than past designs. Additional components (racks of servers, or switches to improve capacity) can be incorporated with a few random edge swaps. The design naturally supports heterogeneity, allowing the addition of newer network elements with higher port counts as they become available, unlike past proposals, which depend on certain regular port counts. Jellyfish also allows construction of arbitrary-size networks, unlike the topologies discussed above, which limit the network to very coarse design points dictated by their structure. Jellyfish supports more servers than a fat tree built using the same network equipment while providing at least as high per-server bandwidth, measured either via bisection bandwidth or in throughput under a random-permutation traffic pattern. In addition, Jellyfish has lower mean path length, and is resilient to failures and miswirings.

Answer 27

Routing (schemes depending on a structured topology are not applicable), physical construction, and cabling layout.

Answer 28

lack of scalability, difficult management, inflexible communication, or limited support for virtual machine migration

Answer 29

a scalable, fault tolerant layer 2 routing and forwarding protocol for data center environments. PortLand holds promise for supporting a “plug-and-play” large-scale, data center network. The goal of PortLand is to deliver scalable layer 2 routing, forwarding, and addressing for data center network environments.

Answer 30

- R1. Any VM may migrate to any physical machine. Migrating VMs should not have to change their IP addresses, as doing so will break pre-existing TCP connections and application-level state.
- R2. An administrator should not need to configure any switch before deployment.
- R3. Any end host should be able to efficiently communicate with any other end host in the data center along any of the available physical communication paths.
- R4. There should be no forwarding loops.
- R5. Failures will be common at scale, so failure detection should be rapid and efficient. Existing unicast and multicast sessions should proceed unaffected to the extent allowed by underlying physical connectivity.

Answer 31

PortLand employs a logically centralized fabric manager that maintains soft state about network configuration information such as topology. The fabric manager is a user process running on a dedicated machine responsible for assisting with ARP resolution, fault tolerance, and multicast