Amazon EC2 for SysOps Flashcards

Question

AWS charges for IPv6 addresses

Answer 1

* What about IPv6? * Unfortunately, many Internet Service Provider (ISP) around the world don’t support IPv6, so the course would not work for some of you * You can test IPv6 by going to https://test-ipv6.com/ * If you use IPv6 in this course, you’re on your own (security groups, networking…) but you can do it! * How to troubleshoot charges? * Go into your AWS Bill * Look into the AWS Public IP Insights service * Nice article here: https://repost.aws/articles/ARknH_OR0cTvqoTfJrVGaB8A/why-am-i-seeing-charges-for-public-ipv4-addresses-when-i-am-under-the-aws-free-tier

Answer 2

* Can get a discount of up to 90% compared to On-demand * Define max spot price and get the instance while current spot price < max * The hourly spot price varies based on offer and capacity * If the current spot price > your max price you can choose to stop or terminate your instance with a 2 minutes grace period. * Other strategy: Spot Block * “block” spot instance during a specified time frame (1 to 6 hours) without interruptions * In rare situations, the instance may be reclaimed * Used for batch jobs, data analysis, or workloads that are resilient to failures. * Not great for critical jobs or databases

Answer 3

see attachment Note : You can only cancel Spot Instance requests that are open, active, or disabled. Cancelling a Spot Request does not terminate instances You must first cancel a Spot Request, and then terminate the associated Spot Instances

Answer 4

* Spot Fleets = set of Spot Instances + (optional) On-Demand Instances * The Spot Fleet will try to meet the target capacity with price constraints- * Define possible launch pools: instance type (m5.large), OS, Availability Zone * Can have multiple launch pools, so that the fleet can choose * Spot Fleet stops launching instances when reaching capacity or max cost * Strategies to allocate Spot Instances: * lowestPrice: from the pool with the lowest price (cost optimization, short workload) * diversified: distributed across all pools (great for availability, long workloads) * capacityOptimized: pool with the optimal capacity for the number of instances * priceCapacityOptimized (recommended): pools with highest capacity available, then select the pool with the lowest price (best choice for most workloads) * Spot Fleets allow us to automatically request Spot Instances with the lowest price

Answer 5

* AWS has the concept of burstable instances (T2/T3 machines) * Burst means that overall, the instance has OK CPU performance. * When the machine needs to process something unexpected (a spike in load for example), it can burst, and CPU can be VERY good. * If the machine bursts, it utilizes “burst credits” * If all the credits are gone, the CPU becomes BAD * If the machine stops bursting, credits are accumulated over time

Answer 6

* Burstable instances can be amazing to handle unexpected traffic and getting the insurance that it will be handled correctly * If your instance consistently runs low on credit, you need to move to a different kind of non-burstable instance

Answer 7

* Experiment: run a CPU stress command (to peak at 100%) * After the credits are exhausted, the measured CPU utilization drops

Answer 8

* It is possible to have an “unlimited burst credit balance” * You pay extra money if you go over your credit balance, but you don’t lose in performance * If average CPU usage over a 24-hour period exceeds the baseline, the instance is billed for additional usage per vCPU/hour * Be careful, costs could go high if you’re not monitoring the CPU health of your instances

Answer 9

* When you stop and then start an EC2 instance, it changes its public IP * If you need to have a fixed public IP, you need an Elastic IP * An Elastic IP is a public IPv4 you own as long as you don’t delete it * You can attach it to one instance at a time * You can remap it across instances * You don’t pay for the Elastic IP if it’s attached to a server * You pay for the Elastic IP if it’s not attached to a server * With an Elastic IP address, you can mask the failure of an instance or software by rapidly remapping the address to another instance in your account. * You can only have 5 Elastic IP in your account (you can ask AWS to increase that).

Answer 10

* Always think if other alternatives are available to you * You could use a random public IP and register a DNS name to it * Or use a Load Balancer with a static hostname

Answer 11

AWS Provided metrics (AWS pushes them): * Basic Monitoring (default): metrics are collected at a 5 minute internal * Detailed Monitoring (paid): metrics are collected at a 1 minute interval * Includes CPU, Network, Disk and Status Check Metrics Custom metric (yours to push): * Basic Resolution: 1 minute resolution * High Resolution: all the way to 1 second resolution * Include RAM, application level metrics * Make sure the IAM permissions on the EC2 instance role are correct !

Answer 12

* CPU: CPU Utilization + Credit Usage / Balance * Network: Network In / Out * Status Check: * Instance status = check the EC2 VM * System status = check the underlying hardware * Attached EBS status = check attached EBS volumes Note - * Disk: Read / Write for Ops / Bytes (only for instance store) * RAM is NOT included in the AWS EC2 metrics

Answer 13

* For virtual servers (EC2 instances, on-premises servers,…) * Collect additional system-level metrics such as RAM, processes, used disk space, etc. * Collect logs to send to CloudWatch Logs - No logs from inside your EC2 instance will be sent to cloud watch logs without using an Agent. * Centralized configuration using SSM Parameter Store * Make sure IAM permissions are correct * Default namespace for metrics collected by Unified CloudWatch Agent is CWAgent (Can be configured and changed)

Answer 14

* Collect metrics and monitor system utilization of individual processes * Supports both Linux and Windows servers * Example: amount of time the process uses CPU, amount of memory the process uses, … * Select which processes to monitor by: * pid_file: name of process identification number (PID) files they create * exe: process name that match string you specify (RegEx) * pattern: command lines used to start the processes (RegEx) * Metrics collected by procstat plugin begins with “procstat” prefix (e.g., procstat_cpu_time, procstat_cpu_usage, …)

Answer 15

* Automated checks to identify hardware and software issues

Answer 16

Monitors Problems with AWS Systems (Software/hardware issues on the physical host, loss of power,....) Check Personal Health Dashboard for any schedule. critical maintenance by AWS to your instance's host Resolution: stop and start the instance (instance migrated to a new host)

Answer 17

Monitors software/network configuration of your instance (invalid network configuration, exhausted memory,...)

Answer 18

Monitors EBS Volumes attached to your instance (reachable & complete I/O operations) Resolution: Reboot the instance or replace affected EBS Volumes.

Answer 19

CloudWatch Metrics (1 minute interval) * StatusCheckFailed_System * StatusCheckFailed_Instance EC2 Instance * StatusCheckFailed_AttachedEBS * StatusCheckFailed (for any) Option 1: CloudWatch Alarm Recover EC2 instance with the same private/public IP, EIP, metadata, and Placement Group * Send notifications using SNS * Option 2: Auto Scaling Group * Set min/max/desired 1 to recover an instance but won't keep the same private and elastic IP.

Answer 20

* We know we can stop, terminate instances * Stop – s the data on disk (EBS) is kept intact in the next start * Terminate – any EBS volumes (root) also set-up to be destroyed is lost * On start, the following happens: * First start: the OS boots & the EC2 User Data script is run * Following starts: the OS boots up * Then your application starts, caches get warmed up, and that can take time!

Answer 21

Introducing EC2 Hibernate: * The in-memory (RAM) state is preserved * The instance boot is much faster! (the OS is not stopped / restarted) * Under the hood: the RAM state is written to a file in the root EBS volume * The root EBS volume must be encrypted Use cases: * Long-running processing * Saving the RAM state * Services that take time to initialize

Answer 22

* Supported Instance Families – C3, C4, C5, I3, M3, M4, R3, R4, T2, T3, … * Instance RAM Size – must be less than 150 GB. * Instance Size – not supported for bare metal instances. * AMI – Amazon Linux 2, Linux AMI, Ubuntu, RHEL, CentOS & Windows… * Root Volume – must be EBS, encrypted, not instance store, and large * Available for On-Demand, Reserved and Spot Instances * An instance can NOT be hibernated more than 60 days

Amazon EC2 for SysOps Flashcards

(47 cards)