Monitoring Flashcards
Differences between the CloudWatch applications? (Logs, Alarms, Events)
Logs: cotains log files, can set alarm and react to changes in AWS resources - can be viewed in real time
Alarms: Monitors a single metric and perfoms actions of it
Events: near real time stream of system events
What is free of charge in monitoring?
3 Dashboards Basic monitoring for EC2 instances Metrics for EBS, ELB and RDS 50 metrics, 10 alarms 1 mio. API requests 5min Refresh Rate
Is CloudWatch inter-regional?
Metrics are only available for one region
Ways to create custom metrics
PutMetricData API
monitoring scripts for Win. / Linux
Applications from the AWS Partner Network
How is the retention for the metrics (outside of Cloudwatch Logs)
1 min for 15 days
5 min for 63 days
1 hr for 455 days
Metrics cannot be deleted - they expire after 15month if no data is incoming
What are the reasons for a system status checks to fail?
Loss of network connectivity
Loss of system power
Software issues of host system
Hardware issues of the host
What are the reasons for instance status failures?
Network configuration issues Incorrect configurations of the OS Exhausted memory Corrupt file system Kernel issues
What is the purpose of CloudWatch Alarms?
Initiate automatic action in response to a predefined condition of a single metric
What are the status of CloudWatch Alarms?
OK
ALARM
INSUFFICIENT_DATA
When are events created?
Change in AWS resource status (for example EC2 instance stopping)
Events sent by CloudTrail, user login e.g.
Issued by a scheduled (cron) basis
What is the Cost Explorer?
Review of the costs of the last 13 month and a forecast of the next 3
It also generates detailed CSV reports
How are inter-regional Trails managed?
All trails are applied to all regions by default, but it is not one trail but a copy to all regions
What can be done with AWS Config?
Create snapshot of current environments
Historical configurations
Notifications then resources change
See relations between resources
What is the maximum number of months of history that AWS Cost Explorer displays?
13 months
What are the basic metrics of EC2?
CPUCreditBalace
CPUUtilization
Network In / Network Out
What are the basic metrics of EBS?
Volume IdleTime
V. ReadBytes / ReadOps
V WriteByte / WriteOps
What are the basic metrics of an ALB?
Active Connections Rejected Connections (Un)HealthyHostCounts Status Codes RequestCounts
What are the basic metrics of RDS?
CPUUtilization
ReadIOPS / WriteIOPS
Free Space
Active DB Connections
What is the definition of a custom metric?
- Anything generated inside the OS
- Coming from outside the resources - as long as it has access to the CloudWatch Service API
- Application Metrics like RAM or Disk Space
What is needed to push custom metrics?
CloudWatch Agent or scripts/SDK
Appropriate IAM access
What action should be taken if an EC2 instance (behind a ASG) is in an alarm state?
Terminate the instance - the auto scaling group will boot another one
What are the EBS Volume Status Checks? (non prov. IOPS)
oka
warning
impared
insufficient-data
What are (some) RDS status checks?
Available Backing Up Creating / Deleting Failed Mainenance Rebooting
Which arichtectural choice improve network performance?
Single AZ
Placement Groups
Enhances Networking
Keeping Traffic in the VPC