Auto Scaling

Auto Scaling is an AWS service that adjusts infrastructure resources and capacity in response to demand and usage, so that applications deliver steady performance to end users.
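To make the idea of adjusting capacity to demand concrete, here is a minimal sketch of a proportional scaling rule. It is a simplification for illustration and not AWS's actual algorithm; the function name `desired_capacity` and all of its parameters are hypothetical.

```python
import math


def desired_capacity(current_capacity, metric_value, target_value,
                     min_capacity, max_capacity):
    """Estimate the capacity needed to bring a utilization metric back to target.

    Simplified, assumed rule: scale capacity in proportion to how far the
    observed metric (e.g. average CPU %) is from the target value.
    """
    if target_value <= 0:
        raise ValueError("target_value must be positive")
    # Proportional estimate, rounded up so we never under-provision.
    raw = math.ceil(current_capacity * metric_value / target_value)
    # Keep the result inside the configured bounds.
    return max(min_capacity, min(max_capacity, raw))


if __name__ == "__main__":
    # 4 units running at 75% against a 50% target: scale out.
    print(desired_capacity(4, 75.0, 50.0, min_capacity=2, max_capacity=10))
    # 4 units running at 20% against a 50% target: scale in, but not below the minimum.
    print(desired_capacity(4, 20.0, 50.0, min_capacity=2, max_capacity=10))
```

The minimum and maximum bounds are what keep both performance and cost under control: the rule can add capacity under load, but never beyond the ceiling, and it releases capacity when utilization drops, but never below the floor.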

Auto Scaling also helps reduce cost, because only the resources that current utilization actually requires are kept in use. The service lets us build scaling plans that automate how groups of different resources respond to changes in demand. The following resources can be part of a scaling plan:

  • Amazon EC2 instances – Launch or terminate Amazon EC2 instances in an Amazon EC2 Auto Scaling group
  • Amazon ECS – Adjust the desired count of an ECS service up or down in response to load variations
  • Amazon DynamoDB – Enable a DynamoDB table or a global secondary index to increase its provisioned read and write capacity to handle sudden increases in traffic without throttling
  • Amazon Aurora – Dynamically adjust the number of Aurora Read Replicas provisioned for an Aurora DB cluster to handle sudden increases in active connections or workload
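The resource types above can be summarized as a small lookup table that pairs each one with the quantity a scaling plan adjusts. This is a sketch only: the dictionary keys and the helper `scaling_target` are hypothetical labels, and the namespace and dimension strings reflect my understanding of the AWS Auto Scaling plans API, so they should be checked against the current AWS documentation before use. The code makes no AWS calls.

```python
# Hypothetical keys mapped to (service namespace, scalable dimension).
# The string values are assumed to match the AWS Auto Scaling plans API.
SCALABLE_RESOURCES = {
    "ec2_auto_scaling_group": ("autoscaling", "autoscaling:autoScalingGroup:DesiredCapacity"),
    "ecs_service": ("ecs", "ecs:service:DesiredCount"),
    "dynamodb_table_read": ("dynamodb", "dynamodb:table:ReadCapacityUnits"),
    "dynamodb_table_write": ("dynamodb", "dynamodb:table:WriteCapacityUnits"),
    "dynamodb_index_read": ("dynamodb", "dynamodb:index:ReadCapacityUnits"),
    "dynamodb_index_write": ("dynamodb", "dynamodb:index:WriteCapacityUnits"),
    "aurora_cluster": ("rds", "rds:cluster:ReadReplicaCount"),
}


def scaling_target(resource_type, min_capacity, max_capacity):
    """Build a plain dict describing what would be scaled and within which bounds."""
    if min_capacity > max_capacity:
        raise ValueError("min_capacity must not exceed max_capacity")
    namespace, dimension = SCALABLE_RESOURCES[resource_type]
    return {
        "ServiceNamespace": namespace,
        "ScalableDimension": dimension,
        "MinCapacity": min_capacity,
        "MaxCapacity": max_capacity,
    }


if __name__ == "__main__":
    # Example: an Aurora cluster allowed to run between 1 and 5 read replicas.
    print(scaling_target("aurora_cluster", 1, 5))
```

Note that the unit being scaled differs per service: instance count for EC2, task count for ECS, capacity units for DynamoDB, and replica count for Aurora.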


