From the course: Complete Guide to AWS Software Deployment

Unlock the full course today

Join today to access over 24,400 courses taught by industry experts.

Understanding AWS Auto Scaling groups

Understanding AWS Auto Scaling groups

- [Instructor] Let's take a look at AWS Auto Scaling. One of the main ways that AWS allows you to dynamically change your compute capacity. When you deploy a web application and you're taking it out of the cozy confines of your development environment and into the real world, where it will be exposed to real world traffic. As your app's popularity increases, so will its load. Eventually, your single instance will be overwhelmed as too many web requests max out its CPU, RAM, or network capacity. To solve this, you could just trade out your single host for a more powerful machine. That's called vertical scaling and an EC2, it'll require a reboot and some downtime while you change your instance type. Or you could introduce another host, then add a load balancer to distribute the traffic across these two hosts. This is called horizontal scaling, and it tends to be a more robust, resilient way of scaling an application, since, unlike with vertical scaling, you can add capacity without…

Contents