Auto Scaling Groups
Auto Scaling Groups (ASGs) on Krutrim Cloud provide a managed way to automatically adjust virtual machine (VM) capacity in response to demand. ASGs help users maintain application performance while optimizing cost and operational efficiency by scaling resources up or down based on policy-driven triggers or scheduled actions.
ASGs can be created directly or derived from reusable templates, enabling users to standardize instance configurations and group behaviors across environments.
This document explains how ASGs function on Krutrim Cloud, how to configure and manage them, and how templates, policy configuration, dashboards, and UI behaviors work.
Key Concepts
1) Auto Scaling Group
An Auto Scaling Group is a logical construct that:
Launches VMs based on instance configuration.
Adjusts the number of running VMs automatically.
Applies policies such as CPU usage, memory usage, or schedule-based triggers.
Keeps the VM count within the defined minimum, desired, and maximum capacity boundaries.
ASGs are suitable for:
Web servers and APIs with fluctuating load
ML inference services
Queue processing and worker pools
Batch workloads with predictable schedules
2) ASG Templates
Creating an Auto Scaling Group
ASG Specification
Name: A unique identifier for the ASG within the selected VPC.
Creation Mode:
From Scratch: All fields start empty.
From Template: Selecting a template pre-populates values. All values remain editable.
Region
Auto-filled from the global region selector.
Users can override it via the dropdown.
VPC: Only VPCs in the selected region are shown. The list updates dynamically when the region changes.
Subnet:
Single-select from subnets within the chosen VPC.
Subnets are shown with CIDR and public/private tag.
Security Groups:
Multi-select.
Lists all security groups associated with the selected VPC.
Instance Configuration
Instance Name: Base name used as prefix for VMs launched by this ASG.
Instance Type: Selected from the available CPU instance types, each shown with its specifications and per-hour pricing.
Scaling Configuration:
Minimum capacity
Maximum capacity
Validation:
min ≤ max
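The capacity rule above can be sketched as a small check. This is a minimal illustration, not Krutrim Cloud's implementation: the `ScalingConfig` class, its field names, and the `clamp` helper are hypothetical, and the non-negative check is an assumption that the source does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalingConfig:
    """Hypothetical container for the two capacity bounds of an ASG."""
    min_capacity: int
    max_capacity: int

    def __post_init__(self) -> None:
        # Assumption: capacities are non-negative integers.
        if self.min_capacity < 0 or self.max_capacity < 0:
            raise ValueError("capacities must be non-negative")
        # Rule from the form: min <= max.
        if self.min_capacity > self.max_capacity:
            raise ValueError("minimum capacity must not exceed maximum capacity")

    def clamp(self, requested: int) -> int:
        """Keep a requested VM count inside [min_capacity, max_capacity]."""
        return max(self.min_capacity, min(requested, self.max_capacity))
```

For example, with a minimum of 1 and a maximum of 5, a request for 9 VMs would be clamped to 5, and a configuration with a minimum of 3 and a maximum of 2 would be rejected.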
Bootable Volume:
Configured with:
Size
Name
Machine image
Additional Storage Volume:
Users may attach multiple volumes.
All such volumes are ephemeral and are deleted when the instance terminates.
SSH Key:
Used for secure access to VMs created by the ASG.
Scaling Policies
An ASG may include at most one policy of each of the following types:
Average CPU Utilization
Average Memory Utilization
Scheduled Scaling Action
Policies are additive: different policy types can be combined on the same ASG, but only one policy of each type is allowed.
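The one-policy-per-type rule could be checked along the following lines. The type identifiers (`cpu_utilization`, `memory_utilization`, `scheduled`) and the function name are hypothetical labels chosen for this sketch, not names taken from the Krutrim Cloud API.

```python
from collections import Counter

# Hypothetical identifiers for the three policy types listed above.
ALLOWED_POLICY_TYPES = {"cpu_utilization", "memory_utilization", "scheduled"}


def check_policy_types(policy_types: list[str]) -> None:
    """Raise ValueError if a type is unknown or appears more than once."""
    unknown = sorted(set(policy_types) - ALLOWED_POLICY_TYPES)
    if unknown:
        raise ValueError(f"unknown policy types: {unknown}")
    duplicates = sorted(t for t, n in Counter(policy_types).items() if n > 1)
    if duplicates:
        raise ValueError(f"only one policy per type is allowed: {duplicates}")
```

Combining a CPU policy with a scheduled policy passes this check, while two CPU policies on the same ASG would be rejected.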
CPU / Memory Utilization Policies
Each policy includes:
Upscale Target (%): VM count increases when aggregated usage exceeds this threshold.
Downscale Target (%): VM count decreases when usage falls below this threshold.
Scale-out Cooldown: Prevents immediate re-execution of scale-out logic.
Scale-in Cooldown: Prevents immediate re-execution of scale-in logic.
Validation rules:
Upscale and downscale values may be equal (allowed with warning).
Downscale target may not exceed upscale target.
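The thresholds, cooldowns, and validation rules above can be sketched as follows. This is an illustration under stated assumptions rather than the platform's actual logic: the class and function names are hypothetical, cooldowns are assumed to be measured in seconds since the last action of the same direction, and each decision is assumed to change capacity by one VM.

```python
import warnings
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class UtilizationPolicy:
    """Hypothetical model of a CPU or memory utilization policy."""
    upscale_target: float      # percent
    downscale_target: float    # percent
    scale_out_cooldown_s: int  # assumption: seconds
    scale_in_cooldown_s: int   # assumption: seconds


def validate_policy(p: UtilizationPolicy) -> None:
    # Assumption: targets are percentages in [0, 100].
    for value in (p.upscale_target, p.downscale_target):
        if not 0 <= value <= 100:
            raise ValueError("targets must be between 0 and 100")
    # Rule: the downscale target may not exceed the upscale target.
    if p.downscale_target > p.upscale_target:
        raise ValueError("downscale target may not exceed upscale target")
    # Rule: equal targets are allowed, but with a warning.
    if p.downscale_target == p.upscale_target:
        warnings.warn("upscale and downscale targets are equal")


def decide(p: UtilizationPolicy, avg_usage: float, now_s: float,
           last_out_s: Optional[float], last_in_s: Optional[float]) -> int:
    """Return +1 (scale out), -1 (scale in), or 0 (no change)."""
    if avg_usage > p.upscale_target:
        # Scale out only if the scale-out cooldown has elapsed.
        if last_out_s is None or now_s - last_out_s >= p.scale_out_cooldown_s:
            return 1
        return 0
    if avg_usage < p.downscale_target:
        # Scale in only if the scale-in cooldown has elapsed.
        if last_in_s is None or now_s - last_in_s >= p.scale_in_cooldown_s:
            return -1
    return 0
```

With targets of 70% and 30% and a 300-second scale-out cooldown, an average usage of 85% would trigger a scale-out, but a second trigger 100 seconds later would be suppressed until the cooldown expires.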
Scheduled Actions
Scheduled policies allow users to define:
A scale-up time
A scale-down time
The scale-up and scale-down times cannot be identical.
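The scheduled-action constraint can be sketched as a one-line check. The source does not specify how schedule times are represented; this sketch assumes they are times of day and uses Python's `datetime.time`. The function name is hypothetical.

```python
from datetime import time


def validate_schedule(scale_up: time, scale_down: time) -> None:
    """Reject a schedule whose scale-up and scale-down times coincide."""
    # Assumption: both values are times of day in the same time zone.
    if scale_up == scale_down:
        raise ValueError("scale-up and scale-down times cannot be identical")
```

A schedule that scales up at 09:00 and down at 18:00 passes this check, while one that sets both actions to 09:00 is rejected.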
Saving as Template
At the end of the creation process, users may enable:
Save as Template
Creates a template named <ASG Name> Template.
Captures the full configuration (instance + ASG + policies).
Template becomes available for future ASG creation.
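The template naming and capture behavior described above might look like the following sketch. The dictionary layout, the key names, and the `build_template` function are hypothetical and only illustrate the "<ASG Name> Template" naming rule and the three captured parts; the platform's actual template format is not shown in this document.

```python
import copy


def build_template(asg_name: str, instance: dict, asg: dict,
                   policies: list) -> dict:
    """Return a hypothetical template record for an ASG configuration."""
    return {
        # Naming rule from the text: "<ASG Name> Template".
        "name": f"{asg_name} Template",
        # Deep copies so later edits to the ASG do not alter the template.
        "instance": copy.deepcopy(instance),
        "asg": copy.deepcopy(asg),
        "policies": copy.deepcopy(policies),
    }
```

Copying the configuration mirrors the "From Template" creation mode: values taken from a template stay editable on the new ASG without changing the saved template.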

