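[Posted]: 2021-07-04 01:02:02
[Question]:
When I create autoscaling policy and CloudWatch alarm resources in Terraform v13, they are created without errors. However, when I load test the endpoint, the service scales out successfully but fails to scale in once CPU utilization has stayed at the required percentage for the evaluation period. The error looks like this:
"historySummary": "Failed to execute AutoScaling action: No step adjustment found for metric value [5.99763732496649, 2.7634547331059975] and breach delta -4.00236267503351"
Listed below are the Terraform resources:
Autoscaling policy -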
resource "aws_appautoscaling_policy" "frontend_down" {
  name               = "${var.name}_frontend_scale_down"
  service_namespace  = "ecs"
  resource_id        = "service/${aws_ecs_cluster.main.name}/${aws_ecs_service.frontend.name}"
  scalable_dimension = "ecs:service:DesiredCount"
  step_scaling_policy_configuration {
    adjustment_type         = "ChangeInCapacity"
    cooldown                = 30
    metric_aggregation_type = "Maximum"
    step_adjustment {
      metric_interval_lower_bound = 0
      scaling_adjustment          = -1
    }
  }
  depends_on = [aws_appautoscaling_target.frontend_target]
}
CloudWatch alarm -
resource "aws_cloudwatch_metric_alarm" "frontend_service_cpu_low" {
  alarm_name          = "${var.name}_cpu_utilization_low_fe"
  comparison_operator = "LessThanOrEqualToThreshold"
  evaluation_periods  = "2"
  metric_name         = "CPUUtilization"
  namespace           = "AWS/ECS"
  period              = "60"
  statistic           = "Average"
  threshold           = "10"
  dimensions = {
    ClusterName = var.ecs_cluster_name
    ServiceName = var.ecs_service_name_frontend
  }
  alarm_actions = [var.autoscaling_down_arn_frontend]
  tags = {
    Name        = "${var.name}-autoscaling"
    BillingCode = var.billing_code_tag
    Environment = var.environment_tag
  }
}
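[Discussion]:
- Found the cause: in the scale-down policy I was using "metric_interval_lower_bound" instead of "metric_interval_upper_bound". When scaling down, the CloudWatch metric sits below the alarm threshold, so the breach delta (metric value minus threshold) is negative and 0 has to be the upper bound of the step interval. When scaling up, you use the lower bound, because the delta is positive.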
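For reference, here is a sketch of what the corrected scale-down policy might look like. It reuses the resource from the question and changes only the bound inside step_adjustment; it has not been run against a real account, so treat it as an illustration of the fix rather than a verified configuration.

```hcl
resource "aws_appautoscaling_policy" "frontend_down" {
  name               = "${var.name}_frontend_scale_down"
  service_namespace  = "ecs"
  resource_id        = "service/${aws_ecs_cluster.main.name}/${aws_ecs_service.frontend.name}"
  scalable_dimension = "ecs:service:DesiredCount"

  step_scaling_policy_configuration {
    adjustment_type         = "ChangeInCapacity"
    cooldown                = 30
    metric_aggregation_type = "Maximum"

    # Scale-in: the metric is below the alarm threshold, so the breach
    # delta is negative. With the values from the error above:
    #   5.99763732496649 - 10 = -4.00236267503351
    # An interval that starts at 0 and goes up never contains that delta,
    # so the interval must instead end at 0 and extend downward.
    step_adjustment {
      metric_interval_upper_bound = 0
      scaling_adjustment          = -1
    }
  }

  depends_on = [aws_appautoscaling_target.frontend_target]
}
```

The matching scale-up policy would keep metric_interval_lower_bound = 0 with a positive scaling_adjustment, since its breach delta is positive.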
Tags: javascript amazon-web-services terraform amazon-cloudwatch autoscaling