Skip to content

DevOps

Deployment health, change risk assessment, incident triage, and RCA drafting. Currently in Beta.

Beta

DevOps skills are in Beta. Coverage and output quality are improving. Expected GA: Q3 2026.


What it covers

SkillWhat it detectsOutput
Deployment healthRecent Lambda, ECS, and EC2 deployment statusTriage summary
Incident triageCorrelated view of recent changes + CloudWatch alarms + log errorsTriage summary
RCA draftingTimeline reconstruction for a degradation or incidentRCA draft
Change risk scoringLikelihood of a planned change causing impact based on historical patternsFinding
Log correlationCloudWatch log error patterns correlated with estate changesTriage

Example prompts

What changed in the last 2 hours before the latency spike?

Triage the payments service degradation from this morning

What's the blast radius of deploying to the prod Lambda function right now?

Correlate recent deployments with the increase in 5xx errors

Required permissions

json
"cloudwatch:DescribeAlarms",
"cloudwatch:GetMetricStatistics",
"logs:DescribeLogGroups",
"logs:FilterLogEvents",
"logs:GetLogEvents",
"lambda:ListFunctions",
"ecs:DescribeServices",
"ecs:ListTasks",
"cloudtrail:LookupEvents"

Next steps

  • IAM — For change attribution (who deployed what)
  • Infra Ops — For resource topology context during triage

Escher — Agentic CloudOps by Tessell