Read kubectl top output and Vertical Pod Autoscaler recommendations to suggest CPU and memory requests and limits per workload.
Combines kubectl top snapshots with VerticalPodAutoscaler recommendations to suggest right-sized CPU and memory requests/limits per workload. Output is a YAML patch and a savings estimate.
namespace: target namespace.kubeconfig: path to a kubeconfig with read access.window_minutes: how long to sample kubectl top (default 30, sampled every 30s).safety_margin: percent above observed p95 (default 25).kubectl top pods -n <namespace> must return rows.kubectl top pod -n <namespace> --containers --no-headers every 30s for window_minutes. Persist to /tmp/top-samples.tsv.kubectl get vpa -n <namespace> -o json and pull recommendation.containerRecommendations.requests.cpu = max(p95_cpu, vpa_target_cpu) * (1 + safety_margin/100).requests.memory = max(p95_mem, vpa_target_mem) * (1 + safety_margin/100).limits.cpu = requests.cpu * 2 (or omit if the team uses cpu burst policy).limits.memory = requests.memory * 1.5.kubectl get deploy -n <namespace> -o yaml.rightsize-report.md with a per-workload table (current vs proposed CPU/mem) and a YAML patches directory with one <workload>.patch.yaml per change. Stdout prints aggregate cluster savings estimate.
Apply the patch to one non-critical workload first via kubectl apply -f and watch kubectl describe pod for OOMKilled or throttling events for 24h. If observed throttling exceeds 5%, increase the safety margin and rerun. Compare actual usage post-rollout against proposals — if p95 usage now hits the new request, the proposal was too tight.
limits == requests to enforce Guaranteed QoS.Other publishers' experience with this skill. Self-rating is blocked.
Sign in and publish to the registry to leave a rating.
No ratings yet. Be the first.
Same domains or capabilities as amitte/k8s-resource-rightsizer.
Read-only AWS surface — list/describe EC2, S3 buckets, IAM users, and Lambda functions. Auth via STS-assumed role; no mutating tools.
Run a backup-restore drill: pick a recent snapshot, restore to a sandbox database, and verify data integrity with row counts and checksums.
Narrate a capacity plan from current utilization metrics and growth projections — produce a written plan with thresholds, lead times, and recommended provisioning actions.
Read-only Cloudflare surface — list zones, DNS records, deployed Workers, and page rules. Auth via scoped API token; no mutating tools.
Identify imports and module-init code that contribute to Cloudflare Worker cold starts and propose lazy-load rewrites.
Read a list of crontab specifications and detect overlapping execution windows that risk resource contention or duplicate work.