Jake Cohen
Posts by Jake Cohen
Balancing Centralization and Autonomy: The Key to Automation at Scale
The recent global outage reminds us that identifying issues and their impact radius is just the first part of a lengthy process to remediation. ...
AWS Orchestration with Systems Manager & Runbook Automation
“We have the automation, but it will need to be invoked separately for each account. Doable, but time consuming and error-prone. Oh, and only ...
Debugging Kubernetes with Automated Runbooks & Ephemeral Containers
In our previous blog, we discussed the difficulty in capturing all relevant diagnostics during an incident before a “band-aid” fix is applied. The...