Channels Are Free: Why We Gave Every Incident Its Own
“Channels are free!” At PagerDuty, that’s our standard response to “Should we create a Slack channel for this?” Sure, channels can multiply if not careful...
7 min read
“Channels are free!” At PagerDuty, that’s our standard response to “Should we create a Slack channel for this?” Sure, channels can multiply if not careful...
7 min read
As patterns for effective Agentic AI adoption become clearer, success will rely on an engineer’s ability to shift from “writing code” to “writing specifications”. From...
5 min read
If you feel like your technical debt is piling up while your engineering team gets more stretched by the week, you’re not alone. Framework upgrades...
9 min read
A Problem Worth Naming The week Forward Deployed Engineering (FDE) officially launched at PagerDuty, one of my engineers and I decided to find out...
12 min read
We didn’t try to build a clever agent. We built one that shows up pre‑armed. The lesson arrived earlier this year, as we began developing...
The holidays amplify an inherent risk to businesses: lighter staffing, heavier traffic, and zero appetite for surprises. In addition to locking in your coverage crew...
5 min read
Our official API client for Python has come a long way. It began in July 2018 as pdpyras (for PagerDuty Python REST API Sessions) out...
2 min read
PagerDuty and Logz.io have united efforts to bring more integrated AI capabilities from both platforms. This article shows how we are leveraging PagerDuty’s event intelligence...
5 min read
On October 20, 2025, a significant outage in AWS’s US-EAST-1 region rippled across the internet, affecting some of the most widely used SaaS, messaging, conferencing,...
8 min read
Inspired by Shopify’s boring update, we decided to dedicate a focused sprint this Fall to working on customer-driven improvements outside of our regular roadmap across...
9 min read
The Model Context Protocol (MCP) is quickly becoming the de facto standard for connecting tools to AI agents. Often described as the “USB-C for AI,”...
5 min read
On August 28, just before the long Labor Day weekend in the United States, PagerDuty experienced a service disruption in our U.S. service region. If...
4 min read
At 3:53 UTC on August 28, 2025 a failure on one of PagerDuty’s message queuing systems (Kafka) triggered cascading issues that disrupted or delayed the...
15 min read
One year ago, as AI agents started to make waves, a handful of us at PagerDuty began investigating how they might reshape the way we...
8 min read
When PagerDuty was founded over 15 years ago, we had an ambitious mission: ensure that critical incidents never go unnoticed, and that teams can respond...
12 min read
On-call is about more than just reducing mean time to acknowledge and mean time to resolve (MTTA and MTTR, respectively), it’s about improving the human...
This is the first in a three part series about how the PagerDuty Front-end team approaches their micro front-end architecture. The front-end of PagerDuty’s web...
8 min read
Late last year, we had an interesting problem occur with the Kafka clusters in our staging environment. Random hosts across several clusters started experiencing events...
12 min read