A global survey of site reliability engineers (SREs) found diagnosing issues is the most difficult aspect of incident management.
Real-time app monitoring is about fundamentally shifting your mindset toward a culture of accountability and continuous improvement.
We have built some beautiful toolchains that crank out a finished product on the fly without needing anything close to…
In this week’s The Long View: A S. Korean conflagration leads to a ridiculously long outage, and the price of…
Outages happen, it’s inevitable. But, unplanned downtime often comes with substantial costs—not only in terms of recovery and revenue loss,…
What makes an app reliable? If you ask most IT professionals that question, their minds immediately go to uptime. That’s…
Regardless of whether an organization subscribes to a bimodal IT approach, one thing is clear: It’s more important, and more…
Here are some other ROELBOB’s you might like A Unit Testing Dillema The Pink Slip Another Pink Slip…
The goal of every SaaS provider should be 100% availability - it should be ingrained in their culture. But it's…