Modern Incident and Change Management
Best of 2023: ‘Scrum == Cancer’ ¦ Plus: Linux 6.5 Ships
In this week’s #TheLongView: Scrum sucks, sources say; and here comes the Linux 6.5 kernel ...
Microsoft kills Python 3.7 ¦ … and VBScript ¦ Exascaling ARM on Jupiter
In this week’s #TheLongView: VS Code drops support for Python 3.7, Windows drops VBScript, and Europe plans the fastest ARM supercomputer ...
Oracle Bill is 5x Client’s Budget ¦ Toyota Out of Space
In this week’s The Long View: Birmingham looks like the Detroit of the UK—is it Oracle’s fault? Plus: Was Toyota’s factory failure caused by running out of disk space? ...
80% of Bosses ‘Regret’ Stopping WFH ¦ PSA: Disable STS!
In this week’s #TheLongView: Rethinking return-to-office mandates and a ridiculous, ancient Windows bug ...
Microsoft’s 9th Outage in 2023 ¦ RISE of RISC-V ¦ Meta Ends WFH
In this week’s #TheLongView: Redmond SaaS keeps failing, RISC-V is RISEing, and Meta is enforcing hybrid work ...
Cloudflare Outage Outrage ¦ Yet More FAA 5G Stupidity
In this week’s The Long View: Cloudflare suffers another huge outage while the FAA and FCC still disagree over 5G/NR near airports ...
The Evolution of Incident Management
Have you ever thought about the history of incident management? If you’re an SRE, you might be so caught up in the day-to-day work of managing reliability and responding to incidents that ...
Choosing an Incident Management Platform
When you’re feeling the stress and pain of manually managing incidents and incident response, making the decision to find an incident management tool is a no-brainer. But how do you choose the ...
Best Practices for Cloud Incident Response
Cloud computing is now mainstream, with almost all organizations running at least some resources in the public cloud—whether software-as-a-service (SaaS), platform-as-a-service (PaaS) or infrastructure-as-a-service (IaaS). Security teams have been scrambling to adapt ...
What Chaos Engineering Is (and Isn’t)
The birth of chaos engineering happened somewhat accidentally in 2008 when Netflix moved from the data center to the cloud. The move didn’t go as planned. The thinking at the time was ...
SREs Say AIOps Doesn’t Live Up to the Hype
What can you expect when investing in artificial intelligence for IT operations (AIOps)? Real-time visibility across huge volumes of information. Lightning-fast event correlation and anomaly detection. Automated remediation and self-healing, without Ops ...
Code Ownership Is Key to Accelerate Debugging
App stability is a fundamental part of every app experience. Broadly speaking, app stability is a measurement of the number of total app sessions that are crash-free or the percentage of daily ...