Topic: site reliability engineering

Making site reliability Blameless

With site reliability growing in importance as software releases increase in frequency and complexity, a startup called Blameless today released an SRE platform that can handle the increasing velocity of code deployments while offering faster, more efficient incident resolution. Ashar Rizqi, CEO of Blameless, said the company’s vision is to enable any modern software business … continue reading

Google introduces Stackdriver IRM for Site Reliability Engineering

Google announced a new Site Reliability Engineering-inspired tool for investigating, understanding, mitigating and recovering from incidents quickly and efficiently. Stackdriver Incident Response and Management (IRM) on Google Cloud Platform is available as an alpha version and features new monitoring tools for SRE journeys. After facing availability and reliability challenges, Google created SRE and SRE principles … continue reading

O’Reilly Velocity: SRE is an opinionated implementation of DevOps

When Google first came up with the term Site Reliability Engineering, it stemmed from its own production growth and challenges. “SRE is what you get when you treat operations as if it’s a software problem. Our mission is to protect, provide for, and progress the software and systems behind all of Google’s public services — … continue reading
