Topic: site reliability engineering

Transitioning to SRE

Over the years, there have been a lot of new methodologies that aim to help an organization manage their technology more efficiently, whether that means making programmers more efficient or the operators who manage a company’s technology infrastructure. DevOps, which sought to bring developers and operators together, is one such example of this, and one … continue reading

Making site reliability Blameless

As site reliability becomes more important as software releases grow in frequency and complexity, a startup called Blameless today released an SRE platform that can handle the increasing velocity of code deployments while offering faster, more efficient incident resolution. Ashar Rizqi, CEO of Blameless, said the company’s vision is to enable any modern software business … continue reading

Google introduces Stackdriver IRM for Site Reliability Engineering

Google announced a new Site Reliability Engineering-inspired tool for investigating, understanding, mitigating and recovering from incidents quickly and efficiently. Stackdriver Incident Response and Management (IRM) on Google Cloud Platform is available as an alpha version and features new monitoring tools for SRE journeys. After facing availability and reliability challenges, Google created SRE and SRE principles … continue reading

O’Reilly Velocity: SRE is an opinionated implementation of DevOps

When Google first came up with the term Site Reliability Engineering, it stemmed from its own production growth and challenges. “SRE is what you get when you treat operations as if it’s a software problem. Our mission is to protect, provide for, and progress the software and systems behind all of Google’s public services — … continue reading

DMCA.com Protection Status

Get access to this and other exclusive articles for FREE!

There's no charge and it only takes a few seconds.

Sign up now!