Topic: sre

Puppet announces Relay for cloud-native, event driven automation

Puppet introduced the public beta of Relay, an event-driven automation platform that automates across any cloud infrastructure, tools and APIs that developers, DevOps engineers, and SREs are managing manually currently. “Without a way to manage and automate the flood of events and hundreds of APIs developers use – time, money and mental capital are being … continue reading

Red Hat: Culture change and automation are necessary to create DevOps and SRE teams

In order to bring more effective operational practices, DevOps and site reliability engineering (SRE) teams need to go through a culture change within the organization. Red Hat held its virtual summit this week where it talked about how to reinvent IT Ops as SRE.  According to the company, change can happen by automating processes and … continue reading

ITOps Times Open-Source Project of the Week: Kubei

Portshift has announced the release of Kubei, an open-source Kubernetes runtime vulnerabilities scanner tool.  According to the company, while there are a lot of options already out there, not all scanners are the same and differ by the number of feeds they consume, updates they product and information they provide.  ‘All tools, however, require some … continue reading

Google details how to apply SRE to monolith applications

Microservices are taking the software industry by storm, but that doesn’t mean monolithic applications are becoming extinct. While SRE is more commonly associated with modern architectures, Google is providing some insight into how enterprises can use SRE to manage their monoliths.  “When and why to choose monolithic architecture is usually a matter of what works … continue reading

Transitioning to SRE

Over the years, there have been a lot of new methodologies that aim to help an organization manage their technology more efficiently, whether that means making programmers more efficient or the operators who manage a company’s technology infrastructure. DevOps, which sought to bring developers and operators together, is one such example of this, and one … continue reading

ITOps 2020 predictions from around the industry

Tim Armandpour, SVP of engineering at PagerDuty Forget reliability — with the adoption of resilience engineering and the proper use of automation, operators can expect a 20% reduction in unplanned work. Today’s organizations are fixated on the reliability of their technology. But any developer can tell you that the reality is not if it will … continue reading

DevOps Institute announces a new course for site reliability engineering

The DevOps Institute today announced a Site Reliability Engineering (SRE) Foundation certification that will be available to registered education partners starting in January of next year. While there is no prerequisite to take the examination, a training course through an accredited partner is required.  According to the company, the certification content includes practical advice, related … continue reading

Making site reliability Blameless

As site reliability becomes more important as software releases grow in frequency and complexity, a startup called Blameless today released an SRE platform that can handle the increasing velocity of code deployments while offering faster, more efficient incident resolution. Ashar Rizqi, CEO of Blameless, said the company’s vision is to enable any modern software business … continue reading

Google introduces Stackdriver IRM for Site Reliability Engineering

Google announced a new Site Reliability Engineering-inspired tool for investigating, understanding, mitigating and recovering from incidents quickly and efficiently. Stackdriver Incident Response and Management (IRM) on Google Cloud Platform is available as an alpha version and features new monitoring tools for SRE journeys. After facing availability and reliability challenges, Google created SRE and SRE principles … continue reading

O’Reilly Velocity: SRE is an opinionated implementation of DevOps

When Google first came up with the term Site Reliability Engineering, it stemmed from its own production growth and challenges. “SRE is what you get when you treat operations as if it’s a software problem. Our mission is to protect, provide for, and progress the software and systems behind all of Google’s public services — … continue reading

Defining the role of a Site Reliability Engineer

The Site Reliability Engineer (SRE) is an emerging role in IT operations with more than 1,000 SRE job listings on LinkedIn. But while the role continues to gain traction, there is confusion about what it demands. Catchpoint, a digital experience monitoring company, released new survey findings to help understand how SREs play a part in … continue reading

Get access to this and other exclusive articles for FREE!

There's no charge and it only takes a few seconds.

Sign up now!