SRE is quickly becoming the standard for IT transformation journeys. According to DevOps Institute's Global SRE Pulse 2022, 62% of organizations across the globe are adopting SRE in various ways: 55% in specific teams, services or products; 19% across the entire organization; and 23% as a pilot.
To be successful with SRE, organizations must also transform their culture and the human side of how they work. This cultural shift and new way of thinking must happen across both IT and the business. The Global SRE Pulse 2022 report offers a deep look into the current state of SRE and the trends shaping it going forward. Drawing on more than 460 survey responses from SRE professionals at organizations of all sizes, we've identified four top takeaways from the report:
SRE is an Essential Engineering Function for Digital Transformation
SRE enhances collaboration between development and operations. The outcome is more reliable systems, services and applications, which in turn increases the business value of those services and applications and improves the relationship between IT and the business. As software stacks grow more complex, organizations look to SRE to establish better collaboration between development and operations teams while continuously improving the reliability and health of applications and services, ultimately optimizing the customer experience.
The survey asked where SRE is currently leveraged: the software the company builds or the set of services SRE teams interact with. Fifty-six percent (56%) of respondents said they leverage SRE for operating their Systems of Engagement (SOE) and 42% for their Systems of Record (SOR). Of the respondents who have adopted SRE, 52% described their company as a leader across customer experience, product quality, offerings, processes, services, and innovation.
What this means:
SRE is an essential operating model to improve both back-end (SOR) and front-end (SOE) services and applications, aiding organizations in accelerating their digital transformation. Other models such as DevOps are supported through the SRE engineering function as it eliminates toil and improves automation across a variety of essential processes such as incident, change, configuration and capacity management.
SRE Adoption Includes Challenges and Complexities
According to Global SRE Pulse 2022, a shortage of staff with the skills needed to work as an SRE is the biggest challenge organizations face when implementing SRE, regardless of their size. Eighty-five percent (85%) of survey respondents cited it.
Survey respondents also noted other challenges: "value of SRE is not understood" (71%); "lack of tools in place" (55%); "don't have time to implement SRE" (53%); and "lack of management support" (44%).
Process issues and new releases are the biggest sources of toil, according to the study. For SRE team members, eliminating toil across different processes is a key point of focus. According to the survey, 27% of respondents cite process issues as the top source of toil, and another 19% cite releases of new applications. For a digital business, revenue is directly tied to the value the software provides.
What this means:
The SRE operating model and its critical success factors must be presented in a way that shows clear benefits and routes to value, so that the organization can make the necessary adjustments around upskilling or reskilling. Once the practice is established, measuring its success through key performance indicators (KPIs), such as improved adherence to Service Level Agreements (SLAs) and improvements against Service Level Objectives (SLOs), can accelerate its adoption.
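As an illustration of how an SLO-based KPI might be tracked, here is a minimal Python sketch. It is not taken from the report: the function name `slo_report`, the request counts, and the 99.9% target are all hypothetical, and the "error budget" calculation assumes a simple request-based availability SLO.

```python
from dataclasses import dataclass


@dataclass
class SloReport:
    attainment: float       # fraction of requests that were good
    target_met: bool        # True if attainment is at or above the SLO target
    budget_consumed: float  # share of the error budget used (may exceed 1.0)


def slo_report(good: int, total: int, target: float) -> SloReport:
    """Summarize a request-based SLO for one reporting window.

    Hypothetical helper: `good` and `total` are request counts, and
    `target` is the SLO expressed as a fraction (for example 0.999).
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= good <= total:
        raise ValueError("good must be between 0 and total")
    if not 0.0 < target < 1.0:
        raise ValueError("target must be strictly between 0 and 1")

    attainment = good / total
    # The error budget is the number of failures the SLO tolerates.
    allowed_failures = (1.0 - target) * total
    actual_failures = total - good
    return SloReport(
        attainment=attainment,
        target_met=attainment >= target,
        budget_consumed=actual_failures / allowed_failures,
    )


# Assumed example window: 100,000 requests, 50 failures, 99.9% target.
report = slo_report(good=99_950, total=100_000, target=0.999)
print(f"attainment: {report.attainment:.4%}")
print(f"target met: {report.target_met}")
print(f"error budget consumed: {report.budget_consumed:.0%}")
```

In this assumed window the service meets its target while using about half of its error budget. Tracking that ratio over successive windows is one way to show whether reliability is improving after SRE adoption.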
Observability and Monitoring Platforms Are in Demand
Observability and monitoring platforms are the second most adopted category of automation tools. Seventy-two percent (72%) of survey respondents indicated they are currently implementing, and continuing to expand, observability and monitoring tools. This is a good indication that observability and monitoring strategies are starting to bear fruit.
However, observability is only effective when it is adopted everywhere, and today only 29% of survey respondents report using observability tools and techniques everywhere.
What this means:
Many organizations still rely on fragmented monitoring approaches, which yield only limited insight into the performance of modern hybrid cloud applications and other business-critical resources. This fragmentation slows progress in digital transformation. Observability should be adopted holistically so that the internal states of a system can be inferred from its outputs.
The SRE Job Market is HOT
Site Reliability Engineering continues to be a top IT job. More than 50% of respondents said they had expanded their skills and capabilities in the SRE role.
Further, 44% strongly agreed that they are more engaged and excited about their SRE role, and 36% strongly agreed that they are more valued as a team member.
Thirty-four percent (34%) feel more valued and appreciated.
Additionally, respondents revealed that the SRE role tends to come with higher pay. Fifty-two percent (52%) of respondents agreed, strongly or somewhat, that their compensation has improved.
What this means:
Today, SRE is an essential engineering function that offers great fulfillment, pay and opportunities to learn. As organizations adopt it more widely, there is both a need and an opportunity for more skilled SRE professionals to help improve processes and establish a more collaborative culture.