4 Important Findings from Global SRE Pulse 2022 and What They Mean

Eveline Oehrlich
DevOps Institute

SRE is quickly becoming the standard for IT transformation journeys. According to DevOps Institute's Global SRE Pulse 2022, 62% of organizations across the globe are adopting SRE in various ways (55% in specific teams, services or products; 19% across the entire organization; and 23% as a pilot).


For organizations to be successful with SRE, they must also transform their culture and the human side of the organization. This cultural shift and new way of thinking must happen across IT and the business. The Global SRE Pulse 2022 report offers a deep look into the current state of SRE and the trends shaping it now and in the future. Drawing on more than 460 survey responses from SRE professionals at organizations of all sizes, we've identified four top takeaways from the report:

SRE is an Essential Engineering Function for Digital Transformation

SRE enhances collaboration between development and operations. The outcome is more reliable systems, services and applications. This increases the business value of those services and applications and improves the relationship between IT and the business. As software stacks grow more complex, organizations look to SRE to establish better collaboration between development and operations teams while continuously improving the reliability and health of applications and services, ultimately optimizing the customer experience.

The survey asked where SRE is currently leveraged: the software the company builds or the set of services SRE teams interact with. Fifty-six percent (56%) of respondents said they leverage SRE for operating their Systems of Engagement (SOE) and 42% for their Systems of Record (SOR). Of the respondents who have adopted SRE, 52% described their company as a leader across customer experience, product quality, offerings, processes, services and innovation.

What this means:

SRE is an essential operating model for improving both back-end (SOR) and front-end (SOE) services and applications, helping organizations accelerate their digital transformation. The SRE engineering function also supports other models such as DevOps, because it eliminates toil and improves automation across essential processes such as incident, change, configuration and capacity management.

SRE Adoption Includes Challenges and Complexities

According to Global SRE Pulse 2022, a lack of staff with the skills needed to work as an SRE is the biggest challenge organizations face, regardless of size. Eighty-five percent (85%) of survey respondents said they lack such staff when implementing SRE.

Survey respondents also noted other challenges, such as "value of SRE is not understood" (71%); "don't have time to implement SRE" (53%); "lack of tools in place" (55%); and "lack of management support" (44%).

Process issues and new releases are the biggest sources of toil, according to the study. For SRE team members, eliminating toil across different processes is a key point of focus. According to the survey, 27% of respondents cite process issues as the top source of toil, and another 19% cite releases of new applications as the main source. For a digital business, revenue is directly tied to the value the software provides.

What this means:

The SRE operating model and its critical success factors must be presented in a way that shows benefits and routes to value, so that the organization can make the necessary adjustments around upskilling or reskilling. Once the practice is established, measuring its success through key performance indicators (KPIs), such as improved adherence to Service Level Agreements (SLAs) and progress against Service Level Objectives (SLOs), can accelerate its adoption.
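
As an illustration of the SLO-based measurement mentioned above, here is a minimal Python sketch. It is not taken from the Global SRE Pulse report. It assumes an availability-style SLO expressed as the ratio of good events to total events; the names `evaluate_slo` and `SloReport`, and the sample numbers, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class SloReport:
    """Result of comparing a measured success ratio with an SLO target."""
    sli: float              # measured fraction of good events
    slo_met: bool           # True when the SLI is at or above the target
    budget_consumed: float  # share of the allowed failures already used


def evaluate_slo(good_events: int, total_events: int, slo_target: float) -> SloReport:
    """Hypothetical helper: evaluate one reporting window against an SLO target."""
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    if not 0 <= good_events <= total_events:
        raise ValueError("good_events must be between 0 and total_events")
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")

    # Service level indicator: fraction of events that were good.
    sli = good_events / total_events

    # Failures the target tolerates in this window, and failures observed.
    allowed_failures = (1.0 - slo_target) * total_events
    actual_failures = total_events - good_events

    return SloReport(
        sli=sli,
        slo_met=sli >= slo_target,
        budget_consumed=actual_failures / allowed_failures,
    )


if __name__ == "__main__":
    # Assumed sample window: one million requests, 500 failures, 99.9% target.
    report = evaluate_slo(good_events=999_500, total_events=1_000_000, slo_target=0.999)
    print(report)
```

Tracking a figure like `budget_consumed` over time is one possible way to turn "improvements around SLOs" into a KPI that can be shown to management; the right indicator and target depend on the service.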

Observability and Monitoring Platforms Are in Demand

Observability and monitoring platforms are the second most adopted automation tools. Seventy-two percent (72%) of survey respondents indicated they are currently, and continuously, implementing observability and monitoring tools. This is a good indication that observability and monitoring strategies are starting to bear fruit.

However, effective observability requires adopting it everywhere, and today only 29% of survey respondents report using observability tools and techniques everywhere.

What this means:

Many organizations still rely on fragmented monitoring approaches, which yield limited insight into the performance of modern hybrid cloud applications and other business-critical resources. This fragmented approach hinders progress in digital transformation. Observability should be adopted holistically so that the internal states of a system can be inferred from its outputs.

The SRE Job Market is HOT

Site Reliability Engineering continues to be a top IT job. More than 50% of respondents said they had expanded their skills and capabilities in the SRE role.

Further, 44% strongly agree that they are more engaged and excited about their SRE role, and 36% strongly agree that they are more valued as a team member.

Thirty-four percent (34%) feel more valued and appreciated.

Additionally, respondents indicated that the SRE role tends to come with higher pay. Fifty-two percent (52%) agree, strongly or somewhat, that their compensation has improved.

What this means:

Today, SRE is an essential engineering function that offers great fulfillment, pay and opportunities to learn. As organizations adopt it more widely, there is both a need and an opportunity for more skilled SRE professionals to help improve processes and establish a more collaborative culture.

Eveline Oehrlich is Chief Research Officer at DevOps Institute
