The New Normal for IT Ops Deepens Need for AI - Part 2
May 06, 2020

Will Cappelli
Moogsoft

The global pandemic has radically changed how enterprise IT services are consumed, both in the short and long term. Here's how AIOps can help IT Ops teams:

Start with The New Normal for IT Ops Deepens Need for AI - Part 1

Managing the New Normal

The new normal includes not only periodic recurrences of Covid-19 outbreaks but also the periodic emergence of new global pandemics. This means putting in place at least three layers of digital business continuity practice:

■ Continuity for illness-free periods

■ Continuity for periods marked by known pandemics

■ Continuity for periods marked by new pandemics

Rules-based analysis, historical data analysis, and predictive analysis based on history all become useless in this scenario. Instead, what's needed is technology that can anticipate outages without relying on stable historical patterns, as AIOps does.

Significant economic contraction, and the resulting pressure on both capital and operational expenditures, will lead to chronic understaffing of IT operations and NOC functions. IT Ops can leverage AIOps to achieve heightened levels of automation and to support deep cuts in the number of tools required both to monitor the digital infrastructure and to respond to incidents that occur.

As remote work becomes the default, it will become impossible to replicate the "monitoring cockpit" or "service desk cockpit" experience. IT operations team members and first responders will need to get by with standard IT management software. That means a significant increase in both the number of signals that need observation and the number of tickets that need a response. AIOps can help manage this by reducing the volume of signals and tickets.
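
To make the idea of reducing signals and tickets concrete, here is a minimal sketch of one common approach: grouping related alerts into a single incident when they hit the same service within a short time window. This is an illustration only, not a description of how any particular AIOps product works. The `Incident` class, the `group_alerts` function, and the five-minute window are all hypothetical choices made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Incident:
    """A group of related alerts (hypothetical structure for illustration)."""
    service: str
    first_seen: float
    last_seen: float
    alerts: list = field(default_factory=list)


def group_alerts(alerts, window_seconds=300.0):
    """Collapse alerts into incidents.

    alerts: iterable of (timestamp_seconds, service, message) tuples.
    An alert joins the latest incident for its service if it arrives within
    window_seconds of that incident's most recent alert; otherwise it opens
    a new incident. The window length is an assumed, tunable value.
    """
    latest_by_service = {}  # service name -> most recent Incident
    incidents = []
    for ts, service, message in sorted(alerts, key=lambda a: a[0]):
        current = latest_by_service.get(service)
        if current is not None and ts - current.last_seen <= window_seconds:
            current.alerts.append(message)
            current.last_seen = ts
        else:
            current = Incident(service, ts, ts, [message])
            latest_by_service[service] = current
            incidents.append(current)
    return incidents


if __name__ == "__main__":
    raw = [
        (0, "mail", "timeout"),
        (60, "mail", "timeout"),
        (90, "vpn", "tunnel down"),
        (1000, "mail", "timeout"),
    ]
    for incident in group_alerts(raw):
        print(incident.service, len(incident.alerts))
```

In this example, four raw alerts collapse into three incidents, so a responder sees one ticket per burst instead of one per alert. Real systems typically correlate on far more than service name and time, but the effect on ticket volume is the same in kind.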

Optimizing the New Normal

The move to an almost entirely virtualized infrastructure and service portfolio will allow for maximum agility and the ability to reconfigure people, processes and technologies to meet emerging business needs (which will themselves likely be novel in the new normal). To provide continuous assurance of service levels, even as the services themselves evolve, IT Ops teams can leverage AIOps and its ability to anticipate outages and brown-outs on the basis of data as it arrives, as opposed to pre-existing static models of topology and user behaviour.
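
As a rough illustration of what working "on the basis of data as it arrives" can look like, the sketch below flags unusual metric values using a running estimate that is updated with every new observation, rather than a model fitted in advance. It is a simplified stand-in, not the algorithm used by any specific AIOps platform. The `StreamingDetector` class and its `alpha`, `threshold`, and `warmup` parameters are assumptions chosen for the example.

```python
import math


class StreamingDetector:
    """Flags outliers against an exponentially weighted running mean and variance.

    alpha:     weight given to each new observation (assumed value).
    threshold: number of standard deviations that counts as anomalous (assumed).
    warmup:    observations to absorb before any value can be flagged (assumed).
    """

    def __init__(self, alpha=0.1, threshold=3.0, warmup=5):
        self.alpha = alpha
        self.threshold = threshold
        self.warmup = warmup
        self.mean = 0.0
        self.var = 0.0
        self.count = 0

    def update(self, value):
        """Consume one observation; return True if it looks anomalous."""
        self.count += 1
        if self.count == 1:
            # The first observation only seeds the baseline.
            self.mean = value
            return False

        deviation = value - self.mean
        std = math.sqrt(self.var)
        is_anomaly = (
            self.count > self.warmup
            and std > 0
            and abs(deviation) > self.threshold * std
        )

        # Update the baseline so it keeps tracking the stream as it changes.
        self.mean += self.alpha * deviation
        self.var = (1 - self.alpha) * (self.var + self.alpha * deviation ** 2)
        return is_anomaly


if __name__ == "__main__":
    detector = StreamingDetector()
    latencies_ms = [100, 102, 98, 101, 99, 100, 101, 250]
    for value in latencies_ms:
        if detector.update(value):
            print("possible brown-out, latency:", value)
```

Because the baseline is recomputed on every observation, the detector needs no stored history and adapts when normal behaviour shifts, which is the property the paragraph above contrasts with static models.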

The shift from an IT budget that, beyond labor commitments, is dominated by capital expenditures and maintenance to one that is almost entirely dominated by renewable operational expenditures will increase business resilience in the face of the three types of continuity issues outlined above. AIOps can contribute here as well by anticipating short-term fluctuations in resource requirements based on the possibility of looming outages and brown-outs.

The economic contraction will accelerate digitalization and, in fact, lead to what may be called "maximum digitalization" with the consequence that, for the most part, business process events will be IT system state changes. One will not be able to manage business processes unless one simultaneously manages IT system events. AIOps can be invaluable here by effectively discovering and managing the higher-level IT system event patterns that are, in fact, business process patterns.

Will Cappelli is Field CTO at Moogsoft