One of the benefits of doing the EMA Radar Report: AIOps- A Guide for Investing in Innovation was getting data from all 17 vendors on critical areas ranging from deployment and adoption challenges, to cost and pricing, to architectural and functionality insights across everything from heuristics, to automation, and data assimilation.
Administration and Deployment
In the area of deployment and administration, EMA found that on average AIOps vendors indicated between 1-1.5 full-time employees (FTE) were required for ongoing administration in an enterprise with about 10,000 employees. This didn’t include initial deployment or any significant extension in breadth of coverage or functionality.
In 31 interviews, these estimates were generally borne out. Three vendors at the high end estimated between 2.5 and 3 FTEs, whereas the three vendors at the low end estimated between less than 0.5 FTEs.
The great majority of AIOps platforms have heuristics that can "learn" their environments dynamically, without added administrative intervention. On average, they can do this in a little more than one week for 5,000 managed entities.
EMA then asked vendors to weight their AI/ML heuristics on a scale from 0-2, with 2 being a featured heuristic value, 1 being present, and 0 being absent. The top 10 heuristics getting a 2 weighting were:
2. Anomaly detection
3. Machine learning and baselining for event pattern recognition
4. Topology-based analytics
5. Prescriptive analytics
6. Predictive algorithms
8. Streaming analytics
9. Optimization algorithms
10. Object-based modeling
On average, AIOps vendors could assimilate between 1 million and 10 million metrics within five minutes. When we asked about what data types were in play, we saw:
1. Events (performance related)
2. Time Series
3. Log files
4. Events/ Time Series (security related)
5. Transaction (application performance)
7. Unstructured data
8. Agent data (systems)
9. Byte code instrumentation
10. Comma delimited files /CSV files
Third-party toolset integration
Significantly, all 17 vendors have some level of third-party toolset integration out of the box, or in parallel, none claim to do "all their own monitoring." In fact, the average AIOps platform has supported integrations for more than 50 different third-party toolsets, with four vendors indicating 100 or more.
These integrations can have powerful political and practical advantages, easing stakeholder reluctance by eliminating the need to break away from their existing tools completely. Additional values include toolset consolidation as IT organizations begin to observe redundancies while also realizing which toolsets are most valuable.
The most common toolset integrations were application performance monitoring (APM) tools tied with CMDBs or extended configuration management systems. Service desk integration for trouble ticketing followed and third-party event management systems came in fourth. Automation integrations were also key, with IT process automation (runbook), and workflow across IT in the lead.
A few use-case views
We had three use-case scenarios. And for each use case we examined a number of factors ranging from domain reach, stakeholders supported, real-time data currency, and heuristics to enable not only awareness of anomalies, but predictive and prescriptive recommendations. Vendors were positioned separately on a per-use-case basis.
When we asked about the top benefits for incident, availability and performance management all vendors led with the following six items, which were also born out in deployment interviews:
■ Faster time to repair problems
■ Proactive ability to prevent problems
■ Improved OpEx efficiencies within IT
■ Less time spent writing rules
■ Real-time insights and historical trends on IT services
■ Reduction/consolidation, minimalization of tools
When we asked what changes each vendor could trace for change impact and capacity optimization, we got the following top five:
■ System configuration service impact analysis
■ Application release changes
■ Service impact analysis (in general)
■ Virtualized infrastructure service impact analysis
■ Containers and microservices service impact analysis
For business impact and IT-to-business alignment, we asked about relevant data sources and saw these as the top five:
■ Enterprise operations data
■ IT warehouse for advanced trending
■ Business application owner data
■ Executive dashboard
■ Security/audit compliance systems
To wrap up
This is just a taste of the data that emerged from our AIOps Radar research. The report contains considerably more detail, while still being a condensation of 105 data-rich slides.
Doing this has been an adventure for me, for EMA as a whole, and I believe for the vendors involved, as well. I do hope you can check out the report and see for yourself as to why.
A growing need for process automation as a result of the confluence of digital transformation initiatives with the remote/hybrid work policies brought on by the pandemic was uncovered by an independent survey of over 500 IT Operations, DevOps, and Site Reliability Engineering (SRE) professionals commissioned by Transposit for its inaugural State of DevOps Automation Report ...
As the Covid-19 pandemic forces a global reset of how we gather and work, 60% of organizations are looking forward to increased spending in 2021 to deploy new technologies, according to the 14th annual State of the Network global study of enterprise networking and security challenges released by VIAVI Solutions ...
Complexity breaks correlation. Intelligence brings cohesion. This simple principle is what makes real-time asset intelligence a must-have for AIOps that is meant to diffuse complexity. To further create a context for the user, it is critical to understand service dependencies and correlate alerts across the stack to resolve incidents ...
We're all familiar with the process of QA within the software development cycle. Developers build a product and send it to QA engineers, who test and bless it before pushing it into the world. After release, a different team of SREs with their own toolset then monitor for issues and bugs. Now, a new level of customer expectations for speed and reliability have pushed businesses further toward delivering rapid product iterations and innovations to keep up with customer demands. This leaves little time to run the traditional development process ...
On Wednesday January 27, 2021, Microsoft Office 365 experienced an outage affected a number of its services with a prolonged outage affecting Exchange Online. Despite Microsoft indicating that it was just Exchange Online affected during this outage, some monitoring tools detected that Azure Active Directory and dependent services like SharePoint and OneDrive were also affected at the time. The outage information indicated a rollout and rollback but we wouldn't expect to see such a widescale outage and slowdown just affecting some of the schema unless everything had to be taken offline ...
Application availability depends on the availability of other elements in a system, for example, network, server, operating system and so on, which support the application. Concentrating solely on the availability of any one block will not produce optimum availability of the application for the end user ...
A hybrid work environment will persist after the pandemic recedes, with over 80% stating that they expect over a quarter of workers to remain remote, and over two-thirds desiring flexibility between on-premises and remote deployments according to the 2021 State of the WAN report released by Aryaka ...
As vaccinations rise and businesses plan for a post-covid future, more than 80% of knowledge workers in the US would like their long-term work environment to include some element of remote work ...
With so many of us working from home, IT leaders and executives are now more than ever interested in ensuring that the cloud services their team relies on are available. But instead of accessing popular business-critical applications such as Salesforce, G Suite, Office 365, Microsoft 365, and so on through the company's data center, employees now get these services directly from the Internet. Experience and productivity at each location vary by internet, ISP, gateway, proxy, etc. ...
Integration challenges continue to be a major roadblock for digital transformation initiatives, according to MuleSoft’s 2021 Connectivity Benchmark Report. As digital initiatives accelerate, integration has emerged as a critical factor in determining the success and speed of digital transformation across industries ...