Successful insight into the performance of a company's networks starts with effective network performance management (NPM) tools. However, with the plethora of options, it can be overwhelming for IT teams to choose the right one. This blog continues the list of 10 essential questions to ask before selecting an NPM tool.
Start with: 10 Questions to Ask When Evaluating Network Performance Management Solutions - Part 1
Question #6: Does the NPM solution support machine learning, advanced anomaly detection and correlation?
Most solutions make broad claims in these areas without much to show for them. Networks, and the demands placed on them, differ widely from company to company, so even with today's computing technologies it is extremely difficult to apply generalizations across network performance monitoring. What is becoming more practical is for NPM solutions to learn from data trends over time, using machine learning to create baselines and identify anomalous behavior without pre-configured limits or behavior characteristics.
Legacy systems require a great deal of a priori knowledge, and then significant configuration, for anomaly detection to work effectively. ML and AI are beginning to change that, but it's important to validate the claims of any NPM solution.
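To make the baselining idea concrete, here is a minimal sketch in Python. It is not any vendor's method: it assumes a simple rolling mean and standard deviation stand in for the learned baseline, where real products likely use richer models. The names `find_anomalies`, `window`, and `threshold`, and the sample latency values, are hypothetical.

```python
from statistics import mean, stdev


def find_anomalies(samples, window=12, threshold=3.0):
    """Return indices of samples that deviate sharply from a rolling baseline.

    The baseline is the mean and standard deviation of the previous
    `window` samples, so no fixed limit has to be configured up front.
    """
    anomalies = []
    for i in range(window, len(samples)):
        baseline = samples[i - window:i]
        mu = mean(baseline)
        sigma = stdev(baseline)
        if sigma == 0:
            # Perfectly flat baseline: treat any change as a deviation.
            if samples[i] != mu:
                anomalies.append(i)
            continue
        if abs(samples[i] - mu) / sigma > threshold:
            anomalies.append(i)
    return anomalies


# Hypothetical round-trip latency samples in milliseconds.
latency_ms = [20, 21, 19, 20, 22, 21, 20, 19, 21, 20, 22, 21, 20, 95, 21, 20]
print(find_anomalies(latency_ms))  # prints [13]
```

No fixed latency limit is set anywhere in this sketch; the threshold is relative to what the series has recently looked like. One limitation worth noting is that a spike stays in the window for a while and inflates the baseline, which is one reason to test how a real solution handles this.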
Question #7: Is the solution utilizing advanced analytics and reporting?
To derive meaningful insights into complex issues, analytics platforms must provide reports and analyses on most, if not all, of a network's performance. This includes offering custom reporting for baselining and trend analysis and the ability to easily pivot reports to focus on key network performance intelligence.
Additionally, a modern NPM solution should correlate data across multiple network domains, offering a cohesive, big-picture view of performance metrics and providing intelligent alerting that gives valuable time back to strapped IT teams.
Question #8: Does the solution assist with capacity planning?
Under-provisioning network resources can lead to congestion, bad user experience, and loss of productivity; overall, a negative business impact. Over-provisioning can lead to excess spending and a hit to the bottom line. Capacity planning is therefore critical to avoiding both performance problems and unnecessary cost.
When looking at an NPM solution, it is critical that it supports capacity planning through these features:
■ Service Level Agreement (SLA) management
■ Network and application analysis
■ Baselining and trending
■ Exception management
■ QoS management
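As an illustration of how baselining and trending feed capacity planning, the following Python sketch fits a straight line to historical peak utilization and estimates how many more periods remain before a limit is reached. It assumes linear growth, which real traffic often does not follow, and the names `fit_trend` and `periods_until_limit` and the sample figures are hypothetical.

```python
def fit_trend(values):
    """Least-squares line through (0, v0), (1, v1), ...; returns (slope, intercept)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two samples to fit a trend")
    x_mean = (n - 1) / 2
    y_mean = sum(values) / n
    sxx = sum((x - x_mean) ** 2 for x in range(n))
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in enumerate(values))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return slope, intercept


def periods_until_limit(values, limit):
    """Estimate periods remaining until the trend line reaches `limit`.

    Returns None when utilization is flat or falling, and 0.0 when the
    trend line is already at or above the limit.
    """
    slope, intercept = fit_trend(values)
    if slope <= 0:
        return None
    crossing = (limit - intercept) / slope  # x position where the line hits the limit
    return max(0.0, crossing - (len(values) - 1))


# Hypothetical weekly peak link utilization, in percent.
weekly_peak_util = [40, 42, 44, 46, 48, 50]
print(periods_until_limit(weekly_peak_util, 80))  # prints 15.0
```

With these sample values the link grows two points per week, so the sketch estimates 15 more weeks before peak utilization reaches 80 percent.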
Question #9: Does the solution facilitate root-cause analysis?
Most NPM solutions focus on visualization and reporting based on flow data (NetFlow, sFlow, IPFIX, etc.). These solutions, and the flow data that feeds them, provide enough detail to troubleshoot many network and application issues. But at times flow data is simply not enough to get to the root cause of a problem. When more detail is needed, a recording of the network traffic itself, at the packet level, provides it. And when this packet data is analyzed with appropriate software, the software itself can identify many of these network and application issues.
An NPM solution that can quickly pivot from flow data for visualization and reporting to packet data for analysis provides the most comprehensive solution and will significantly reduce the mean time to repair (MTTR).
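One way to picture the pivot from flow data to packet data is to turn a suspicious flow record into a capture filter that isolates the matching packets. The Python sketch below is only an illustration under stated assumptions: the `FlowRecord` type and `capture_filter` function are hypothetical, the addresses are documentation examples, and the output uses pcap-filter expression syntax as accepted by tools such as tcpdump.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FlowRecord:
    """Hypothetical 5-tuple taken from a flow report."""
    src_ip: str
    dst_ip: str
    src_port: int
    dst_port: int
    protocol: str  # "tcp" or "udp"


def capture_filter(flow):
    """Build a pcap-filter expression matching both directions of the flow."""
    if flow.protocol not in ("tcp", "udp"):
        raise ValueError(f"unsupported protocol: {flow.protocol}")
    return (
        f"{flow.protocol} and host {flow.src_ip} and host {flow.dst_ip} "
        f"and port {flow.src_port} and port {flow.dst_port}"
    )


# A flow singled out in a report, for example because of high retransmissions.
suspect = FlowRecord("10.0.0.5", "192.0.2.10", 51234, 443, "tcp")
print(capture_filter(suspect))
# prints: tcp and host 10.0.0.5 and host 192.0.2.10 and port 51234 and port 443
```

The resulting expression could then be applied to a stored packet capture so that only the conversation in question is analyzed.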
Question #10: Can the solution provide scalable, enterprise support?
As the number of devices in many organizations continues to grow, it's important to implement tools that support this growth, particularly for large-scale organizations. A modern NPM platform must be able to analyze devices and environments at scale without latency and extend into additional environments such as multi-vendor WAN, public and private clouds, and more. It also must support capacity planning and predict whether a network can support an increase in business-critical traffic.
As organizations continue to grow and disperse, it is more evident than ever that ensuring optimal network performance is critical to business efficiency. When choosing a network performance monitoring solution, considering the questions above and implementing a unified platform will help organizations eliminate the cost and complexity of point solutions, reduce downtime, and successfully address the challenges of a modern network system.