Exchange Server Needs Proactive Monitoring
April 01, 2014

Praveen Manohar
SolarWinds

Much of an organization's communication happens over email, so any email downtime can hurt productivity and revenue. This is why the availability and performance of Microsoft Exchange are vital for any organization that uses it.

To maintain the uptime of Microsoft Exchange, it's essential that performance and availability are continuously monitored. But which parameters have to be monitored? Here are some issues that can affect your Exchange server's performance, along with the parameters you need to monitor to avoid them.

Storage Performance

As an IT admin, you have probably seen cases where Microsoft Outlook users experience poor performance while fetching email from the Exchange server. Storage performance, measured in input/output operations per second (IOPS), could be the culprit here. This is because IOPS determines how fast data, in our case email data, can be written to or read from storage. Monitoring the performance of the Exchange storage alerts you to potential issues that can affect fetching or sending mail.
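As a rough illustration of the IOPS measurement described above, here is a minimal Python sketch. It assumes you can already collect cumulative read and write counts from your storage. The `DiskSample` shape, the rated capacity, and the 80% headroom figure are all hypothetical and are not taken from any Exchange or vendor API.

```python
from dataclasses import dataclass


@dataclass
class DiskSample:
    """One reading of cumulative disk counters (hypothetical shape)."""
    timestamp_s: float  # seconds since an arbitrary fixed start
    reads: int          # total read operations completed so far
    writes: int         # total write operations completed so far


def iops(earlier: DiskSample, later: DiskSample) -> float:
    """Average I/O operations per second between two samples."""
    elapsed = later.timestamp_s - earlier.timestamp_s
    if elapsed <= 0:
        raise ValueError("samples must be in increasing time order")
    operations = (later.reads - earlier.reads) + (later.writes - earlier.writes)
    return operations / elapsed


def storage_is_saturated(measured_iops: float, rated_iops: float,
                         headroom: float = 0.8) -> bool:
    """True when measured IOPS exceeds the chosen fraction of rated capacity.

    The 0.8 default is an assumed safety margin, not a recommended value.
    """
    return measured_iops > rated_iops * headroom


if __name__ == "__main__":
    first = DiskSample(timestamp_s=0.0, reads=10_000, writes=5_000)
    second = DiskSample(timestamp_s=60.0, reads=40_000, writes=23_000)
    rate = iops(first, second)
    print(f"{rate:.0f} IOPS, saturated: {storage_is_saturated(rate, rated_iops=900)}")
```

Comparing the measured rate against the rated capacity of the storage, rather than against a fixed number, keeps the check meaningful when the underlying disks change.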

RPC Threads

Do you often get support calls from users complaining that Outlook is unable to connect to their mailbox? One cause can be the unavailability of Remote Procedure Call (RPC) threads. The Outlook client uses RPC threads to connect to the Exchange server and perform operations such as sending and receiving email and creating appointments, meetings, and tasks. There's a limit on the number of RPC threads available on an Exchange server.

When all RPC threads are in use, the Outlook client automatically retries the connection until a thread becomes available, which makes user actions slow. By monitoring RPC counters such as RPC Requests, RPC Operations/sec, and RPC Averaged Latency, you can raise alerts when a permitted limit is crossed. Once you receive an alert, you can restart the Exchange RPC Client Access service to free up RPC threads.
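The alerting idea above can be sketched in a few lines of Python. This is only an illustration: it assumes the counter readings have already been collected into a dictionary, and the limits shown are hypothetical placeholders rather than Microsoft-recommended values.

```python
from typing import List, Mapping

# Hypothetical alert limits; tune these for your own environment.
RPC_LIMITS: Mapping[str, float] = {
    "RPC Requests": 40,
    "RPC Averaged Latency": 50,  # milliseconds
}


def rpc_alerts(readings: Mapping[str, float],
               limits: Mapping[str, float] = RPC_LIMITS) -> List[str]:
    """Return one alert message per counter whose reading exceeds its limit.

    Counters that are missing from `readings` are skipped rather than
    treated as failures.
    """
    alerts = []
    for counter, limit in limits.items():
        value = readings.get(counter)
        if value is not None and value > limit:
            alerts.append(f"{counter} is {value}, above the limit of {limit}")
    return alerts


if __name__ == "__main__":
    sample = {"RPC Requests": 55, "RPC Averaged Latency": 12}
    for message in rpc_alerts(sample):
        print(message)
```

Keeping the limits in one table makes it easy to add further counters, such as RPC Operations/sec, once you have a baseline for them.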

An increase in RPC thread usage can also create a bottleneck on the server's resources (RAM and CPU), slowing down the server itself.

Replication

Data loss incidents, such as file corruption, water damage, and human error, can occur in any organization. Organizations should anticipate these incidents and keep a backup of their databases.

Most organizations run Exchange with the replication feature, which enables high availability for the Exchange Server database. But just having this feature enabled is not enough: you also need to ensure that replication is working properly. An incomplete or faulty database copy is as bad as having no backup at all. The copy status, copy queue, and replay queue for both the active and passive copies of all mailbox databases should therefore be monitored to ensure no failure is looming.

Further, replication health checks should also cover the supporting components, such as Active Manager, the Cluster service, and the Replay service.
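To make the copy status, copy queue, and replay queue checks concrete, here is a minimal Python sketch. It assumes the per-copy values have already been exported from Exchange into simple records. The `DatabaseCopy` shape, the example database names, and both queue limits are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DatabaseCopy:
    """Hypothetical snapshot of one mailbox database copy."""
    name: str
    status: str               # e.g. "Mounted" (active) or "Healthy" (passive)
    copy_queue_length: int    # log files waiting to be copied
    replay_queue_length: int  # log files waiting to be replayed


# Statuses treated as normal in this sketch.
OK_STATUSES = {"Mounted", "Healthy"}


def unhealthy_copies(copies: List[DatabaseCopy],
                     max_copy_queue: int = 10,
                     max_replay_queue: int = 100) -> List[str]:
    """Return the names of copies with a bad status or an over-long queue.

    The queue limits are assumed defaults, not recommended values.
    """
    flagged = []
    for copy in copies:
        if (copy.status not in OK_STATUSES
                or copy.copy_queue_length > max_copy_queue
                or copy.replay_queue_length > max_replay_queue):
            flagged.append(copy.name)
    return flagged


if __name__ == "__main__":
    snapshot = [
        DatabaseCopy("DB01\\EX1", "Mounted", 0, 0),
        DatabaseCopy("DB01\\EX2", "Healthy", 25, 3),
        DatabaseCopy("DB02\\EX2", "Failed", 0, 0),
    ]
    print(unhealthy_copies(snapshot))
```

Checking the status and both queues together matters because a copy can report a normal status while its queues quietly grow.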

Storage Limits

When storage was expensive, Exchange admins used to limit the size of mailbox storage. Now that storage is cheaper, many admins choose not to set storage limits. This can cause the database to grow and eventually lead to issues such as backup failures or longer restore times. It is therefore recommended to limit mailbox size, both to minimize the time needed for a data restore and to reduce the probability of backup failure. Monitoring mailbox size helps you check whether the rules applied to maintain mailbox size are working.
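A mailbox size check of this kind might look like the following Python sketch. It assumes mailbox sizes are already available as a mapping from mailbox name to size in megabytes. The names, sizes, and the 2048 MB limit are made up for illustration.

```python
from typing import List, Mapping


def mailboxes_over_limit(sizes_mb: Mapping[str, float],
                         limit_mb: float) -> List[str]:
    """Return the mailboxes whose size exceeds the limit, in name order."""
    return sorted(name for name, size in sizes_mb.items() if size > limit_mb)


if __name__ == "__main__":
    # Hypothetical mailbox sizes in megabytes.
    sizes = {"alice": 1500.0, "bob": 4200.0, "carol": 2048.0}
    print(mailboxes_over_limit(sizes, limit_mb=2048.0))
```

If this list is ever non-empty while a size limit is supposed to be enforced, that suggests the limit is not being applied to every mailbox.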

Detailed monitoring and proper alerting for the above-mentioned counters will help you take action before most undesirable events happen or get out of hand.

Praveen Manohar is a Head Geek at SolarWinds.
