Downtime

September 29, 2022

Our digital economy is intolerant of downtime. But consumers haven't just come to expect always-on digital apps and services. They also expect continuous innovation, new functionality and lightning-fast response times. Organizations have taken note, investing heavily in teams and tools that supposedly increase uptime and free resources for innovation. But leaders have not realized this "throw money at the problem" approach to monitoring is burning through resources without much improvement in availability outcomes ...

September 13, 2022

Data professionals are spending 40% of their time evaluating or checking data quality, and poor data quality impacts 26% of their companies' revenue, according to The State of Data Quality 2022, a report commissioned by Monte Carlo and conducted by Wakefield Research ...

June 27, 2022

Hybrid work adoption and the accelerated pace of digital transformation are driving an increasing need for automation and site reliability engineering (SRE) practices, according to new research. In a new survey, almost half of respondents (48.2%) said automation is a way to decrease Mean Time to Resolution/Repair (MTTR) and improve service management ...

June 14, 2022

Findings from the 2022 State of Edge Messaging Report from Ably and Coleman Parkes Research show that most organizations (65%) that have built edge messaging capabilities in house have experienced an outage or significant downtime in the last 12-18 months. Most of the current in-house real-time messaging services aren't cutting it ...

June 09, 2022

Networks need to be up and running for businesses to continue operating and sustaining customer-facing services. Streamlining and automating network administration tasks enable routine business processes to continue without disruption, eliminating any network downtime caused by human error or other system flaws ...

April 18, 2022

Because CIOs often have limited visibility into the number of machine identities on their networks and these critical security assets are not prioritized in IAM and security budgets, CIOs should expect to see a sharp increase in machine identity related outages and security breaches, according to a new study conducted by Venafi ...

February 10, 2022

Still not convinced on the value an AIOps platform offers? Consider this: one minute of downtime at Amazon costs the company roughly $220,000 in revenue. With that kind of money on the line, SRE and DevOps teams forced to manage availability by writing rules and querying logs manually are set up to fail — and failure is costly. AIOps is the necessary lift your monitoring tools need to improve performance and cut out the toil for DevOps and IT teams. Here are five ways AIOps does exactly that ...

February 01, 2022

Our growing dependence on the cloud and Internet for business means we must take time to prepare for downtime and latency issues. There are valuable lessons found in most failures, and the Internet outages of 2021 certainly provide ample motivation to revamp processes for mitigating system disruptions. Here are six take-aways from 2021's Internet fails that can be used to increase efficiencies in managing the system infrastructure of any enterprise, no matter its size or sector ...

January 06, 2022

In a world where digital services have become a critical part of how we go about our daily lives, the risk of undergoing an outage has become even more significant. Outages can range in severity and impact companies of every size — while outages from larger companies in the social media space or a cloud provider tend to receive a lot of coverage, application downtime from even the most targeted companies can disrupt users' personal and business operations ...

November 15, 2021

Most (83%) companies would suffer business damage during the first 24 hours of an outage and thereafter, according to Pivoting to Risk-Driven Security Operations, a report from Netenrich based on a global survey of IT and security professionals ...

August 30, 2021

The Fastly outage in June 2021 showed how one inconspicuous coding error can cause worldwide chaos. A single Fastly customer making a legitimate configuration change triggered a hidden bug that sent half of the internet offline, including web giants like Amazon and Reddit. Ultimately, this incident illustrates why organizations must test their software in production ...

July 12, 2021

When distressing internet outages occur, like the recent Fastly incident that threw a slew of websites offline, I am never surprised by how widespread the problem was but, paradoxically, that it wasn't worse ...

June 09, 2021

An hour-long outage this Tuesday ground the Internet to a halt after popular Content Delivery Network (CDN) provider Fastly experienced a glitch that downed Reddit, Spotify, HBO Max, Shopify, Stripe and the BBC, to name just a few of the properties affected ...

May 27, 2021

In summer 2020, changes to a Facebook API triggered a series of major mobile app crashes worldwide. Popular iOS apps including Spotify, Pinterest, TikTok, Venmo, Tinder and DoorDash, among others, failed immediately upon being opened, leaving millions of users without access to their favorite services. However, the API wasn't at fault; it was actually Facebook's iOS software development kit (SDK) that was responsible for the crash ...

April 26, 2021

On March 22, Android users around the globe suddenly saw notifications pop up on their devices saying that apps had stopped running. Critical apps such as Gmail, Google Pay, Amazon, Yahoo and certain banking apps couldn't be opened, creating widespread consumer concerns. Later, Google revealed the cause was a bug residing in the Android System WebView ...

March 09, 2021

In today's complex, dynamic IT environments, the proliferation of disparate IT Ops, NOC, DevOps, and SRE teams and tools is a given — and usually considered a necessity. This leads to the inevitable truth that when an incident happens, often the biggest challenge is collaborating between these teams to understand what happened and resolve the issue. Inefficiencies suffered during this critical stage can have huge impacts on how much each incident costs the business ...

February 25, 2021

Organizations use data to fuel their operations, make smart business decisions, improve customer relationships, and much more. Because so much value can be extracted from data, its influence is generally positive, but it can also be detrimental to a business experiencing a serious disruption such as a cyberattack, insider threat, or storage platform-specific hack or bug ...

February 24, 2021

Previously siloed IT teams and technologies are converging as enterprises accelerate their modernization efforts in reaction to COVID-19, according to a study by LogicMonitor ...

February 02, 2021

A poll we recently conducted of over 1,000 IT decision makers in the US revealed that over 40% suffer network brownouts several times a week, while end user complaints about application performance soared by 60% due to performance degradations, excessive slowdowns and network congestion ...

November 04, 2020

More than 80% of organizations have experienced a significant increase in pressure on digital services since the start of the COVID-19 pandemic, according to a survey by PagerDuty ...

September 29, 2020

More than 80% of organizations have experienced a significant increase in pressure on digital services since the start of the COVID-19 pandemic, according to a new study conducted by PagerDuty ...

September 09, 2020

In 2020, our society is undergoing massive upheaval and businesses are being forced to adapt on the fly. During this period of crisis, the companies that make the necessary adjustments the quickest will succeed. We're already seeing it happen in the accelerated push toward digitization, as companies that smoothly digitize their customer experiences move forward and those that don't get left behind ...

September 02, 2020

Most business executives are worried about the competition taking them down. What they don't realize is that their own IT can do an equal amount of damage. Without realizing this fact, most organizations are one high-profile incident away from losing a lot of their customers ...

August 26, 2020

No matter what year it is, businesses cannot afford, financially or operationally, to be hit by a data breach or system loss. This is an ongoing concern, but in the age of COVID-19, that risk multiplies severalfold due to remote data access. Any downtime as companies work to recover lost information could have major consequences ...

August 19, 2020

Some years ago, the key focus in computer systems was on performance, and many articles, products and efforts were evident in this area. A few years later, the emphasis moved to high availability (HA) of hardware and software and all the other machinations they entail. Today the focus is on (cyber)security. The boundaries between these discrete environments have now blurred under the heading of resilience. The main components of resilience are ...
