Key Takeaway:


This past weekend’s global IT outage, triggered by a problematic software update, underscores the interconnected and often delicate nature of contemporary IT systems. It highlights how a single point of failure can ripple across numerous sectors.

The incident stemmed from an automatic update to CrowdStrike Falcon, a widely used cybersecurity tool, leading to crashes of Microsoft Windows computers worldwide.

CrowdStrike has since resolved the issue. While many organizations have resumed operations, IT teams face a prolonged effort to manually restore all affected systems.

Why did this happen?

Many organizations depend on the same cloud providers and cybersecurity solutions, creating a digital monoculture. This standardization allows for efficient and compatible systems but also means issues can propagate widely, as seen with CrowdStrike’s global impact.

Modern IT infrastructure is highly interconnected and interdependent. A failure in one component can trigger a chain reaction, affecting other parts of the system.

As software and networks grow more complex, the potential for unforeseen interactions and bugs increases. Even minor updates can have unintended, widespread consequences.

How was Microsoft involved?

When Windows computers began displaying the “blue screen of death,” initial reports blamed Microsoft. Microsoft confirmed a cloud services outage in the Central United States, starting around 6 pm Eastern Time on Thursday, July 18, 2024.

This outage affected some customers using various Azure services, Microsoft’s cloud platform. The impact was extensive, disrupting multiple sectors, including airlines, retail, banking, and media, not just in the U.S. but also in countries like Australia and New Zealand. It also affected several Microsoft 365 services, including PowerBI, Microsoft Fabric, and Teams.

It was later revealed that the Azure outage was also linked to the CrowdStrike update, impacting Microsoft’s virtual machines running Windows with Falcon installed. However, Microsoft has since confirmed these were unrelated events, and the Azure issue has “fully recovered.”

Lessons from this episode

Avoid over-reliance on a single IT provider. Companies should adopt a multi-cloud strategy, distributing their IT infrastructure across multiple providers to maintain operations if one fails.

Building redundancies into IT systems ensures continuity. Backup servers, alternative data centers, and failover mechanisms can take over if primary systems go down.

Automating routine IT processes minimizes human error, a common cause of outages. Automated systems can also proactively monitor and address potential issues.

Training staff to respond effectively during outages is crucial. Knowing who to contact, what steps to take, and how to use alternative workflows can help manage the situation.

Potential severity of IT outages

While a total global internet blackout is unlikely due to the internet’s distributed and decentralized nature, significant disruptions are possible.

Potential causes include intense solar flares, like the Carrington Event of 1859, which could damage satellites, power grids, and undersea cables essential for the internet. Such an event could lead to continent-spanning outages lasting months.

The global internet depends heavily on undersea fiber optic cables. Damage to key cables, whether from natural disasters, seismic events, accidents, or sabotage, could disrupt international internet traffic.

Coordinated cyber attacks targeting critical internet infrastructure, such as root DNS servers or major internet exchange points, could also cause widespread outages.

While a complete internet shutdown is improbable, the interconnected nature of our digital world means any large outage will have extensive impacts, disrupting the online services we rely on.

Continual adaptation and preparedness are crucial for ensuring the resilience of our global communications infrastructure.

Recently Published

Key Takeaway: Researchers have developed a technology that creates “audible enclaves” in open air, creating highly focused, localized zones of sound. These isolated audio pockets allow sound to materialize only at a precise point in space, unheard by others nearby. This breakthrough could revolutionize public communication, entertainment, military applications, and office design. The process, known […]
Key Takeaway: AI-powered mental health tools, such as chatbots and self-help apps, offer immediate emotional support to those in need. However, these tools cannot replace the complexity, depth, and ethical safeguards of human therapy, especially when dealing with serious mental health issues. AI lacks emotional understanding, cultural context, and real-time adaptability, which can be dangerous […]

Top Picks

Key Takeaway: Research shows that some animals form surprising partnerships, challenging traditional views on how intelligence evolves in the animal kingdom. For example, Octavia and Finn, a day octopus and coral trout, work as a team, each bringing unique skills to the hunt. Other species have also developed remarkable partnerships, such as the greater honeyguide […]
Key Takeaway: Satellite re-entry, a process where defunct satellites are disposed of, is causing a significant environmental impact on Earth’s atmosphere. As satellite usage increases, researchers are focusing on the re-entry process itself, which releases metal particles into the Earth’s atmosphere. These particles, such as aluminum oxide and lithium, can influence the planet’s energy balance, […]

Trending

I highly recommend reading the McKinsey Global Institute’s new report, “Reskilling China: Transforming The World’s Largest Workforce Into Lifelong Learners”, which focuses on the country’s biggest employment challenge, re-training its workforce and the adoption of practices such as lifelong learning to address the growing digital transformation of its productive fabric. How to transform the country […]

Join our Newsletter

Get our monthly recap with the latest news, articles and resources.

Login

Welcome to Empirics

We are glad you have decided to join our mission of gathering the collective knowledge of Asia!
Join Empirics