Advertisement

Routine maintenance turned into Facebook’s biggest crisis

Routine maintenance on Facebook’s network stitching together its data centres caused a collapse of its global system for more than six hours, the company says.

Oct 06, 2021, updated Oct 06, 2021
Facebook CEO Mark Zuckerberg. REUTERS/Leah Millis

Facebook CEO Mark Zuckerberg. REUTERS/Leah Millis

The outage was the largest that Downdetector, a web monitoring firm, said it had ever seen.

It blocked access to apps for billions of users of Facebook, Instagram and WhatsApp, further intensifying weeks of scrutiny for the $US1 trillion ($A1.4 trillion) company.

The outage was the second blow to the social media giant in as many days after a former employee turned whistleblower told a US Senate hearing Facebook put profits before people’s safety, which the company denies.

In a blog post, Facebook Vice President of engineering Santosh Janardhan explained the company’s engineers issued a command that unintentionally disconnected Facebook data centres from the rest of the world.

While users lost access to one of the world’s most popular messaging apps – as WhatsApp has more than two billion users – employees were also blocked from internal tools.

The outage knocked out tools that engineers would normally use to investigate and repair such outages, making the task even more difficult, Facebook said.

The company said it sent a team of engineers to its data centres to try to debug and restart the systems.

However, it took the company extra time to get engineers inside to work on the servers due to the system security in place.

“Every failure like this is an opportunity to learn and get better,” Janardhan wrote.

“From here on out, our job is to … make sure events like this happen as rarely as possible.”

Local News Matters
Advertisement

We strive to deliver the best local independent coverage of the issues that matter to Queenslanders.

Copyright © 2024 InQueensland.
All rights reserved.
Privacy Policy