Amazon investigating Simple Storage Service (S3) 8-hour outage

Amazon's Simple Storage Service (S3) was down on July 20 for more than eight hours, affecting sites like SmugMug, Twitter, Centernetworks, and many of Amazon's own sites, and the company is still investigating the cause of the problem. The Amazon Web Service Health Dashboard shows that the Simple Storage Service and Simple Queue service experienced a […]

Amazon's Simple Storage Service (S3) was down on July 20 for more than eight hours, affecting sites like SmugMug, Twitter, Centernetworks, and many of Amazon's own sites, and the company is still investigating the cause of the problem. The Amazon Web Service Health Dashboard shows that the Simple Storage Service and Simple Queue service experienced a “service disruption.”

Amazon S3 Simple Storage Service was introduced in 2006, is a part of the Amazon Web Services (AWS) suite, also consisting of the Elastic Compute Cloud (EC2) and SimpleDB services.

In a communication with the company, GigaOM's Om Malik received a rather general explanation as to why the service was down: “As a distributed system, the different components of S3 need to be aware of the state of each other. For example, this awareness makes it possible for the system to decide which redundant physical storage server to route a request to.”

“We experienced a problem with those internal system communications, leaving the components unable to interact properly, and customers unable to successfully process requests. After exploring several alternatives, the team determined it had to take the service offline to restore proper communication and then bring service online again.”

Source:→ BetaNews