Outage at UAA: A Week Without Critical Information Systems

Bogdan Hoanca (Computer Information Systems, College of Business and Public Policy, University of Alaska Anchorage, Anchorage, AK, USA) and David Fitzgerald (Computer Information Systems, College of Business and Public Policy, University of Alaska Anchorage, Anchorage, AK, USA)
Copyright: © 2013 |Pages: 41
EISBN13: 9781466658950|DOI: 10.4018/jcit.2013040102
OnDemand PDF Download:
$37.50
OnDemand PDF Download
Download link provided immediately after order completion
$37.50

Abstract

With redundant hardware, it is rare that a disk failure results in downtime at the system level. System failures do sometimes occur, typically as a sequence of very rare events that leads to a catastrophic failure. This case describes how a combination of hardware and firmware failures, along with human error, led to the failure of a redundant disk storage unit, which in turn affected several enterprise systems at a major public university. Subsequently, a small number of conservative and seemingly “good” decisions in the process of restoring the system from backups led to negative outcomes, primarily additional downtime over the course of several days. The case illustrates how even well-considered and conservative decisions may seem flawed in hindsight. An important lesson from the case is that it is difficult to justify to management the provision of sufficient backup resources to prevent very low-probability failure events.
InfoSci-OnDemand Powered Search