Tech: Another Disaster
Posted by Joe
Once again I found myself in a terrible case of IT disaster. Last Friday night I was working on some Exchange maintenance with another sysadmin from my company. We each took two of our Exchange servers and did some work. Two servers (the ones I was working with were in the US, the other two were in other countries). Once we were done, we rebooted the Exchange servers and were pleased that they seemed to come back online and mail started flowing again. However, I quickly noticed that one server in particular, (one of mine of course), was not back online yet. I couldn’t ping it, couldn’t browse to it, nothing, it was dead to the world. Desperately I started making phone calls, no one could make it to the office until the next afternoon. I tried resetting the power ports that server was supposed to be connected to on the UPS in this remote office. Nothing worked. The next day, staff from the remote office were able to determine that the problem was a fatal SCSI controller error. Aparently a memory controller board on the SCSI controller had gone bad. This card was replaced with a spare from a retired server and that fixed the issue. The server is now back online after almost 24 hours of downtime. What a mess! I was so stressed over this I could hardly stand it. I feel much better now that the server is back up!
About JoeI am the author of this blog, IT engineer, husband, father, and somewhat of a nerd.
Comments are closed.