Jump to content

Failure-oblivious computing

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by SpuriousQ (talk | contribs) at 06:04, 10 June 2005. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Failure-oblivious computing is a technique that enables computer programs to continue executing despite memory errors. The technique handles attempts to read invalid memory by returning a manufactured value to the program, and it ignores invalid writes. This is a great contrast to typical memory checkers, which inform the program of the error or abort the program. In failure-oblivious computing, no attempt is made to inform the program that an error occured.

Failure-oblivious computing was developed by Martin Rinard and other researchers at the MIT Computer Science and Artificial Intelligence Laboratory. They introduced the concept at the 6th Symposium on Operating Systems Design and Implementation in 2004.

Enhancing Server Availability and Security Through Failure-Oblivious Computing (PDF)