Fault tolerance refers to how an operating system responds to a hardware or software failure. Fault tolerance is essentially a system’s ability to allow for failures or malfunctions, and this ability can be provided by software, hardware or a combination of both. Some computer systems have two or more duplicate systems in order to handle faults gracefully.
Fault tolerance software may be part of the OS interface, allowing the programmer to check critical data at specific points during a transaction. Fault tolerance can include:
In general, 100% fault tolerance can never be achieved due to cost constraints.
Read More »
Get Techopedia delivered to your inbox!