Definition - What does Fault Management mean?
Fault management is a network management component that deals with the detection, isolation and resolution of network problems. The proper implementation of a fault management system can help ensure that networks function optimally.
Techopedia explains Fault Management
In a telecommunications network, fault management refers to a set of functions that detects, isolates and corrects network malfunctions. The system examines error logs, accepts and acts on error detection notifications, traces and identifies faults, and carries out a sequence of diagnostic tests.
When a fault is encountered in a network, a component sends notification to a network operator using protocols such as Simple Network Management Protocol, and uses complex filtering systems to assign alarms to severity levels.
Fault management can be active or passive. Passive fault management is performed by collecting alarms from devices when an error is encountered. Active fault management addresses issues by monitoring devices using certain tools to determine whether devices are active and responding. Applications designed for fault management systems are called fault management platforms.