Method and apparatus for identifying a failure mechanism for a component in a computer system

Bibliographic Details
Main Authors: Usynin, Alexander V, Urmanov, Aleksey M, Gross, Kenny C
Format: Patent
Language: English
Description
Summary: One embodiment of the present invention provides a system that identifies a failure mechanism for a component. The system first constructs a library of failure signatures from a set of components, wherein each failure signature in the library characterizes a known failure mechanism associated with a component in the set. Next, the system receives a telemetry signal measured from a component under surveillance and determines whether the telemetry signal contains an anomaly. If so, the system performs pattern-matching operations between the telemetry signal and the failure signatures in the library. If a match is found, the system identifies the failure mechanism for the component under surveillance based on the matching failure signature.
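
The flow described in the summary (signature library, anomaly check, pattern matching, identification) can be illustrated with a minimal Python sketch. The abstract does not say how anomalies are detected or how signatures are compared, so the threshold test and the Pearson correlation used here are stand-in assumptions, not the patented method. All names, signature shapes, and numeric values (FAILURE_LIBRARY, fan_bearing_wear, intermittent_contact, the baseline and thresholds) are hypothetical.

```python
from math import sqrt
from statistics import mean

# Hypothetical library: failure mechanism name -> signature shape.
# In the patent, signatures are built from a set of components; here they are made up.
FAILURE_LIBRARY = {
    "fan_bearing_wear": [0, 1, 2, 3, 4, 5, 6, 7],      # steady upward drift
    "intermittent_contact": [0, 5, 0, 5, 0, 5, 0, 5],  # rapid oscillation
}


def has_anomaly(signal, baseline_mean, baseline_std, z_threshold=3.0):
    """Assumed anomaly test: any sample more than z_threshold sigmas from baseline."""
    return any(abs(x - baseline_mean) > z_threshold * baseline_std for x in signal)


def correlation(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mean_a, mean_b = mean(a), mean(b)
    dev_a = [x - mean_a for x in a]
    dev_b = [y - mean_b for y in b]
    denom = sqrt(sum(x * x for x in dev_a) * sum(y * y for y in dev_b))
    if denom == 0:
        return 0.0
    return sum(x * y for x, y in zip(dev_a, dev_b)) / denom


def identify_failure_mechanism(signal, library, baseline_mean, baseline_std,
                               match_threshold=0.9):
    """Return the best-matching failure mechanism name, or None.

    None means either no anomaly was found or no signature matched well enough.
    """
    if not has_anomaly(signal, baseline_mean, baseline_std):
        return None
    best_name, best_score = None, match_threshold
    for name, signature in library.items():
        score = correlation(signal, signature)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Usage: a drifting temperature-like signal, a healthy one, and an unmatched spike.
drifting = [20.0, 21.1, 21.9, 23.2, 24.0, 25.1, 25.8, 27.0]
healthy = [20.1, 19.9, 20.2, 19.8, 20.0, 20.1, 19.9, 20.0]
spike = [20.0, 20.0, 20.0, 27.0, 20.0, 20.0, 20.0, 20.0]

for telemetry in (drifting, healthy, spike):
    print(identify_failure_mechanism(telemetry, FAILURE_LIBRARY, 20.0, 0.5))
```

Returning None both when the signal is healthy and when an anomaly matches no known signature keeps the sketch short. A real monitoring system would likely distinguish the two cases, since an unmatched anomaly may indicate a failure mechanism not yet in the library.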