NLP-based log clustering method and device
The invention provides an NLP-based log clustering method and device, and the method comprises the steps: carrying out the data processing of a log text through the combination of multiple algorithms, i.e., Pundulation, DBSCAN, LCS and JFLEX, obtaining a log clustering result, gathering logs with hi...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention provides an NLP-based log clustering method and device, and the method comprises the steps: carrying out the data processing of a log text through the combination of multiple algorithms, i.e., Pundulation, DBSCAN, LCS and JFLEX, obtaining a log clustering result, gathering logs with high similarity together, and extracting a common log PATTERN, thereby facilitating the discovery of law and generality problems in the logs, and improving the efficiency of log clustering. Problems can be conveniently checked and faults can be conveniently positioned from massive logs, only a small number of log modes are needed for representation, common parts are extracted, independent information is reserved, and the storage cost is reduced.
本申请提供了一种基于NLP的日志聚类方法和装置,通过将Punctuation、DBSCAN、LCS、JFLEX多种算法相互结合进行数据处理加工日志文本得到日志聚类结果,将相似度高的日志聚集在一起,提取共同的日志PATTERN,有利于发现日志中的规律和共性问题,方便从海量日志中排查问题、定位故障,仅需少量日志模式表示,提取共性部分保留独立信息,减少存储成本。 |
---|