Automatic generation method of e-commerce dictionary
The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one |
---|