Automatic generation method of e-commerce dictionary
The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | CHEN HAO FAN YINGLEI YAO MINGDONG |
description | The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN102902757A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN102902757A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN102902757A3</originalsourceid><addsrcrecordid>eNrjZDBxLC3Jz00syUxWSE_NSy0CsvLzFHJTSzLyUxTy0xRSdZPzc3NTi5JTFVIyk0GSiUWVPAysaYk5xam8UJqbQdHNNcTZQze1ID8-tbggMRloVEm8s5-hgZGlgZG5qbmjMTFqAEXWLPk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Automatic generation method of e-commerce dictionary</title><source>esp@cenet</source><creator>CHEN HAO ; FAN YINGLEI ; YAO MINGDONG</creator><creatorcontrib>CHEN HAO ; FAN YINGLEI ; YAO MINGDONG</creatorcontrib><description>The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2013</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20130130&DB=EPODOC&CC=CN&NR=102902757A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20130130&DB=EPODOC&CC=CN&NR=102902757A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN HAO</creatorcontrib><creatorcontrib>FAN YINGLEI</creatorcontrib><creatorcontrib>YAO MINGDONG</creatorcontrib><title>Automatic generation method of e-commerce dictionary</title><description>The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2013</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDBxLC3Jz00syUxWSE_NSy0CsvLzFHJTSzLyUxTy0xRSdZPzc3NTi5JTFVIyk0GSiUWVPAysaYk5xam8UJqbQdHNNcTZQze1ID8-tbggMRloVEm8s5-hgZGlgZG5qbmjMTFqAEXWLPk</recordid><startdate>20130130</startdate><enddate>20130130</enddate><creator>CHEN HAO</creator><creator>FAN YINGLEI</creator><creator>YAO MINGDONG</creator><scope>EVB</scope></search><sort><creationdate>20130130</creationdate><title>Automatic generation method of e-commerce dictionary</title><author>CHEN HAO ; FAN YINGLEI ; YAO MINGDONG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN102902757A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2013</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN HAO</creatorcontrib><creatorcontrib>FAN YINGLEI</creatorcontrib><creatorcontrib>YAO MINGDONG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN HAO</au><au>FAN YINGLEI</au><au>YAO MINGDONG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Automatic generation method of e-commerce dictionary</title><date>2013-01-30</date><risdate>2013</risdate><abstract>The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN102902757A |
source | esp@cenet |
subjects | CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS |
title | Automatic generation method of e-commerce dictionary |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T11%3A49%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20HAO&rft.date=2013-01-30&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN102902757A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |