Automatic generation method of e-commerce dictionary

The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHEN HAO, FAN YINGLEI, YAO MINGDONG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHEN HAO
FAN YINGLEI
YAO MINGDONG
description The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN102902757A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN102902757A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN102902757A3</originalsourceid><addsrcrecordid>eNrjZDBxLC3Jz00syUxWSE_NSy0CsvLzFHJTSzLyUxTy0xRSdZPzc3NTi5JTFVIyk0GSiUWVPAysaYk5xam8UJqbQdHNNcTZQze1ID8-tbggMRloVEm8s5-hgZGlgZG5qbmjMTFqAEXWLPk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Automatic generation method of e-commerce dictionary</title><source>esp@cenet</source><creator>CHEN HAO ; FAN YINGLEI ; YAO MINGDONG</creator><creatorcontrib>CHEN HAO ; FAN YINGLEI ; YAO MINGDONG</creatorcontrib><description>The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2013</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20130130&amp;DB=EPODOC&amp;CC=CN&amp;NR=102902757A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20130130&amp;DB=EPODOC&amp;CC=CN&amp;NR=102902757A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN HAO</creatorcontrib><creatorcontrib>FAN YINGLEI</creatorcontrib><creatorcontrib>YAO MINGDONG</creatorcontrib><title>Automatic generation method of e-commerce dictionary</title><description>The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2013</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDBxLC3Jz00syUxWSE_NSy0CsvLzFHJTSzLyUxTy0xRSdZPzc3NTi5JTFVIyk0GSiUWVPAysaYk5xam8UJqbQdHNNcTZQze1ID8-tbggMRloVEm8s5-hgZGlgZG5qbmjMTFqAEXWLPk</recordid><startdate>20130130</startdate><enddate>20130130</enddate><creator>CHEN HAO</creator><creator>FAN YINGLEI</creator><creator>YAO MINGDONG</creator><scope>EVB</scope></search><sort><creationdate>20130130</creationdate><title>Automatic generation method of e-commerce dictionary</title><author>CHEN HAO ; FAN YINGLEI ; YAO MINGDONG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN102902757A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2013</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN HAO</creatorcontrib><creatorcontrib>FAN YINGLEI</creatorcontrib><creatorcontrib>YAO MINGDONG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN HAO</au><au>FAN YINGLEI</au><au>YAO MINGDONG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Automatic generation method of e-commerce dictionary</title><date>2013-01-30</date><risdate>2013</risdate><abstract>The invention discloses an automatic generation method of an e-commerce dictionary. The automatic generation method comprises the following steps of 1 data crawling: crawling original commodity data from an e-commerce website and a search engine; 2 pretreatment; 3 exhaustion in a mode going forward one by one; 4 word frequency statistics; 5 merger treatment; 6 redundancy filtering; 7 regular type filtering; 8 potential word compensation; 9 low frequency word rejecting; and 10 feature word compensation. The automatic generation method mainly has the advantages of being high in dictionary generation speed, adopting algorithms such as machine learning, intelligent filtering, error correction and compensation to automatically generate the dictionary, and being capable of greatly improving generation efficiency; being high in including rate of the generated dictionary, enabling fewer entries to be leaked in a word segmentation process due to the fact that a method of exhaustion in the mode going forward one by one</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN102902757A
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Automatic generation method of e-commerce dictionary
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T11%3A49%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20HAO&rft.date=2013-01-30&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN102902757A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true