Streaming text data mining method and apparatus using multidimensional subspaces

A streaming text data comparator performs real-time text data mining on streaming text data. The comparator receives a streaming text data document and generates a vector representation of the term frequencies relating to an existing document collection. The comparator then transforms the term frequ...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: FERNG WILLIAM, POTEET STEPHEN R, WU YUAN-JYE, KAO ANNE S-W, CRANFILL ROBERT E
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator FERNG WILLIAM
POTEET STEPHEN R
WU YUAN-JYE
KAO ANNE S-W
CRANFILL ROBERT E
description A streaming text data comparator performs real-time text data mining on streaming text data. The comparator receives a streaming text data document and generates a vector representation of the term frequencies relating to an existing document collection. The comparator then transforms the term frequency vector into a projection in a precomputed multidimensional subspace that represents the original document collection. The comparator further calculates a relationship value representing the similarities or differences between the vector representation and the subspace, and compares the relationship value to a predetermined threshold to determine whether the streaming text data document is related to the original document collection. If the streaming text data document is related, the streaming text data comparator intercalates the new document into the document collection. If the new document is not related, the comparator may store or delete the unrelated document.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US8234279B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US8234279B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US8234279B23</originalsourceid><addsrcrecordid>eNrjZAgILilKTczNzEtXKEmtKFFISSxJVAByQQK5qSUZ-SkKiXlAXFCQWJRYUlqsUFoMlirNKclMycxNzSvOzM9LzFEoLk0qLkhMTi3mYWBNS8wpTuWF0twMCm6uIc4euqkF-fGpYDV5qSXxocEWRsYmRuaWTkbGRCgBAIRdN-w</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Streaming text data mining method and apparatus using multidimensional subspaces</title><source>esp@cenet</source><creator>FERNG WILLIAM ; POTEET STEPHEN R ; WU YUAN-JYE ; KAO ANNE S-W ; CRANFILL ROBERT E</creator><creatorcontrib>FERNG WILLIAM ; POTEET STEPHEN R ; WU YUAN-JYE ; KAO ANNE S-W ; CRANFILL ROBERT E</creatorcontrib><description>A streaming text data comparator performs real-time text data mining on streaming text data. The comparator receives a streaming text data document and generates a vector representation of the term frequencies relating to an existing document collection. The comparator then transforms the term frequency vector into a projection in a precomputed multidimensional subspace that represents the original document collection. The comparator further calculates a relationship value representing the similarities or differences between the vector representation and the subspace, and compares the relationship value to a predetermined threshold to determine whether the streaming text data document is related to the original document collection. If the streaming text data document is related, the streaming text data comparator intercalates the new document into the document collection. If the new document is not related, the comparator may store or delete the unrelated document.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2012</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20120731&amp;DB=EPODOC&amp;CC=US&amp;NR=8234279B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25563,76318</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20120731&amp;DB=EPODOC&amp;CC=US&amp;NR=8234279B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>FERNG WILLIAM</creatorcontrib><creatorcontrib>POTEET STEPHEN R</creatorcontrib><creatorcontrib>WU YUAN-JYE</creatorcontrib><creatorcontrib>KAO ANNE S-W</creatorcontrib><creatorcontrib>CRANFILL ROBERT E</creatorcontrib><title>Streaming text data mining method and apparatus using multidimensional subspaces</title><description>A streaming text data comparator performs real-time text data mining on streaming text data. The comparator receives a streaming text data document and generates a vector representation of the term frequencies relating to an existing document collection. The comparator then transforms the term frequency vector into a projection in a precomputed multidimensional subspace that represents the original document collection. The comparator further calculates a relationship value representing the similarities or differences between the vector representation and the subspace, and compares the relationship value to a predetermined threshold to determine whether the streaming text data document is related to the original document collection. If the streaming text data document is related, the streaming text data comparator intercalates the new document into the document collection. If the new document is not related, the comparator may store or delete the unrelated document.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2012</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZAgILilKTczNzEtXKEmtKFFISSxJVAByQQK5qSUZ-SkKiXlAXFCQWJRYUlqsUFoMlirNKclMycxNzSvOzM9LzFEoLk0qLkhMTi3mYWBNS8wpTuWF0twMCm6uIc4euqkF-fGpYDV5qSXxocEWRsYmRuaWTkbGRCgBAIRdN-w</recordid><startdate>20120731</startdate><enddate>20120731</enddate><creator>FERNG WILLIAM</creator><creator>POTEET STEPHEN R</creator><creator>WU YUAN-JYE</creator><creator>KAO ANNE S-W</creator><creator>CRANFILL ROBERT E</creator><scope>EVB</scope></search><sort><creationdate>20120731</creationdate><title>Streaming text data mining method and apparatus using multidimensional subspaces</title><author>FERNG WILLIAM ; POTEET STEPHEN R ; WU YUAN-JYE ; KAO ANNE S-W ; CRANFILL ROBERT E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US8234279B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2012</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>FERNG WILLIAM</creatorcontrib><creatorcontrib>POTEET STEPHEN R</creatorcontrib><creatorcontrib>WU YUAN-JYE</creatorcontrib><creatorcontrib>KAO ANNE S-W</creatorcontrib><creatorcontrib>CRANFILL ROBERT E</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>FERNG WILLIAM</au><au>POTEET STEPHEN R</au><au>WU YUAN-JYE</au><au>KAO ANNE S-W</au><au>CRANFILL ROBERT E</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Streaming text data mining method and apparatus using multidimensional subspaces</title><date>2012-07-31</date><risdate>2012</risdate><abstract>A streaming text data comparator performs real-time text data mining on streaming text data. The comparator receives a streaming text data document and generates a vector representation of the term frequencies relating to an existing document collection. The comparator then transforms the term frequency vector into a projection in a precomputed multidimensional subspace that represents the original document collection. The comparator further calculates a relationship value representing the similarities or differences between the vector representation and the subspace, and compares the relationship value to a predetermined threshold to determine whether the streaming text data document is related to the original document collection. If the streaming text data document is related, the streaming text data comparator intercalates the new document into the document collection. If the new document is not related, the comparator may store or delete the unrelated document.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US8234279B2
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Streaming text data mining method and apparatus using multidimensional subspaces
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T18%3A28%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=FERNG%20WILLIAM&rft.date=2012-07-31&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS8234279B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true