Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects

Methods, systems, and articles of manufacture consistent with certain principles related to the present invention enable a computing system to perform hierarchical topical clustering of text data based on statistical modeling of co-occurrences of (document, word) pairs. The computing system may be c...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GAUSSIER, ERIC, CHEN, FRANCINE R, POPAT, ASHOK C
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator GAUSSIER, ERIC
CHEN, FRANCINE R
POPAT, ASHOK C
description Methods, systems, and articles of manufacture consistent with certain principles related to the present invention enable a computing system to perform hierarchical topical clustering of text data based on statistical modeling of co-occurrences of (document, word) pairs. The computing system may be configured to receive a collection of documents, each document including a plurality of words, and perform a modified deterministic annealing Expectation-Maximization (EM) process on the collection to produce a softly assigned hierarchy of nodes. The process may involve assigning documents and document fragments to multiple nodes in the hierarchy based on words included in the documents, such that a document may be assigned to any ancestor node included in the hierarchy, thus eliminating the hard assignment of documents in the hierarchy.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP1304627B1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP1304627B1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP1304627B13</originalsourceid><addsrcrecordid>eNqNizEKwkAQRdNYiHqHOYABY0R7JWIjWNjLOJk1K5vdMDNbeHsjegCr_x68Py3cma1LrS5BX2rcj4CxBRTzFFghOegxZodkWRhcEtDkDDrPgkKdJwxAIY9f8fHx6SmViSjL1-9PJtN5MXEYlBe_nRVwbK6HU8lDurEOSBzZbs2lqleb7Xq3r-o_kjf2OT9P</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects</title><source>esp@cenet</source><creator>GAUSSIER, ERIC ; CHEN, FRANCINE R ; POPAT, ASHOK C</creator><creatorcontrib>GAUSSIER, ERIC ; CHEN, FRANCINE R ; POPAT, ASHOK C</creatorcontrib><description>Methods, systems, and articles of manufacture consistent with certain principles related to the present invention enable a computing system to perform hierarchical topical clustering of text data based on statistical modeling of co-occurrences of (document, word) pairs. The computing system may be configured to receive a collection of documents, each document including a plurality of words, and perform a modified deterministic annealing Expectation-Maximization (EM) process on the collection to produce a softly assigned hierarchy of nodes. The process may involve assigning documents and document fragments to multiple nodes in the hierarchy based on words included in the documents, such that a document may be assigned to any ancestor node included in the hierarchy, thus eliminating the hard assignment of documents in the hierarchy.</description><language>eng ; fre ; ger</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2014</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20140402&amp;DB=EPODOC&amp;CC=EP&amp;NR=1304627B1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,778,883,25551,76302</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20140402&amp;DB=EPODOC&amp;CC=EP&amp;NR=1304627B1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GAUSSIER, ERIC</creatorcontrib><creatorcontrib>CHEN, FRANCINE R</creatorcontrib><creatorcontrib>POPAT, ASHOK C</creatorcontrib><title>Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects</title><description>Methods, systems, and articles of manufacture consistent with certain principles related to the present invention enable a computing system to perform hierarchical topical clustering of text data based on statistical modeling of co-occurrences of (document, word) pairs. The computing system may be configured to receive a collection of documents, each document including a plurality of words, and perform a modified deterministic annealing Expectation-Maximization (EM) process on the collection to produce a softly assigned hierarchy of nodes. The process may involve assigning documents and document fragments to multiple nodes in the hierarchy based on words included in the documents, such that a document may be assigned to any ancestor node included in the hierarchy, thus eliminating the hard assignment of documents in the hierarchy.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2014</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNizEKwkAQRdNYiHqHOYABY0R7JWIjWNjLOJk1K5vdMDNbeHsjegCr_x68Py3cma1LrS5BX2rcj4CxBRTzFFghOegxZodkWRhcEtDkDDrPgkKdJwxAIY9f8fHx6SmViSjL1-9PJtN5MXEYlBe_nRVwbK6HU8lDurEOSBzZbs2lqleb7Xq3r-o_kjf2OT9P</recordid><startdate>20140402</startdate><enddate>20140402</enddate><creator>GAUSSIER, ERIC</creator><creator>CHEN, FRANCINE R</creator><creator>POPAT, ASHOK C</creator><scope>EVB</scope></search><sort><creationdate>20140402</creationdate><title>Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects</title><author>GAUSSIER, ERIC ; CHEN, FRANCINE R ; POPAT, ASHOK C</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP1304627B13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2014</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>GAUSSIER, ERIC</creatorcontrib><creatorcontrib>CHEN, FRANCINE R</creatorcontrib><creatorcontrib>POPAT, ASHOK C</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GAUSSIER, ERIC</au><au>CHEN, FRANCINE R</au><au>POPAT, ASHOK C</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects</title><date>2014-04-02</date><risdate>2014</risdate><abstract>Methods, systems, and articles of manufacture consistent with certain principles related to the present invention enable a computing system to perform hierarchical topical clustering of text data based on statistical modeling of co-occurrences of (document, word) pairs. The computing system may be configured to receive a collection of documents, each document including a plurality of words, and perform a modified deterministic annealing Expectation-Maximization (EM) process on the collection to produce a softly assigned hierarchy of nodes. The process may involve assigning documents and document fragments to multiple nodes in the hierarchy based on words included in the documents, such that a document may be assigned to any ancestor node included in the hierarchy, thus eliminating the hard assignment of documents in the hierarchy.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; fre ; ger
recordid cdi_epo_espacenet_EP1304627B1
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T19%3A25%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GAUSSIER,%20ERIC&rft.date=2014-04-02&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP1304627B1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true