Method for computing frequency distribution for many fields in one pass in parallel

Provided are techniques for determining a frequency distribution for a set of records. A count table of frequency distributions is built in memory for each field in the set of records, wherein each record of each count table includes a field identifier, a field value, and a count of a number of ti...

Detailed description

Bibliographic details
Main authors: Beckerle, Michael James; Callen, Jerry Lee
Format: Patent
Language: eng
creator Beckerle, Michael James
Callen, Jerry Lee
description Provided are techniques for determining a frequency distribution for a set of records. A count table of frequency distributions is built in memory for each field in the set of records, wherein each record of each count table includes a field identifier, a field value, and a count of a number of times the field value occurs in the set of records, and wherein the field identifier concatenated with the field value comprises a composite key value. It is determined that at least one count table of frequency distributions is approaching a maximum amount of memory allocated to that count table. The records of the at least one count table that is approaching the maximum amount of memory are sent for sorting and additional counting, wherein the records include composite key values.
format Patent
fullrecord <record><control><sourceid>uspatents_EFH</sourceid><recordid>TN_cdi_uspatents_grants_07565349</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>07565349</sourcerecordid><originalsourceid>FETCH-uspatents_grants_075653493</originalsourceid><addsrcrecordid>eNqNijEOwjAMALMwIOAP_gBSpVIQcwXqwgQ7Mo1TIrlOiJOhvwcqHsB0p9MtzfVC-RksuJCgD2Ms2csALtGrkPQTWK85-ccnB5mnEWUC54mtghcIQhBRZ4-YkJl4bRYOWWnz48rA-XRru23RiJkk631I-EV1aPZNvTvWfyxvN8Y5jQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method for computing frequency distribution for many fields in one pass in parallel</title><source>USPTO Issued Patents</source><creator>Beckerle, Michael James ; Callen, Jerry Lee</creator><creatorcontrib>Beckerle, Michael James ; Callen, Jerry Lee ; International Business Machines Corporation</creatorcontrib><description>Provided are a techniques for determining a frequency distribution for a set of records. A count table of frequency distributions is built in memory for each field in the set of records, wherein each record of each count table includes a field identifier, a field value, and a count of a number of times the field value occurs in the set of records, and wherein the field identifier concatenated with the field value comprises a composite key value. It is determined that at least one count table of frequency distributions is approaching a maximum amount of memory allocated to that count table. 
The records of the at least one count table that is approaching the maximum amount of memory are sent for sorting and additional counting, wherein the records include composite key values.</description><language>eng</language><creationdate>2009</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7565349$$EPDF$$P50$$Guspatents$$Hfree_for_read</linktopdf><link.rule.ids>230,308,776,798,881,64012</link.rule.ids><linktorsrc>$$Uhttps://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7565349$$EView_record_in_USPTO$$FView_record_in_$$GUSPTO$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Beckerle, Michael James</creatorcontrib><creatorcontrib>Callen, Jerry Lee</creatorcontrib><creatorcontrib>International Business Machines Corporation</creatorcontrib><title>Method for computing frequency distribution for many fields in one pass in parallel</title><description>Provided are a techniques for determining a frequency distribution for a set of records. A count table of frequency distributions is built in memory for each field in the set of records, wherein each record of each count table includes a field identifier, a field value, and a count of a number of times the field value occurs in the set of records, and wherein the field identifier concatenated with the field value comprises a composite key value. It is determined that at least one count table of frequency distributions is approaching a maximum amount of memory allocated to that count table. 
The records of the at least one count table that is approaching the maximum amount of memory are sent for sorting and additional counting, wherein the records include composite key values.</description><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2009</creationdate><recordtype>patent</recordtype><sourceid>EFH</sourceid><recordid>eNqNijEOwjAMALMwIOAP_gBSpVIQcwXqwgQ7Mo1TIrlOiJOhvwcqHsB0p9MtzfVC-RksuJCgD2Ms2csALtGrkPQTWK85-ccnB5mnEWUC54mtghcIQhBRZ4-YkJl4bRYOWWnz48rA-XRru23RiJkk631I-EV1aPZNvTvWfyxvN8Y5jQ</recordid><startdate>20090721</startdate><enddate>20090721</enddate><creator>Beckerle, Michael James</creator><creator>Callen, Jerry Lee</creator><scope>EFH</scope></search><sort><creationdate>20090721</creationdate><title>Method for computing frequency distribution for many fields in one pass in parallel</title><author>Beckerle, Michael James ; Callen, Jerry Lee</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-uspatents_grants_075653493</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2009</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Beckerle, Michael James</creatorcontrib><creatorcontrib>Callen, Jerry Lee</creatorcontrib><creatorcontrib>International Business Machines Corporation</creatorcontrib><collection>USPTO Issued Patents</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Beckerle, Michael James</au><au>Callen, Jerry Lee</au><aucorp>International Business Machines Corporation</aucorp><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method for computing frequency distribution for many fields in one pass in parallel</title><date>2009-07-21</date><risdate>2009</risdate><abstract>Provided are a techniques for determining a frequency distribution for a set of records. 
A count table of frequency distributions is built in memory for each field in the set of records, wherein each record of each count table includes a field identifier, a field value, and a count of a number of times the field value occurs in the set of records, and wherein the field identifier concatenated with the field value comprises a composite key value. It is determined that at least one count table of frequency distributions is approaching a maximum amount of memory allocated to that count table. The records of the at least one count table that is approaching the maximum amount of memory are sent for sorting and additional counting, wherein the records include composite key values.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
language eng
recordid cdi_uspatents_grants_07565349
source USPTO Issued Patents
title Method for computing frequency distribution for many fields in one pass in parallel
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T10%3A13%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-uspatents_EFH&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Beckerle,%20Michael%20James&rft.aucorp=International%20Business%20Machines%20Corporation&rft.date=2009-07-21&rft_id=info:doi/&rft_dat=%3Cuspatents_EFH%3E07565349%3C/uspatents_EFH%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
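
The description field above outlines a one-pass counting scheme: one in-memory count table per field, rows keyed by the field identifier concatenated with the field value, and a spill of any table that nears its memory allowance to a later sort-and-count step. The following Python sketch illustrates that idea only; it is not the patented implementation. The function names, the `SEP` separator, and the use of an entry-count limit (`max_entries_per_table`) as a stand-in for "maximum amount of memory" are all assumptions made for illustration, and the parallel aspect named in the title is not modelled.

```python
from collections import Counter, defaultdict

# Hypothetical separator joining field identifier and field value into a
# composite key; assumes field identifiers never contain this character.
SEP = "\x1f"


def composite_key(field_id, value):
    """Concatenate field identifier and (stringified) field value."""
    return f"{field_id}{SEP}{value}"


def count_fields(records, max_entries_per_table=1000):
    """Single pass over the records, one in-memory count table per field.

    A table that reaches max_entries_per_table (an assumed proxy for its
    memory allowance) has its rows moved to `spilled` and is emptied.
    """
    tables = defaultdict(Counter)
    spilled = []  # (composite_key, count) rows awaiting sort and re-count
    for record in records:
        for field_id, value in record.items():
            table = tables[field_id]
            table[composite_key(field_id, value)] += 1
            if len(table) >= max_entries_per_table:
                spilled.extend(table.items())
                table.clear()
    return tables, spilled


def merge_counts(tables, spilled):
    """Sort all rows by composite key and sum counts of equal keys."""
    rows = list(spilled)
    for table in tables.values():
        rows.extend(table.items())
    rows.sort(key=lambda row: row[0])
    merged = []
    for key, count in rows:
        if merged and merged[-1][0] == key:
            merged[-1] = (key, merged[-1][1] + count)
        else:
            merged.append((key, count))
    return merged


def frequency_distribution(records, max_entries_per_table=1000):
    """Return {field_id: {value_as_string: count}} for all fields."""
    tables, spilled = count_fields(records, max_entries_per_table)
    result = {}
    for key, count in merge_counts(tables, spilled):
        field_id, value = key.split(SEP, 1)
        result.setdefault(field_id, {})[value] = count
    return result


if __name__ == "__main__":
    sample = [
        {"city": "Ulm", "year": 2009},
        {"city": "Ulm", "year": 2008},
        {"city": "Bonn", "year": 2009},
    ]
    # A tiny limit forces spills so the sort-and-count path is exercised.
    print(frequency_distribution(sample, max_entries_per_table=2))
```

Because the composite key carries the field identifier, rows spilled from different tables can share one sort and still be separated by field afterwards. A partial count spilled early and a later count for the same key are simply summed in the merge step.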