An application of document filtering in an operational system

This paper describes an applied document filtering system embedded in an operational watch center that monitors disease outbreaks worldwide. At the initial time of this writing, the system effectively supported monitoring of 23 geographic regions by filtering documents in several thousand daily news...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information processing & management 2010-09, Vol.46 (5), p.611-627
Hauptverfasser: Lehner, Paul, Worrell, Charles, Vu, Chrissy, Mittel, Janet, Snyder, Stephen, Schulte, Eric, Greiff, Warren
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 627
container_issue 5
container_start_page 611
container_title Information processing & management
container_volume 46
creator Lehner, Paul
Worrell, Charles
Vu, Chrissy
Mittel, Janet
Snyder, Stephen
Schulte, Eric
Greiff, Warren
description This paper describes an applied document filtering system embedded in an operational watch center that monitors disease outbreaks worldwide. At the initial time of this writing, the system effectively supported monitoring of 23 geographic regions by filtering documents in several thousand daily news sources in 11 different languages. This paper describes the filtering algorithm, statistical procedures for estimating Precision and Recall in an operational environment, summarizes operational performance data and suggests lessons learned for other applications of document filtering technology. Overall, these results are interpreted as supporting the general utility of document filtering and information retrieval technology and offers recommendations for future applications of this technology.
doi_str_mv 10.1016/j.ipm.2009.12.006
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_855680569</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0306457310000026</els_id><sourcerecordid>2080657851</sourcerecordid><originalsourceid>FETCH-LOGICAL-c370t-d1dcaf898210274d115b1a202b690940446a1b9334c3cc8a49b2ba2df3f4df5a3</originalsourceid><addsrcrecordid>eNqFkElrHDEQRkWIIRM7PyC3JmBy6naV1hbBBzPECxh8sc9CrZaCht4i9QT8763JGB98SE51qPfV8gj5itAgoLzYNXEZGwqgG6QNgPxANtgqVgum8CPZAANZc6HYJ_I55x0AcIF0Qy6vpsouyxCdXeM8VXOo-tntRz-tVYjD6lOcflWxQKW3-PSXskOVn_PqxzNyEuyQ_ZfXekqern8-bm_r-4ebu-3Vfe2YgrXusXc2tLqlCFTxHlF0aCnQTmrQHDiXFjvNGHfMudZy3dHO0j6wwPsgLDsl349zlzT_3vu8mjFm54fBTn7eZ9MKIVsQUv-XVKJFZK2Shfz2jtzN-1R-y0aC1lppjQXCI-TSnHPywSwpjjY9GwRzEG92pog3B_EGqSniS-b8dbDNzg4h2cnF_BakDJRAfTj1x5Hzxdyf6JPJLvrJ-T4m71bTz_EfW14AmW2Wmw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>609997991</pqid></control><display><type>article</type><title>An application of document filtering in an operational system</title><source>Elsevier ScienceDirect Journals</source><creator>Lehner, Paul ; Worrell, Charles ; Vu, Chrissy ; Mittel, Janet ; Snyder, Stephen ; Schulte, Eric ; Greiff, Warren</creator><creatorcontrib>Lehner, Paul ; Worrell, Charles ; Vu, Chrissy ; Mittel, Janet ; Snyder, Stephen ; Schulte, Eric ; Greiff, Warren</creatorcontrib><description>This paper describes an applied document filtering system embedded in an operational watch center that monitors disease outbreaks worldwide. At the initial time of this writing, the system effectively supported monitoring of 23 geographic regions by filtering documents in several thousand daily news sources in 11 different languages. This paper describes the filtering algorithm, statistical procedures for estimating Precision and Recall in an operational environment, summarizes operational performance data and suggests lessons learned for other applications of document filtering technology. Overall, these results are interpreted as supporting the general utility of document filtering and information retrieval technology and offers recommendations for future applications of this technology.</description><identifier>ISSN: 0306-4573</identifier><identifier>EISSN: 1873-5371</identifier><identifier>DOI: 10.1016/j.ipm.2009.12.006</identifier><identifier>CODEN: IPMADK</identifier><language>eng</language><publisher>Kidlington: Elsevier Ltd</publisher><subject>Algorithms ; Automatic text analysis ; Bayesian inference ; Clocks ; Computerized information retrieval ; Document filtering ; Exact sciences and technology ; Filtering ; Filtering systems ; Filtration ; Information and communication sciences ; Information retrieval ; Information retrieval systems. Information and document management system ; Information science. Documentation ; Monitoring ; Monitors ; News ; Recall ; Sciences and techniques of general use ; Specialized information systems ; Studies ; Utilities</subject><ispartof>Information processing &amp; management, 2010-09, Vol.46 (5), p.611-627</ispartof><rights>2010 Elsevier Ltd</rights><rights>2015 INIST-CNRS</rights><rights>Copyright Pergamon Press Inc. Sep 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c370t-d1dcaf898210274d115b1a202b690940446a1b9334c3cc8a49b2ba2df3f4df5a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0306457310000026$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=23075199$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Lehner, Paul</creatorcontrib><creatorcontrib>Worrell, Charles</creatorcontrib><creatorcontrib>Vu, Chrissy</creatorcontrib><creatorcontrib>Mittel, Janet</creatorcontrib><creatorcontrib>Snyder, Stephen</creatorcontrib><creatorcontrib>Schulte, Eric</creatorcontrib><creatorcontrib>Greiff, Warren</creatorcontrib><title>An application of document filtering in an operational system</title><title>Information processing &amp; management</title><description>This paper describes an applied document filtering system embedded in an operational watch center that monitors disease outbreaks worldwide. At the initial time of this writing, the system effectively supported monitoring of 23 geographic regions by filtering documents in several thousand daily news sources in 11 different languages. This paper describes the filtering algorithm, statistical procedures for estimating Precision and Recall in an operational environment, summarizes operational performance data and suggests lessons learned for other applications of document filtering technology. Overall, these results are interpreted as supporting the general utility of document filtering and information retrieval technology and offers recommendations for future applications of this technology.</description><subject>Algorithms</subject><subject>Automatic text analysis</subject><subject>Bayesian inference</subject><subject>Clocks</subject><subject>Computerized information retrieval</subject><subject>Document filtering</subject><subject>Exact sciences and technology</subject><subject>Filtering</subject><subject>Filtering systems</subject><subject>Filtration</subject><subject>Information and communication sciences</subject><subject>Information retrieval</subject><subject>Information retrieval systems. Information and document management system</subject><subject>Information science. Documentation</subject><subject>Monitoring</subject><subject>Monitors</subject><subject>News</subject><subject>Recall</subject><subject>Sciences and techniques of general use</subject><subject>Specialized information systems</subject><subject>Studies</subject><subject>Utilities</subject><issn>0306-4573</issn><issn>1873-5371</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNqFkElrHDEQRkWIIRM7PyC3JmBy6naV1hbBBzPECxh8sc9CrZaCht4i9QT8763JGB98SE51qPfV8gj5itAgoLzYNXEZGwqgG6QNgPxANtgqVgum8CPZAANZc6HYJ_I55x0AcIF0Qy6vpsouyxCdXeM8VXOo-tntRz-tVYjD6lOcflWxQKW3-PSXskOVn_PqxzNyEuyQ_ZfXekqern8-bm_r-4ebu-3Vfe2YgrXusXc2tLqlCFTxHlF0aCnQTmrQHDiXFjvNGHfMudZy3dHO0j6wwPsgLDsl349zlzT_3vu8mjFm54fBTn7eZ9MKIVsQUv-XVKJFZK2Shfz2jtzN-1R-y0aC1lppjQXCI-TSnHPywSwpjjY9GwRzEG92pog3B_EGqSniS-b8dbDNzg4h2cnF_BakDJRAfTj1x5Hzxdyf6JPJLvrJ-T4m71bTz_EfW14AmW2Wmw</recordid><startdate>20100901</startdate><enddate>20100901</enddate><creator>Lehner, Paul</creator><creator>Worrell, Charles</creator><creator>Vu, Chrissy</creator><creator>Mittel, Janet</creator><creator>Snyder, Stephen</creator><creator>Schulte, Eric</creator><creator>Greiff, Warren</creator><general>Elsevier Ltd</general><general>Elsevier</general><general>Elsevier Science Ltd</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope><scope>7SC</scope><scope>7TA</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20100901</creationdate><title>An application of document filtering in an operational system</title><author>Lehner, Paul ; Worrell, Charles ; Vu, Chrissy ; Mittel, Janet ; Snyder, Stephen ; Schulte, Eric ; Greiff, Warren</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c370t-d1dcaf898210274d115b1a202b690940446a1b9334c3cc8a49b2ba2df3f4df5a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithms</topic><topic>Automatic text analysis</topic><topic>Bayesian inference</topic><topic>Clocks</topic><topic>Computerized information retrieval</topic><topic>Document filtering</topic><topic>Exact sciences and technology</topic><topic>Filtering</topic><topic>Filtering systems</topic><topic>Filtration</topic><topic>Information and communication sciences</topic><topic>Information retrieval</topic><topic>Information retrieval systems. Information and document management system</topic><topic>Information science. Documentation</topic><topic>Monitoring</topic><topic>Monitors</topic><topic>News</topic><topic>Recall</topic><topic>Sciences and techniques of general use</topic><topic>Specialized information systems</topic><topic>Studies</topic><topic>Utilities</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lehner, Paul</creatorcontrib><creatorcontrib>Worrell, Charles</creatorcontrib><creatorcontrib>Vu, Chrissy</creatorcontrib><creatorcontrib>Mittel, Janet</creatorcontrib><creatorcontrib>Snyder, Stephen</creatorcontrib><creatorcontrib>Schulte, Eric</creatorcontrib><creatorcontrib>Greiff, Warren</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Materials Business File</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Information processing &amp; management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lehner, Paul</au><au>Worrell, Charles</au><au>Vu, Chrissy</au><au>Mittel, Janet</au><au>Snyder, Stephen</au><au>Schulte, Eric</au><au>Greiff, Warren</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An application of document filtering in an operational system</atitle><jtitle>Information processing &amp; management</jtitle><date>2010-09-01</date><risdate>2010</risdate><volume>46</volume><issue>5</issue><spage>611</spage><epage>627</epage><pages>611-627</pages><issn>0306-4573</issn><eissn>1873-5371</eissn><coden>IPMADK</coden><abstract>This paper describes an applied document filtering system embedded in an operational watch center that monitors disease outbreaks worldwide. At the initial time of this writing, the system effectively supported monitoring of 23 geographic regions by filtering documents in several thousand daily news sources in 11 different languages. This paper describes the filtering algorithm, statistical procedures for estimating Precision and Recall in an operational environment, summarizes operational performance data and suggests lessons learned for other applications of document filtering technology. Overall, these results are interpreted as supporting the general utility of document filtering and information retrieval technology and offers recommendations for future applications of this technology.</abstract><cop>Kidlington</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.ipm.2009.12.006</doi><tpages>17</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0306-4573
ispartof Information processing & management, 2010-09, Vol.46 (5), p.611-627
issn 0306-4573
1873-5371
language eng
recordid cdi_proquest_miscellaneous_855680569
source Elsevier ScienceDirect Journals
subjects Algorithms
Automatic text analysis
Bayesian inference
Clocks
Computerized information retrieval
Document filtering
Exact sciences and technology
Filtering
Filtering systems
Filtration
Information and communication sciences
Information retrieval
Information retrieval systems. Information and document management system
Information science. Documentation
Monitoring
Monitors
News
Recall
Sciences and techniques of general use
Specialized information systems
Studies
Utilities
title An application of document filtering in an operational system
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T07%3A28%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20application%20of%20document%20filtering%20in%20an%20operational%20system&rft.jtitle=Information%20processing%20&%20management&rft.au=Lehner,%20Paul&rft.date=2010-09-01&rft.volume=46&rft.issue=5&rft.spage=611&rft.epage=627&rft.pages=611-627&rft.issn=0306-4573&rft.eissn=1873-5371&rft.coden=IPMADK&rft_id=info:doi/10.1016/j.ipm.2009.12.006&rft_dat=%3Cproquest_cross%3E2080657851%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=609997991&rft_id=info:pmid/&rft_els_id=S0306457310000026&rfr_iscdi=true