An application of document filtering in an operational system
This paper describes an applied document filtering system embedded in an operational watch center that monitors disease outbreaks worldwide. At the initial time of this writing, the system effectively supported monitoring of 23 geographic regions by filtering documents in several thousand daily news...
Published in: | Information processing & management 2010-09, Vol.46 (5), p.611-627 |
---|---|
Main authors: | Lehner, Paul; Worrell, Charles; Vu, Chrissy; Mittel, Janet; Snyder, Stephen; Schulte, Eric; Greiff, Warren |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | 627 |
---|---|
container_issue | 5 |
container_start_page | 611 |
container_title | Information processing & management |
container_volume | 46 |
creator | Lehner, Paul; Worrell, Charles; Vu, Chrissy; Mittel, Janet; Snyder, Stephen; Schulte, Eric; Greiff, Warren |
description | This paper describes an applied document filtering system embedded in an operational watch center that monitors disease outbreaks worldwide. At the initial time of this writing, the system effectively supported monitoring of 23 geographic regions by filtering documents in several thousand daily news sources in 11 different languages. This paper describes the filtering algorithm, statistical procedures for estimating Precision and Recall in an operational environment, summarizes operational performance data and suggests lessons learned for other applications of document filtering technology. Overall, these results are interpreted as supporting the general utility of document filtering and information retrieval technology and offers recommendations for future applications of this technology. |
doi_str_mv | 10.1016/j.ipm.2009.12.006 |
format | Article |
backlink | http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=23075199 (View record in Pascal Francis) |
coden | IPMADK |
collection | Pascal-Francis; CrossRef; Library & Information Sciences Abstracts (LISA); Library & Information Science Abstracts (LISA); Computer and Information Systems Abstracts; Materials Business File; Technology Research Database; Materials Research Database; ProQuest Computer Science Collection; Advanced Technologies Database with Aerospace; Computer and Information Systems Abstracts Academic; Computer and Information Systems Abstracts Professional |
creationdate | 2010-09-01 |
eissn | 1873-5371 |
els_id | S0306457310000026 |
linktohtml | https://www.sciencedirect.com/science/article/pii/S0306457310000026 |
pqid | 609997991 |
publisher | Kidlington: Elsevier Ltd |
rights | 2010 Elsevier Ltd; 2015 INIST-CNRS; Copyright Pergamon Press Inc. Sep 2010 |
sourcerecordid | 2080657851 |
toplevel | peer_reviewed; online_resources |
tpages | 17 |
fulltext | fulltext |
identifier | ISSN: 0306-4573 |
ispartof | Information processing & management, 2010-09, Vol.46 (5), p.611-627 |
issn | 0306-4573; 1873-5371 |
language | eng |
recordid | cdi_proquest_miscellaneous_855680569 |
source | Elsevier ScienceDirect Journals |
subjects | Algorithms; Automatic text analysis; Bayesian inference; Clocks; Computerized information retrieval; Document filtering; Exact sciences and technology; Filtering; Filtering systems; Filtration; Information and communication sciences; Information retrieval; Information retrieval systems. Information and document management system; Information science. Documentation; Monitoring; Monitors; News; Recall; Sciences and techniques of general use; Specialized information systems; Studies; Utilities |
title | An application of document filtering in an operational system |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T07%3A28%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20application%20of%20document%20filtering%20in%20an%20operational%20system&rft.jtitle=Information%20processing%20&%20management&rft.au=Lehner,%20Paul&rft.date=2010-09-01&rft.volume=46&rft.issue=5&rft.spage=611&rft.epage=627&rft.pages=611-627&rft.issn=0306-4573&rft.eissn=1873-5371&rft.coden=IPMADK&rft_id=info:doi/10.1016/j.ipm.2009.12.006&rft_dat=%3Cproquest_cross%3E2080657851%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=609997991&rft_id=info:pmid/&rft_els_id=S0306457310000026&rfr_iscdi=true |