A Scalable Topic-Based Open Source Search Engine
Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for...
Gespeichert in:
Hauptverfasser: | , , , , , , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 234 |
---|---|
container_issue | |
container_start_page | 228 |
container_title | |
container_volume | |
creator | Buntine, Wray Lofstrom, Jaakko Perkio, Jukka Perttu, Sami Poroshin, Vladimir Silander, Tomi Tirri, Henry Tuominen, Antti Tuulos, Ville |
description | Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages. |
doi_str_mv | 10.5555/1025132.1026324 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>proquest_acm_b</sourceid><recordid>TN_cdi_acm_books_10_5555_1025132_1026324_brief</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>31535969</sourcerecordid><originalsourceid>FETCH-LOGICAL-a1039-9054e848f1ba44ee393494f62e1965a62a7123ad118be6c76a555695817b1f33</originalsourceid><addsrcrecordid>eNqNkLtOw0AURFdCSEBITesK0djs3Ze9ZYjCQ4qUwu5X15trMDi28eL_Z1H8AUwzzcxodBi7A57pqEfgQoMUWXQjhbpgNzw3VgvgXFyxdQifPEpak-f2mvFNUnrssO4oqYax9ekTBjomh5H6pBzmyVNSEk7-I9n1721Pt-yywS7QevEVq5531fY13R9e3rabfYoQx1PLtaJCFQ3UqBSRtFJZ1RhBYI1GIzAHIfEIUNRkfG4wfo83C8hraKRcsfvz7DgN3zOFH3dqg6euw56GOTgJWmprbAxm5yD6k6uH4Ss44O6PhFtIuIWEq6eWmlh4-GdB_gL5R1tD</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>31535969</pqid></control><display><type>conference_proceeding</type><title>A Scalable Topic-Based Open Source Search Engine</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Buntine, Wray ; Lofstrom, Jaakko ; Perkio, Jukka ; Perttu, Sami ; Poroshin, Vladimir ; Silander, Tomi ; Tirri, Henry ; Tuominen, Antti ; Tuulos, Ville</creator><creatorcontrib>Buntine, Wray ; Lofstrom, Jaakko ; Perkio, Jukka ; Perttu, Sami ; Poroshin, Vladimir ; Silander, Tomi ; Tirri, Henry ; Tuominen, Antti ; Tuulos, Ville</creatorcontrib><description>Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.</description><identifier>ISBN: 0769521002</identifier><identifier>ISBN: 9780769521008</identifier><identifier>DOI: 10.5555/1025132.1026324</identifier><language>eng</language><publisher>Washington, DC, USA: IEEE Computer Society</publisher><subject>Information systems ; Information systems -- Information retrieval</subject><ispartof>IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings, 2004, p.228-234</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>309,310,776,780,785,786,27902</link.rule.ids></links><search><creatorcontrib>Buntine, Wray</creatorcontrib><creatorcontrib>Lofstrom, Jaakko</creatorcontrib><creatorcontrib>Perkio, Jukka</creatorcontrib><creatorcontrib>Perttu, Sami</creatorcontrib><creatorcontrib>Poroshin, Vladimir</creatorcontrib><creatorcontrib>Silander, Tomi</creatorcontrib><creatorcontrib>Tirri, Henry</creatorcontrib><creatorcontrib>Tuominen, Antti</creatorcontrib><creatorcontrib>Tuulos, Ville</creatorcontrib><title>A Scalable Topic-Based Open Source Search Engine</title><title>IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings</title><description>Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.</description><subject>Information systems</subject><subject>Information systems -- Information retrieval</subject><isbn>0769521002</isbn><isbn>9780769521008</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2004</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNqNkLtOw0AURFdCSEBITesK0djs3Ze9ZYjCQ4qUwu5X15trMDi28eL_Z1H8AUwzzcxodBi7A57pqEfgQoMUWXQjhbpgNzw3VgvgXFyxdQifPEpak-f2mvFNUnrssO4oqYax9ekTBjomh5H6pBzmyVNSEk7-I9n1721Pt-yywS7QevEVq5531fY13R9e3rabfYoQx1PLtaJCFQ3UqBSRtFJZ1RhBYI1GIzAHIfEIUNRkfG4wfo83C8hraKRcsfvz7DgN3zOFH3dqg6euw56GOTgJWmprbAxm5yD6k6uH4Ss44O6PhFtIuIWEq6eWmlh4-GdB_gL5R1tD</recordid><startdate>20040920</startdate><enddate>20040920</enddate><creator>Buntine, Wray</creator><creator>Lofstrom, Jaakko</creator><creator>Perkio, Jukka</creator><creator>Perttu, Sami</creator><creator>Poroshin, Vladimir</creator><creator>Silander, Tomi</creator><creator>Tirri, Henry</creator><creator>Tuominen, Antti</creator><creator>Tuulos, Ville</creator><general>IEEE Computer Society</general><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20040920</creationdate><title>A Scalable Topic-Based Open Source Search Engine</title><author>Buntine, Wray ; Lofstrom, Jaakko ; Perkio, Jukka ; Perttu, Sami ; Poroshin, Vladimir ; Silander, Tomi ; Tirri, Henry ; Tuominen, Antti ; Tuulos, Ville</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a1039-9054e848f1ba44ee393494f62e1965a62a7123ad118be6c76a555695817b1f33</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Information systems</topic><topic>Information systems -- Information retrieval</topic><toplevel>online_resources</toplevel><creatorcontrib>Buntine, Wray</creatorcontrib><creatorcontrib>Lofstrom, Jaakko</creatorcontrib><creatorcontrib>Perkio, Jukka</creatorcontrib><creatorcontrib>Perttu, Sami</creatorcontrib><creatorcontrib>Poroshin, Vladimir</creatorcontrib><creatorcontrib>Silander, Tomi</creatorcontrib><creatorcontrib>Tirri, Henry</creatorcontrib><creatorcontrib>Tuominen, Antti</creatorcontrib><creatorcontrib>Tuulos, Ville</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Buntine, Wray</au><au>Lofstrom, Jaakko</au><au>Perkio, Jukka</au><au>Perttu, Sami</au><au>Poroshin, Vladimir</au><au>Silander, Tomi</au><au>Tirri, Henry</au><au>Tuominen, Antti</au><au>Tuulos, Ville</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A Scalable Topic-Based Open Source Search Engine</atitle><btitle>IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings</btitle><date>2004-09-20</date><risdate>2004</risdate><spage>228</spage><epage>234</epage><pages>228-234</pages><isbn>0769521002</isbn><isbn>9780769521008</isbn><abstract>Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.</abstract><cop>Washington, DC, USA</cop><pub>IEEE Computer Society</pub><doi>10.5555/1025132.1026324</doi><tpages>7</tpages></addata></record> |
fulltext | fulltext |
identifier | ISBN: 0769521002 |
ispartof | IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings, 2004, p.228-234 |
issn | |
language | eng |
recordid | cdi_acm_books_10_5555_1025132_1026324_brief |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Information systems Information systems -- Information retrieval |
title | A Scalable Topic-Based Open Source Search Engine |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T08%3A49%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_acm_b&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20Scalable%20Topic-Based%20Open%20Source%20Search%20Engine&rft.btitle=IEEE/WIC%20International%20Conference%20on%20Web%20Intelligence%20(WI%202004)%20:%20Beijing,%20China,%20September%2020-24,%202004%20:%20proceedings&rft.au=Buntine,%20Wray&rft.date=2004-09-20&rft.spage=228&rft.epage=234&rft.pages=228-234&rft.isbn=0769521002&rft.isbn_list=9780769521008&rft_id=info:doi/10.5555/1025132.1026324&rft_dat=%3Cproquest_acm_b%3E31535969%3C/proquest_acm_b%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=31535969&rft_id=info:pmid/&rfr_iscdi=true |