A Scalable Topic-Based Open Source Search Engine

Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for...

Bibliographic Details
Main Authors: Buntine, Wray; Lofstrom, Jaakko; Perkio, Jukka; Perttu, Sami; Poroshin, Vladimir; Silander, Tomi; Tirri, Henry; Tuominen, Antti; Tuulos, Ville
Format: Conference Proceeding
Language: eng
Subjects:
Online Access: Full text
container_end_page 234
container_issue
container_start_page 228
container_title
container_volume
creator Buntine, Wray
Lofstrom, Jaakko
Perkio, Jukka
Perttu, Sami
Poroshin, Vladimir
Silander, Tomi
Tirri, Henry
Tuominen, Antti
Tuulos, Ville
description Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.
doi_str_mv 10.5555/1025132.1026324
format Conference Proceeding
fullrecord <record><control><sourceid>proquest_acm_b</sourceid><recordid>TN_cdi_acm_books_10_5555_1025132_1026324_brief</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>31535969</sourcerecordid><originalsourceid>FETCH-LOGICAL-a1039-9054e848f1ba44ee393494f62e1965a62a7123ad118be6c76a555695817b1f33</originalsourceid><addsrcrecordid>eNqNkLtOw0AURFdCSEBITesK0djs3Ze9ZYjCQ4qUwu5X15trMDi28eL_Z1H8AUwzzcxodBi7A57pqEfgQoMUWXQjhbpgNzw3VgvgXFyxdQifPEpak-f2mvFNUnrssO4oqYax9ekTBjomh5H6pBzmyVNSEk7-I9n1721Pt-yywS7QevEVq5531fY13R9e3rabfYoQx1PLtaJCFQ3UqBSRtFJZ1RhBYI1GIzAHIfEIUNRkfG4wfo83C8hraKRcsfvz7DgN3zOFH3dqg6euw56GOTgJWmprbAxm5yD6k6uH4Ss44O6PhFtIuIWEq6eWmlh4-GdB_gL5R1tD</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>31535969</pqid></control><display><type>conference_proceeding</type><title>A Scalable Topic-Based Open Source Search Engine</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Buntine, Wray ; Lofstrom, Jaakko ; Perkio, Jukka ; Perttu, Sami ; Poroshin, Vladimir ; Silander, Tomi ; Tirri, Henry ; Tuominen, Antti ; Tuulos, Ville</creator><creatorcontrib>Buntine, Wray ; Lofstrom, Jaakko ; Perkio, Jukka ; Perttu, Sami ; Poroshin, Vladimir ; Silander, Tomi ; Tirri, Henry ; Tuominen, Antti ; Tuulos, Ville</creatorcontrib><description>Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. 
This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.</description><identifier>ISBN: 0769521002</identifier><identifier>ISBN: 9780769521008</identifier><identifier>DOI: 10.5555/1025132.1026324</identifier><language>eng</language><publisher>Washington, DC, USA: IEEE Computer Society</publisher><subject>Information systems ; Information systems -- Information retrieval</subject><ispartof>IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings, 2004, p.228-234</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>309,310,776,780,785,786,27902</link.rule.ids></links><search><creatorcontrib>Buntine, Wray</creatorcontrib><creatorcontrib>Lofstrom, Jaakko</creatorcontrib><creatorcontrib>Perkio, Jukka</creatorcontrib><creatorcontrib>Perttu, Sami</creatorcontrib><creatorcontrib>Poroshin, Vladimir</creatorcontrib><creatorcontrib>Silander, Tomi</creatorcontrib><creatorcontrib>Tirri, Henry</creatorcontrib><creatorcontrib>Tuominen, Antti</creatorcontrib><creatorcontrib>Tuulos, Ville</creatorcontrib><title>A Scalable Topic-Based Open Source Search Engine</title><title>IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings</title><description>Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. 
We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.</description><subject>Information systems</subject><subject>Information systems -- Information retrieval</subject><isbn>0769521002</isbn><isbn>9780769521008</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2004</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNqNkLtOw0AURFdCSEBITesK0djs3Ze9ZYjCQ4qUwu5X15trMDi28eL_Z1H8AUwzzcxodBi7A57pqEfgQoMUWXQjhbpgNzw3VgvgXFyxdQifPEpak-f2mvFNUnrssO4oqYax9ekTBjomh5H6pBzmyVNSEk7-I9n1721Pt-yywS7QevEVq5531fY13R9e3rabfYoQx1PLtaJCFQ3UqBSRtFJZ1RhBYI1GIzAHIfEIUNRkfG4wfo83C8hraKRcsfvz7DgN3zOFH3dqg6euw56GOTgJWmprbAxm5yD6k6uH4Ss44O6PhFtIuIWEq6eWmlh4-GdB_gL5R1tD</recordid><startdate>20040920</startdate><enddate>20040920</enddate><creator>Buntine, Wray</creator><creator>Lofstrom, Jaakko</creator><creator>Perkio, Jukka</creator><creator>Perttu, Sami</creator><creator>Poroshin, Vladimir</creator><creator>Silander, Tomi</creator><creator>Tirri, Henry</creator><creator>Tuominen, Antti</creator><creator>Tuulos, Ville</creator><general>IEEE Computer Society</general><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20040920</creationdate><title>A Scalable Topic-Based Open Source Search Engine</title><author>Buntine, Wray ; Lofstrom, Jaakko ; Perkio, Jukka ; Perttu, Sami ; Poroshin, Vladimir ; Silander, Tomi ; Tirri, Henry ; Tuominen, Antti ; Tuulos, 
Ville</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a1039-9054e848f1ba44ee393494f62e1965a62a7123ad118be6c76a555695817b1f33</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Information systems</topic><topic>Information systems -- Information retrieval</topic><toplevel>online_resources</toplevel><creatorcontrib>Buntine, Wray</creatorcontrib><creatorcontrib>Lofstrom, Jaakko</creatorcontrib><creatorcontrib>Perkio, Jukka</creatorcontrib><creatorcontrib>Perttu, Sami</creatorcontrib><creatorcontrib>Poroshin, Vladimir</creatorcontrib><creatorcontrib>Silander, Tomi</creatorcontrib><creatorcontrib>Tirri, Henry</creatorcontrib><creatorcontrib>Tuominen, Antti</creatorcontrib><creatorcontrib>Tuulos, Ville</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Buntine, Wray</au><au>Lofstrom, Jaakko</au><au>Perkio, Jukka</au><au>Perttu, Sami</au><au>Poroshin, Vladimir</au><au>Silander, Tomi</au><au>Tirri, Henry</au><au>Tuominen, Antti</au><au>Tuulos, Ville</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A Scalable Topic-Based Open Source Search Engine</atitle><btitle>IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : 
proceedings</btitle><date>2004-09-20</date><risdate>2004</risdate><spage>228</spage><epage>234</epage><pages>228-234</pages><isbn>0769521002</isbn><isbn>9780769521008</isbn><abstract>Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.</abstract><cop>Washington, DC, USA</cop><pub>IEEE Computer Society</pub><doi>10.5555/1025132.1026324</doi><tpages>7</tpages></addata></record>
fulltext fulltext
identifier ISBN: 0769521002
ispartof IEEE/WIC International Conference on Web Intelligence (WI 2004) : Beijing, China, September 20-24, 2004 : proceedings, 2004, p.228-234
issn
language eng
recordid cdi_acm_books_10_5555_1025132_1026324_brief
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Information systems
Information systems -- Information retrieval
title A Scalable Topic-Based Open Source Search Engine
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T08%3A49%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_acm_b&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20Scalable%20Topic-Based%20Open%20Source%20Search%20Engine&rft.btitle=IEEE/WIC%20International%20Conference%20on%20Web%20Intelligence%20(WI%202004)%20:%20Beijing,%20China,%20September%2020-24,%202004%20:%20proceedings&rft.au=Buntine,%20Wray&rft.date=2004-09-20&rft.spage=228&rft.epage=234&rft.pages=228-234&rft.isbn=0769521002&rft.isbn_list=9780769521008&rft_id=info:doi/10.5555/1025132.1026324&rft_dat=%3Cproquest_acm_b%3E31535969%3C/proquest_acm_b%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=31535969&rft_id=info:pmid/&rfr_iscdi=true