Measuring index quality using random walks on the Web

Recent research has studied how to measure the size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computer networks (1999) 1999-05, Vol.31 (11), p.1291-1303
Hauptverfasser: Henzinger, Monika R., Heydon, Allan, Mitzenmacher, Michael, Najork, Marc
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1303
container_issue 11
container_start_page 1291
container_title Computer networks (1999)
container_volume 31
creator Henzinger, Monika R.
Heydon, Allan
Mitzenmacher, Michael
Najork, Marc
description Recent research has studied how to measure the size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines.
doi_str_mv 10.1016/S1389-1286(99)00016-X
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_57468595</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S138912869900016X</els_id><sourcerecordid>42299703</sourcerecordid><originalsourceid>FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</originalsourceid><addsrcrecordid>eNqFkFtLwzAUgIsoOKc_QSgiog_VJG3SnCeR4Q0mPqi4t5AmqWZ27Za06v692UUEX_Z0Dofv3L4oOsToHCPMLp5wyiHBhLNTgDOEQi0ZbUU9zHOS5IjBdsh_kd1oz_txgLKM8F5EH4z0nbP1W2xrbb7jWScr287jzi9qTta6mcRfsvrwcVPH7buJX02xH-2UsvLmYB370cvN9fPgLhk-3t4ProaJoilvk3AdEJYjmYEhuCgKQnLIpCaKl4QRDSnngDGCXEtMMQNeKkZA89IUQChP-9HJau7UNbPO-FZMrFemqmRtms4LmmeMU6AbQcIA8rA9gEf_wHHTuTo8ITAApQQjFiC6gpRrvHemFFNnJ9LNBUZioVwslYuFTwEglsrFKPQdr4dLr2RVBnvK-r9mngJmWcAuV5gJ6j6tccIra2pltHVGtUI3dsOiH7kZkpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>199552106</pqid></control><display><type>article</type><title>Measuring index quality using random walks on the Web</title><source>Access via ScienceDirect (Elsevier)</source><creator>Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc</creator><creatorcontrib>Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc</creatorcontrib><description>Recent research has studied how to measure the size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines.</description><identifier>ISSN: 1389-1286</identifier><identifier>EISSN: 1872-7069</identifier><identifier>DOI: 10.1016/S1389-1286(99)00016-X</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Applied sciences ; Comparative analysis ; Evaluation ; Exact sciences and technology ; Index quality ; Indexing ; Information and communication sciences ; Information processing and retrieval ; Information retrieval. Man machine relationship ; Information science. Documentation ; Interconnected networks ; Internet ; Measurement ; Networks and services in france and abroad ; PageRank ; Quality ; Random walk theory ; Random walks ; Research process. Evaluation ; Sciences and techniques of general use ; Search engines ; Searching ; Studies ; Telecommunications ; Telecommunications and information theory ; Teleprocessing networks. Isdn ; World Wide Web</subject><ispartof>Computer networks (1999), 1999-05, Vol.31 (11), p.1291-1303</ispartof><rights>1999</rights><rights>1999 INIST-CNRS</rights><rights>Copyright Elsevier Sequoia S.A. May 17, 1999</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</citedby><cites>FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/S1389-1286(99)00016-X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>309,310,314,780,784,789,790,3550,23930,23931,25140,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=1839164$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Henzinger, Monika R.</creatorcontrib><creatorcontrib>Heydon, Allan</creatorcontrib><creatorcontrib>Mitzenmacher, Michael</creatorcontrib><creatorcontrib>Najork, Marc</creatorcontrib><title>Measuring index quality using random walks on the Web</title><title>Computer networks (1999)</title><description>Recent research has studied how to measure the size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines.</description><subject>Applied sciences</subject><subject>Comparative analysis</subject><subject>Evaluation</subject><subject>Exact sciences and technology</subject><subject>Index quality</subject><subject>Indexing</subject><subject>Information and communication sciences</subject><subject>Information processing and retrieval</subject><subject>Information retrieval. Man machine relationship</subject><subject>Information science. Documentation</subject><subject>Interconnected networks</subject><subject>Internet</subject><subject>Measurement</subject><subject>Networks and services in france and abroad</subject><subject>PageRank</subject><subject>Quality</subject><subject>Random walk theory</subject><subject>Random walks</subject><subject>Research process. Evaluation</subject><subject>Sciences and techniques of general use</subject><subject>Search engines</subject><subject>Searching</subject><subject>Studies</subject><subject>Telecommunications</subject><subject>Telecommunications and information theory</subject><subject>Teleprocessing networks. Isdn</subject><subject>World Wide Web</subject><issn>1389-1286</issn><issn>1872-7069</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1999</creationdate><recordtype>article</recordtype><recordid>eNqFkFtLwzAUgIsoOKc_QSgiog_VJG3SnCeR4Q0mPqi4t5AmqWZ27Za06v692UUEX_Z0Dofv3L4oOsToHCPMLp5wyiHBhLNTgDOEQi0ZbUU9zHOS5IjBdsh_kd1oz_txgLKM8F5EH4z0nbP1W2xrbb7jWScr287jzi9qTta6mcRfsvrwcVPH7buJX02xH-2UsvLmYB370cvN9fPgLhk-3t4ProaJoilvk3AdEJYjmYEhuCgKQnLIpCaKl4QRDSnngDGCXEtMMQNeKkZA89IUQChP-9HJau7UNbPO-FZMrFemqmRtms4LmmeMU6AbQcIA8rA9gEf_wHHTuTo8ITAApQQjFiC6gpRrvHemFFNnJ9LNBUZioVwslYuFTwEglsrFKPQdr4dLr2RVBnvK-r9mngJmWcAuV5gJ6j6tccIra2pltHVGtUI3dsOiH7kZkpA</recordid><startdate>19990517</startdate><enddate>19990517</enddate><creator>Henzinger, Monika R.</creator><creator>Heydon, Allan</creator><creator>Mitzenmacher, Michael</creator><creator>Najork, Marc</creator><general>Elsevier B.V</general><general>Elsevier Science</general><general>Elsevier Sequoia S.A</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>19990517</creationdate><title>Measuring index quality using random walks on the Web</title><author>Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1999</creationdate><topic>Applied sciences</topic><topic>Comparative analysis</topic><topic>Evaluation</topic><topic>Exact sciences and technology</topic><topic>Index quality</topic><topic>Indexing</topic><topic>Information and communication sciences</topic><topic>Information processing and retrieval</topic><topic>Information retrieval. Man machine relationship</topic><topic>Information science. Documentation</topic><topic>Interconnected networks</topic><topic>Internet</topic><topic>Measurement</topic><topic>Networks and services in france and abroad</topic><topic>PageRank</topic><topic>Quality</topic><topic>Random walk theory</topic><topic>Random walks</topic><topic>Research process. Evaluation</topic><topic>Sciences and techniques of general use</topic><topic>Search engines</topic><topic>Searching</topic><topic>Studies</topic><topic>Telecommunications</topic><topic>Telecommunications and information theory</topic><topic>Teleprocessing networks. Isdn</topic><topic>World Wide Web</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Henzinger, Monika R.</creatorcontrib><creatorcontrib>Heydon, Allan</creatorcontrib><creatorcontrib>Mitzenmacher, Michael</creatorcontrib><creatorcontrib>Najork, Marc</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computer networks (1999)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Henzinger, Monika R.</au><au>Heydon, Allan</au><au>Mitzenmacher, Michael</au><au>Najork, Marc</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Measuring index quality using random walks on the Web</atitle><jtitle>Computer networks (1999)</jtitle><date>1999-05-17</date><risdate>1999</risdate><volume>31</volume><issue>11</issue><spage>1291</spage><epage>1303</epage><pages>1291-1303</pages><issn>1389-1286</issn><eissn>1872-7069</eissn><abstract>Recent research has studied how to measure the size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/S1389-1286(99)00016-X</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1389-1286
ispartof Computer networks (1999), 1999-05, Vol.31 (11), p.1291-1303
issn 1389-1286
1872-7069
language eng
recordid cdi_proquest_miscellaneous_57468595
source Access via ScienceDirect (Elsevier)
subjects Applied sciences
Comparative analysis
Evaluation
Exact sciences and technology
Index quality
Indexing
Information and communication sciences
Information processing and retrieval
Information retrieval. Man machine relationship
Information science. Documentation
Interconnected networks
Internet
Measurement
Networks and services in france and abroad
PageRank
Quality
Random walk theory
Random walks
Research process. Evaluation
Sciences and techniques of general use
Search engines
Searching
Studies
Telecommunications
Telecommunications and information theory
Teleprocessing networks. Isdn
World Wide Web
title Measuring index quality using random walks on the Web
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T03%3A15%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Measuring%20index%20quality%20using%20random%20walks%20on%20the%20Web&rft.jtitle=Computer%20networks%20(1999)&rft.au=Henzinger,%20Monika%20R.&rft.date=1999-05-17&rft.volume=31&rft.issue=11&rft.spage=1291&rft.epage=1303&rft.pages=1291-1303&rft.issn=1389-1286&rft.eissn=1872-7069&rft_id=info:doi/10.1016/S1389-1286(99)00016-X&rft_dat=%3Cproquest_cross%3E42299703%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=199552106&rft_id=info:pmid/&rft_els_id=S138912869900016X&rfr_iscdi=true