Measuring index quality using random walks on the Web
Recent research has studied how to measure the size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the...
Published in: | Computer networks (1999) 1999-05, Vol.31 (11), p.1291-1303 |
---|---|
Main authors: | Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | 1303 |
---|---|
container_issue | 11 |
container_start_page | 1291 |
container_title | Computer networks (1999) |
container_volume | 31 |
creator | Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc |
description | Recent research has studied how to measure the size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines. |
doi_str_mv | 10.1016/S1389-1286(99)00016-X |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_57468595</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S138912869900016X</els_id><sourcerecordid>42299703</sourcerecordid><originalsourceid>FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</originalsourceid><addsrcrecordid>eNqFkFtLwzAUgIsoOKc_QSgiog_VJG3SnCeR4Q0mPqi4t5AmqWZ27Za06v692UUEX_Z0Dofv3L4oOsToHCPMLp5wyiHBhLNTgDOEQi0ZbUU9zHOS5IjBdsh_kd1oz_txgLKM8F5EH4z0nbP1W2xrbb7jWScr287jzi9qTta6mcRfsvrwcVPH7buJX02xH-2UsvLmYB370cvN9fPgLhk-3t4ProaJoilvk3AdEJYjmYEhuCgKQnLIpCaKl4QRDSnngDGCXEtMMQNeKkZA89IUQChP-9HJau7UNbPO-FZMrFemqmRtms4LmmeMU6AbQcIA8rA9gEf_wHHTuTo8ITAApQQjFiC6gpRrvHemFFNnJ9LNBUZioVwslYuFTwEglsrFKPQdr4dLr2RVBnvK-r9mngJmWcAuV5gJ6j6tccIra2pltHVGtUI3dsOiH7kZkpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>199552106</pqid></control><display><type>article</type><title>Measuring index quality using random walks on the Web</title><source>Access via ScienceDirect (Elsevier)</source><creator>Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc</creator><creatorcontrib>Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc</creatorcontrib><description>Recent research has studied how to measure the
size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the
quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines.</description><identifier>ISSN: 1389-1286</identifier><identifier>EISSN: 1872-7069</identifier><identifier>DOI: 10.1016/S1389-1286(99)00016-X</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Applied sciences ; Comparative analysis ; Evaluation ; Exact sciences and technology ; Index quality ; Indexing ; Information and communication sciences ; Information processing and retrieval ; Information retrieval. Man machine relationship ; Information science. Documentation ; Interconnected networks ; Internet ; Measurement ; Networks and services in france and abroad ; PageRank ; Quality ; Random walk theory ; Random walks ; Research process. Evaluation ; Sciences and techniques of general use ; Search engines ; Searching ; Studies ; Telecommunications ; Telecommunications and information theory ; Teleprocessing networks. Isdn ; World Wide Web</subject><ispartof>Computer networks (1999), 1999-05, Vol.31 (11), p.1291-1303</ispartof><rights>1999</rights><rights>1999 INIST-CNRS</rights><rights>Copyright Elsevier Sequoia S.A. 
May 17, 1999</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</citedby><cites>FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/S1389-1286(99)00016-X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>309,310,314,780,784,789,790,3550,23930,23931,25140,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=1839164$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Henzinger, Monika R.</creatorcontrib><creatorcontrib>Heydon, Allan</creatorcontrib><creatorcontrib>Mitzenmacher, Michael</creatorcontrib><creatorcontrib>Najork, Marc</creatorcontrib><title>Measuring index quality using random walks on the Web</title><title>Computer networks (1999)</title><description>Recent research has studied how to measure the
size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the
quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines.</description><subject>Applied sciences</subject><subject>Comparative analysis</subject><subject>Evaluation</subject><subject>Exact sciences and technology</subject><subject>Index quality</subject><subject>Indexing</subject><subject>Information and communication sciences</subject><subject>Information processing and retrieval</subject><subject>Information retrieval. Man machine relationship</subject><subject>Information science. Documentation</subject><subject>Interconnected networks</subject><subject>Internet</subject><subject>Measurement</subject><subject>Networks and services in france and abroad</subject><subject>PageRank</subject><subject>Quality</subject><subject>Random walk theory</subject><subject>Random walks</subject><subject>Research process. Evaluation</subject><subject>Sciences and techniques of general use</subject><subject>Search engines</subject><subject>Searching</subject><subject>Studies</subject><subject>Telecommunications</subject><subject>Telecommunications and information theory</subject><subject>Teleprocessing networks. 
Isdn</subject><subject>World Wide Web</subject><issn>1389-1286</issn><issn>1872-7069</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1999</creationdate><recordtype>article</recordtype><recordid>eNqFkFtLwzAUgIsoOKc_QSgiog_VJG3SnCeR4Q0mPqi4t5AmqWZ27Za06v692UUEX_Z0Dofv3L4oOsToHCPMLp5wyiHBhLNTgDOEQi0ZbUU9zHOS5IjBdsh_kd1oz_txgLKM8F5EH4z0nbP1W2xrbb7jWScr287jzi9qTta6mcRfsvrwcVPH7buJX02xH-2UsvLmYB370cvN9fPgLhk-3t4ProaJoilvk3AdEJYjmYEhuCgKQnLIpCaKl4QRDSnngDGCXEtMMQNeKkZA89IUQChP-9HJau7UNbPO-FZMrFemqmRtms4LmmeMU6AbQcIA8rA9gEf_wHHTuTo8ITAApQQjFiC6gpRrvHemFFNnJ9LNBUZioVwslYuFTwEglsrFKPQdr4dLr2RVBnvK-r9mngJmWcAuV5gJ6j6tccIra2pltHVGtUI3dsOiH7kZkpA</recordid><startdate>19990517</startdate><enddate>19990517</enddate><creator>Henzinger, Monika R.</creator><creator>Heydon, Allan</creator><creator>Mitzenmacher, Michael</creator><creator>Najork, Marc</creator><general>Elsevier B.V</general><general>Elsevier Science</general><general>Elsevier Sequoia S.A</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>19990517</creationdate><title>Measuring index quality using random walks on the Web</title><author>Henzinger, Monika R. ; Heydon, Allan ; Mitzenmacher, Michael ; Najork, Marc</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c538t-10192670a49e21bbb22794ad2c8f262d9388911097da151698fc629d8feb92583</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1999</creationdate><topic>Applied sciences</topic><topic>Comparative analysis</topic><topic>Evaluation</topic><topic>Exact sciences and technology</topic><topic>Index quality</topic><topic>Indexing</topic><topic>Information and communication sciences</topic><topic>Information processing and retrieval</topic><topic>Information retrieval. 
Man machine relationship</topic><topic>Information science. Documentation</topic><topic>Interconnected networks</topic><topic>Internet</topic><topic>Measurement</topic><topic>Networks and services in france and abroad</topic><topic>PageRank</topic><topic>Quality</topic><topic>Random walk theory</topic><topic>Random walks</topic><topic>Research process. Evaluation</topic><topic>Sciences and techniques of general use</topic><topic>Search engines</topic><topic>Searching</topic><topic>Studies</topic><topic>Telecommunications</topic><topic>Telecommunications and information theory</topic><topic>Teleprocessing networks. Isdn</topic><topic>World Wide Web</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Henzinger, Monika R.</creatorcontrib><creatorcontrib>Heydon, Allan</creatorcontrib><creatorcontrib>Mitzenmacher, Michael</creatorcontrib><creatorcontrib>Najork, Marc</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computer networks (1999)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Henzinger, Monika R.</au><au>Heydon, Allan</au><au>Mitzenmacher, Michael</au><au>Najork, Marc</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Measuring index quality using random walks on the Web</atitle><jtitle>Computer networks 
(1999)</jtitle><date>1999-05-17</date><risdate>1999</risdate><volume>31</volume><issue>11</issue><spage>1291</spage><epage>1303</epage><pages>1291-1303</pages><issn>1389-1286</issn><eissn>1872-7069</eissn><abstract>Recent research has studied how to measure the
size of a search engine, in terms of the number of pages indexed. In this paper, we consider a different measure for search engines, namely the
quality of the pages in a search engine index. We provide a simple, effective algorithm for approximating the quality of an index by performing a random walk on the Web, and we use this methodology to compare the index quality of several major search engines.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/S1389-1286(99)00016-X</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1389-1286 |
ispartof | Computer networks (1999), 1999-05, Vol.31 (11), p.1291-1303 |
issn | 1389-1286 ; 1872-7069 |
language | eng |
recordid | cdi_proquest_miscellaneous_57468595 |
source | Access via ScienceDirect (Elsevier) |
subjects | Applied sciences ; Comparative analysis ; Evaluation ; Exact sciences and technology ; Index quality ; Indexing ; Information and communication sciences ; Information processing and retrieval ; Information retrieval. Man machine relationship ; Information science. Documentation ; Interconnected networks ; Internet ; Measurement ; Networks and services in france and abroad ; PageRank ; Quality ; Random walk theory ; Random walks ; Research process. Evaluation ; Sciences and techniques of general use ; Search engines ; Searching ; Studies ; Telecommunications ; Telecommunications and information theory ; Teleprocessing networks. Isdn ; World Wide Web |
title | Measuring index quality using random walks on the Web |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T03%3A15%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Measuring%20index%20quality%20using%20random%20walks%20on%20the%20Web&rft.jtitle=Computer%20networks%20(1999)&rft.au=Henzinger,%20Monika%20R.&rft.date=1999-05-17&rft.volume=31&rft.issue=11&rft.spage=1291&rft.epage=1303&rft.pages=1291-1303&rft.issn=1389-1286&rft.eissn=1872-7069&rft_id=info:doi/10.1016/S1389-1286(99)00016-X&rft_dat=%3Cproquest_cross%3E42299703%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=199552106&rft_id=info:pmid/&rft_els_id=S138912869900016X&rfr_iscdi=true |