Ordering documents within a crawled website

Systems and methods of the present invention provide for one or more server computers communicatively coupled to a network and configured to: access a source code for each of a plurality of web pages within a website hosted on the server computer; identify, within the source code of each of the plur...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Boyd-Wickizer, Silas, Ansel, Jason
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Systems and methods of the present invention provide for one or more server computers communicatively coupled to a network and configured to: access a source code for each of a plurality of web pages within a website hosted on the server computer; identify, within the source code of each of the plurality of web pages, a plurality of hyperlinks for navigating to at least one of the plurality of web pages; generate a plurality of link groups each comprising at least one common hyperlink between the plurality of hyperlinks; aggregate the plurality of link groups into a unique link group wherein each of the plurality of hyperlinks appears in only one link group; and determine an order of hyperlinks within the unique link group based on an original order of the plurality of hyperlinks.