Formal methods for verification of websites macrostructure integrity

Bibliographic details
Published in: Programming and computer software 2000-07, Vol.26 (4), p.186-191
Main authors: Kogalovsky, M. R., Efimova, E. N., Rybina, T. A., Brakhin, V. B.
Format: Article
Language: English
Description
Abstract: The problem of verifying the macrostructure of websites whose information resources are represented in HTML is studied. The macrostructure of a website is understood as the structure of interrelations between its resources based on hyperlinks. The approach presented involves inventorying the information resources of the site, partially parsing its HTML files, locating their hyperlinks, and thereby reconstructing its macrostructure (the hyperlink graph of resources). The description of the site's macrostructure is placed into a relational database and analyzed by formal methods based on relational algebra and graph theory in order to detect irrelevant (dangling) hyperlinks, resources to which no internal hyperlink points, and hypertext pages unreachable from the homepage of the site. The prospects of this approach for websites created with XML technologies are also discussed. A brief description of an implemented prototype of a verification system for HTML sites is given.
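The three checks named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' prototype: the in-memory `pages` dictionary, the `resources` inventory set, the site-absolute paths, and every function name below are assumptions made for illustration, and plain Python sets stand in for the relational tables (set difference and membership tests play the role of the relational-algebra operations).

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Partial parse: collect only href/src attribute values."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def build_link_table(pages):
    """Return a set of (source, target) rows for internal hyperlinks.

    `pages` (hypothetical input) maps a site-absolute path to HTML text.
    """
    rows = set()
    for source, html in pages.items():
        parser = LinkExtractor()
        parser.feed(html)
        for raw in parser.links:
            # Resolve relative links against the source and drop "#fragment".
            target, _fragment = urldefrag(urljoin(source, raw))
            if urlparse(target).scheme:
                continue  # external link (http:, mailto:, ...): out of scope
            rows.add((source, target))
    return rows


def analyze(resources, pages, rows, home):
    """Return (dangling links, orphan resources, unreachable pages)."""
    # Dangling: the link target is not in the resource inventory.
    dangling = {(s, t) for s, t in rows if t not in resources}
    # Orphans: no internal link from another resource points to them.
    # Assumption: the homepage is exempt, since it is the entry point.
    targets = {t for s, t in rows if s != t}
    orphans = resources - targets - {home}
    # Unreachable: breadth-first search over the hyperlink graph from home.
    successors = {}
    for s, t in rows:
        successors.setdefault(s, set()).add(t)
    seen, queue = {home}, deque([home])
    while queue:
        node = queue.popleft()
        for nxt in successors.get(node, ()):
            if nxt in resources and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    unreachable = set(pages) - seen
    return dangling, orphans, unreachable


# Toy site (invented data, for illustration only).
pages = {
    "/index.html": '<a href="about.html">About</a> '
                   '<a href="missing.html">Gone</a> <img src="logo.png">',
    "/about.html": '<a href="/index.html#top">Home</a>',
    "/old.html": '<a href="about.html">About</a>',
    "/a.html": '<a href="b.html">B</a>',
    "/b.html": '<a href="a.html">A</a>',
}
resources = set(pages) | {"/logo.png", "/unused.css"}
rows = build_link_table(pages)
dangling, orphans, unreachable = analyze(resources, pages, rows, "/index.html")
```

In this toy site, `/a.html` and `/b.html` link to each other, so neither is an orphan, yet both are unreachable from the homepage. That is why the reachability check needs a graph traversal rather than a single set difference.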
ISSN: 0361-7688, 1608-3261
DOI: 10.1007/BF02759467