GENERATING LOSSLESS STATIC OBJECT MODELS OF DYNAMIC WEBPAGES

Disclosed is technology for generating an accurate and lossless static object model of a dynamic webpage as it is rendered by a browser, including webpages that include a shadow DOM. A method includes receiving, at a computer system, a webpage, instantiating a headless web browser application to ren...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Jakobsen, Per, Jacobsen, Mads
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Disclosed is technology for generating an accurate and lossless static object model of a dynamic webpage as it is rendered by a browser, including webpages that include a shadow DOM. A method includes receiving, at a computer system, a webpage, instantiating a headless web browser application to render the webpage by creating a document object model ("DOM") and a cascading style sheet object model ("CSSOM") in browser memory, the DOM and CSSOM representing dynamic rendered webpage content, injecting a probe script into the headless browser to retrieve the dynamic rendered content, traversing, by the probe script, the DOM, including traversing regular nodes of the DOM and shadow nodes of a shadow DOM, retrieving dynamic information for the regular and shadow nodes, and building a static object model based on the dynamic information.