Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web

We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the ASIST Annual Meeting 2020-10, Vol.57 (1), p.n/a
Hauptverfasser: Schmidt, Thomas, Mosiienko, Anastasiia, Faber, Raffaela, Herzog, Juliane, Wolff, Christian
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 1
container_start_page
container_title Proceedings of the ASIST Annual Meeting
container_volume 57
creator Schmidt, Thomas
Mosiienko, Anastasiia
Faber, Raffaela
Herzog, Juliane
Wolff, Christian
description We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years.
doi_str_mv 10.1002/pra2.392
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2453544056</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2453544056</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</originalsourceid><addsrcrecordid>eNp1kM9Kw0AQxoMoWGrBR1jw4iW6_5I0x1LUChVF2nPYbGbbLelu3E1a6kF8BJ_RJ3FDPXgRBmaY-c3HzBdFlwTfEIzpbeMEvWE5PYkGlGUszikjp3_q82jk_QZjTHiWpjQfRB_LVtf6XZsVmi2e5t-fX8KI-uC1R8JUSNpt07Xg0E57bQ0KIULTNZ1HVqE9lF63gLx0AMavbetRa5E2O_CtXokwqsDrlQlpB7VttmACElTaNfTbF9GZErWH0W8eRsv7u8V0Fs-fHx6nk3ksKac0zkpCmSIqySuicokzTsWYk5RLDkqmCuMchKgyoLgcC5ZIJXlKsyonpchSGLNhdHXUbZx968JxxcZ2LnzqC8oTlnCOkzRQ10dKOuu9A1U0Tm-FOxQEF73BRW9wEQwOaHxE97qGw79c8fI6oT3_A0A3fx8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2453544056</pqid></control><display><type>article</type><title>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</title><source>Alma/SFX Local Collection</source><creator>Schmidt, Thomas ; Mosiienko, Anastasiia ; Faber, Raffaela ; Herzog, Juliane ; Wolff, Christian</creator><creatorcontrib>Schmidt, Thomas ; Mosiienko, Anastasiia ; Faber, Raffaela ; Herzog, Juliane ; Wolff, Christian</creatorcontrib><description>We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years.</description><identifier>ISSN: 2373-9231</identifier><identifier>EISSN: 2373-9231</identifier><identifier>EISSN: 1550-8390</identifier><identifier>DOI: 10.1002/pra2.392</identifier><language>eng</language><publisher>Hoboken, USA: John Wiley &amp; Sons, Inc</publisher><subject>colors ; Computer vision ; html ; HyperText Markup Language ; visual complexity ; web design ; web history ; Websites</subject><ispartof>Proceedings of the ASIST Annual Meeting, 2020-10, Vol.57 (1), p.n/a</ispartof><rights>83rd Annual Meeting of the Association for Information Science &amp; Technology October 25‐29, 2020. Author(s) retain copyright, but ASIS&amp;T receives an exclusive publication license</rights><rights>2020 ASIS&amp;T</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</citedby><cites>FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Schmidt, Thomas</creatorcontrib><creatorcontrib>Mosiienko, Anastasiia</creatorcontrib><creatorcontrib>Faber, Raffaela</creatorcontrib><creatorcontrib>Herzog, Juliane</creatorcontrib><creatorcontrib>Wolff, Christian</creatorcontrib><title>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</title><title>Proceedings of the ASIST Annual Meeting</title><description>We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years.</description><subject>colors</subject><subject>Computer vision</subject><subject>html</subject><subject>HyperText Markup Language</subject><subject>visual complexity</subject><subject>web design</subject><subject>web history</subject><subject>Websites</subject><issn>2373-9231</issn><issn>2373-9231</issn><issn>1550-8390</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp1kM9Kw0AQxoMoWGrBR1jw4iW6_5I0x1LUChVF2nPYbGbbLelu3E1a6kF8BJ_RJ3FDPXgRBmaY-c3HzBdFlwTfEIzpbeMEvWE5PYkGlGUszikjp3_q82jk_QZjTHiWpjQfRB_LVtf6XZsVmi2e5t-fX8KI-uC1R8JUSNpt07Xg0E57bQ0KIULTNZ1HVqE9lF63gLx0AMavbetRa5E2O_CtXokwqsDrlQlpB7VttmACElTaNfTbF9GZErWH0W8eRsv7u8V0Fs-fHx6nk3ksKac0zkpCmSIqySuicokzTsWYk5RLDkqmCuMchKgyoLgcC5ZIJXlKsyonpchSGLNhdHXUbZx968JxxcZ2LnzqC8oTlnCOkzRQ10dKOuu9A1U0Tm-FOxQEF73BRW9wEQwOaHxE97qGw79c8fI6oT3_A0A3fx8</recordid><startdate>20201001</startdate><enddate>20201001</enddate><creator>Schmidt, Thomas</creator><creator>Mosiienko, Anastasiia</creator><creator>Faber, Raffaela</creator><creator>Herzog, Juliane</creator><creator>Wolff, Christian</creator><general>John Wiley &amp; Sons, Inc</general><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope></search><sort><creationdate>20201001</creationdate><title>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</title><author>Schmidt, Thomas ; Mosiienko, Anastasiia ; Faber, Raffaela ; Herzog, Juliane ; Wolff, Christian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>colors</topic><topic>Computer vision</topic><topic>html</topic><topic>HyperText Markup Language</topic><topic>visual complexity</topic><topic>web design</topic><topic>web history</topic><topic>Websites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Schmidt, Thomas</creatorcontrib><creatorcontrib>Mosiienko, Anastasiia</creatorcontrib><creatorcontrib>Faber, Raffaela</creatorcontrib><creatorcontrib>Herzog, Juliane</creatorcontrib><creatorcontrib>Wolff, Christian</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>Proceedings of the ASIST Annual Meeting</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Schmidt, Thomas</au><au>Mosiienko, Anastasiia</au><au>Faber, Raffaela</au><au>Herzog, Juliane</au><au>Wolff, Christian</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</atitle><jtitle>Proceedings of the ASIST Annual Meeting</jtitle><date>2020-10-01</date><risdate>2020</risdate><volume>57</volume><issue>1</issue><epage>n/a</epage><issn>2373-9231</issn><eissn>2373-9231</eissn><eissn>1550-8390</eissn><abstract>We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years.</abstract><cop>Hoboken, USA</cop><pub>John Wiley &amp; Sons, Inc</pub><doi>10.1002/pra2.392</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2373-9231
ispartof Proceedings of the ASIST Annual Meeting, 2020-10, Vol.57 (1), p.n/a
issn 2373-9231
2373-9231
1550-8390
language eng
recordid cdi_proquest_journals_2453544056
source Alma/SFX Local Collection
subjects colors
Computer vision
html
HyperText Markup Language
visual complexity
web design
web history
Websites
title Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T01%3A56%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Utilizing%20HTML%E2%80%90analysis%20and%20computer%20vision%20on%20a%20corpus%20of%20website%20screenshots%20to%20investigate%20design%20developments%20on%20the%20web&rft.jtitle=Proceedings%20of%20the%20ASIST%20Annual%20Meeting&rft.au=Schmidt,%20Thomas&rft.date=2020-10-01&rft.volume=57&rft.issue=1&rft.epage=n/a&rft.issn=2373-9231&rft.eissn=2373-9231&rft_id=info:doi/10.1002/pra2.392&rft_dat=%3Cproquest_cross%3E2453544056%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2453544056&rft_id=info:pmid/&rfr_iscdi=true