Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web
We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of...
Gespeichert in:
Veröffentlicht in: | Proceedings of the ASIST Annual Meeting 2020-10, Vol.57 (1), p.n/a |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | n/a |
---|---|
container_issue | 1 |
container_start_page | |
container_title | Proceedings of the ASIST Annual Meeting |
container_volume | 57 |
creator | Schmidt, Thomas Mosiienko, Anastasiia Faber, Raffaela Herzog, Juliane Wolff, Christian |
description | We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years. |
doi_str_mv | 10.1002/pra2.392 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2453544056</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2453544056</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</originalsourceid><addsrcrecordid>eNp1kM9Kw0AQxoMoWGrBR1jw4iW6_5I0x1LUChVF2nPYbGbbLelu3E1a6kF8BJ_RJ3FDPXgRBmaY-c3HzBdFlwTfEIzpbeMEvWE5PYkGlGUszikjp3_q82jk_QZjTHiWpjQfRB_LVtf6XZsVmi2e5t-fX8KI-uC1R8JUSNpt07Xg0E57bQ0KIULTNZ1HVqE9lF63gLx0AMavbetRa5E2O_CtXokwqsDrlQlpB7VttmACElTaNfTbF9GZErWH0W8eRsv7u8V0Fs-fHx6nk3ksKac0zkpCmSIqySuicokzTsWYk5RLDkqmCuMchKgyoLgcC5ZIJXlKsyonpchSGLNhdHXUbZx968JxxcZ2LnzqC8oTlnCOkzRQ10dKOuu9A1U0Tm-FOxQEF73BRW9wEQwOaHxE97qGw79c8fI6oT3_A0A3fx8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2453544056</pqid></control><display><type>article</type><title>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</title><source>Alma/SFX Local Collection</source><creator>Schmidt, Thomas ; Mosiienko, Anastasiia ; Faber, Raffaela ; Herzog, Juliane ; Wolff, Christian</creator><creatorcontrib>Schmidt, Thomas ; Mosiienko, Anastasiia ; Faber, Raffaela ; Herzog, Juliane ; Wolff, Christian</creatorcontrib><description>We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years.</description><identifier>ISSN: 2373-9231</identifier><identifier>EISSN: 2373-9231</identifier><identifier>EISSN: 1550-8390</identifier><identifier>DOI: 10.1002/pra2.392</identifier><language>eng</language><publisher>Hoboken, USA: John Wiley & Sons, Inc</publisher><subject>colors ; Computer vision ; html ; HyperText Markup Language ; visual complexity ; web design ; web history ; Websites</subject><ispartof>Proceedings of the ASIST Annual Meeting, 2020-10, Vol.57 (1), p.n/a</ispartof><rights>83rd Annual Meeting of the Association for Information Science & Technology October 25‐29, 2020. Author(s) retain copyright, but ASIS&T receives an exclusive publication license</rights><rights>2020 ASIS&T</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</citedby><cites>FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Schmidt, Thomas</creatorcontrib><creatorcontrib>Mosiienko, Anastasiia</creatorcontrib><creatorcontrib>Faber, Raffaela</creatorcontrib><creatorcontrib>Herzog, Juliane</creatorcontrib><creatorcontrib>Wolff, Christian</creatorcontrib><title>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</title><title>Proceedings of the ASIST Annual Meeting</title><description>We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years.</description><subject>colors</subject><subject>Computer vision</subject><subject>html</subject><subject>HyperText Markup Language</subject><subject>visual complexity</subject><subject>web design</subject><subject>web history</subject><subject>Websites</subject><issn>2373-9231</issn><issn>2373-9231</issn><issn>1550-8390</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp1kM9Kw0AQxoMoWGrBR1jw4iW6_5I0x1LUChVF2nPYbGbbLelu3E1a6kF8BJ_RJ3FDPXgRBmaY-c3HzBdFlwTfEIzpbeMEvWE5PYkGlGUszikjp3_q82jk_QZjTHiWpjQfRB_LVtf6XZsVmi2e5t-fX8KI-uC1R8JUSNpt07Xg0E57bQ0KIULTNZ1HVqE9lF63gLx0AMavbetRa5E2O_CtXokwqsDrlQlpB7VttmACElTaNfTbF9GZErWH0W8eRsv7u8V0Fs-fHx6nk3ksKac0zkpCmSIqySuicokzTsWYk5RLDkqmCuMchKgyoLgcC5ZIJXlKsyonpchSGLNhdHXUbZx968JxxcZ2LnzqC8oTlnCOkzRQ10dKOuu9A1U0Tm-FOxQEF73BRW9wEQwOaHxE97qGw79c8fI6oT3_A0A3fx8</recordid><startdate>20201001</startdate><enddate>20201001</enddate><creator>Schmidt, Thomas</creator><creator>Mosiienko, Anastasiia</creator><creator>Faber, Raffaela</creator><creator>Herzog, Juliane</creator><creator>Wolff, Christian</creator><general>John Wiley & Sons, Inc</general><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope></search><sort><creationdate>20201001</creationdate><title>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</title><author>Schmidt, Thomas ; Mosiienko, Anastasiia ; Faber, Raffaela ; Herzog, Juliane ; Wolff, Christian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2422-7b123f1f59d1f9c0742a84164c4efc6f009eaad7e20b8a35cfc4627d91ba76e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>colors</topic><topic>Computer vision</topic><topic>html</topic><topic>HyperText Markup Language</topic><topic>visual complexity</topic><topic>web design</topic><topic>web history</topic><topic>Websites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Schmidt, Thomas</creatorcontrib><creatorcontrib>Mosiienko, Anastasiia</creatorcontrib><creatorcontrib>Faber, Raffaela</creatorcontrib><creatorcontrib>Herzog, Juliane</creatorcontrib><creatorcontrib>Wolff, Christian</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>Proceedings of the ASIST Annual Meeting</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Schmidt, Thomas</au><au>Mosiienko, Anastasiia</au><au>Faber, Raffaela</au><au>Herzog, Juliane</au><au>Wolff, Christian</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web</atitle><jtitle>Proceedings of the ASIST Annual Meeting</jtitle><date>2020-10-01</date><risdate>2020</risdate><volume>57</volume><issue>1</issue><epage>n/a</epage><issn>2373-9231</issn><eissn>2373-9231</eissn><eissn>1550-8390</eissn><abstract>We present preliminary results of a project investigating the design development of popular websites between 1996 and 2020 via HTML analysis and basic computer vision methods. We acquired a corpus of website screenshots of the current top 47 popular websites. We crawled a snapshot of every month of these websites via the wayback machine of the Internet Archive platform since the time snapshots are stored to gather 7,953 screenshots and HTML pages. We report upon quantitative analysis results concerning HTML elements, color distributions and visual complexity throughout the years.</abstract><cop>Hoboken, USA</cop><pub>John Wiley & Sons, Inc</pub><doi>10.1002/pra2.392</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2373-9231 |
ispartof | Proceedings of the ASIST Annual Meeting, 2020-10, Vol.57 (1), p.n/a |
issn | 2373-9231 2373-9231 1550-8390 |
language | eng |
recordid | cdi_proquest_journals_2453544056 |
source | Alma/SFX Local Collection |
subjects | colors Computer vision html HyperText Markup Language visual complexity web design web history Websites |
title | Utilizing HTML‐analysis and computer vision on a corpus of website screenshots to investigate design developments on the web |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T01%3A56%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Utilizing%20HTML%E2%80%90analysis%20and%20computer%20vision%20on%20a%20corpus%20of%20website%20screenshots%20to%20investigate%20design%20developments%20on%20the%20web&rft.jtitle=Proceedings%20of%20the%20ASIST%20Annual%20Meeting&rft.au=Schmidt,%20Thomas&rft.date=2020-10-01&rft.volume=57&rft.issue=1&rft.epage=n/a&rft.issn=2373-9231&rft.eissn=2373-9231&rft_id=info:doi/10.1002/pra2.392&rft_dat=%3Cproquest_cross%3E2453544056%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2453544056&rft_id=info:pmid/&rfr_iscdi=true |