AQuA: Automated Question-Answering in Software Tutorial Videos with Visual Anchors

Tutorial videos are a popular help source for learning feature-rich software. However, getting quick answers to questions about tutorial videos is difficult. We present an automated approach for responding to tutorial questions. By analyzing 633 questions found in 5,944 video comments, we identified...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2024-03
Hauptverfasser:	Yang, Saelyne, Vermeulen, Jo, Fitzmaurice, George, Matejka, Justin
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Human-Computer Interaction Questions Software Video
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Yang, Saelyne Vermeulen, Jo Fitzmaurice, George Matejka, Justin
description	Tutorial videos are a popular help source for learning feature-rich software. However, getting quick answers to questions about tutorial videos is difficult. We present an automated approach for responding to tutorial questions. By analyzing 633 questions found in 5,944 video comments, we identified different question types and observed that users frequently described parts of the video in questions. We then asked participants (N=24) to watch tutorial videos and ask questions while annotating the video with relevant visual anchors. Most visual anchors referred to UI elements and the application workspace. Based on these insights, we built AQuA, a pipeline that generates useful answers to questions with visual anchors. We demonstrate this for Fusion 360, showing that we can recognize UI elements in visual anchors and generate answers using GPT-4 augmented with that visual information and software documentation. An evaluation study (N=16) demonstrates that our approach provides better answers than baseline methods.
doi_str_mv	10.48550/arxiv.2403.05213
format	Article
fullrecord	<record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2403_05213</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2955062663</sourcerecordid><originalsourceid>FETCH-LOGICAL-a523-ff1d3868cdf8a2411e2289caa6f957400ff09c6cf09f69d9b41d3c422379021b3</originalsourceid><addsrcrecordid>eNotkFFLwzAQx4MgOOY-gE8GfG5NLk3a-FaGOmEg0-JrydrEZWzNTFqr395s8-Xu-PPjuN8hdENJmhWck3vlf-x3ChlhKeFA2QWaAGM0KTKAKzQLYUsIAZED52yC3srVUD7gcujdXvW6xatBh966Lim7MGpvu09sO_zuTD8qr3EVQW_VDn_YVruAR9tv4hyGGJVds3E-XKNLo3ZBz_77FFVPj9V8kSxfn1_m5TJRHFhiDG1ZIYqmNYWCjFINUMhGKWEkzzNCjCGyEU2sRshWrrPIN9GB5ZIAXbMpuj2vPQnXB2_3yv_WR_H6JB6JuzNx8O7rqFVv3eC7eFMNMr5KgBCM_QHkc1rV</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2955062663</pqid></control><display><type>article</type><title>AQuA: Automated Question-Answering in Software Tutorial Videos with Visual Anchors</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Yang, Saelyne ; Vermeulen, Jo ; Fitzmaurice, George ; Matejka, Justin</creator><creatorcontrib>Yang, Saelyne ; Vermeulen, Jo ; Fitzmaurice, George ; Matejka, Justin</creatorcontrib><description>Tutorial videos are a popular help source for learning feature-rich software. However, getting quick answers to questions about tutorial videos is difficult. We present an automated approach for responding to tutorial questions. By analyzing 633 questions found in 5,944 video comments, we identified different question types and observed that users frequently described parts of the video in questions. We then asked participants (N=24) to watch tutorial videos and ask questions while annotating the video with relevant visual anchors. Most visual anchors referred to UI elements and the application workspace. Based on these insights, we built AQuA, a pipeline that generates useful answers to questions with visual anchors. We demonstrate this for Fusion 360, showing that we can recognize UI elements in visual anchors and generate answers using GPT-4 augmented with that visual information and software documentation. An evaluation study (N=16) demonstrates that our approach provides better answers than baseline methods.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2403.05213</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computer Science - Human-Computer Interaction ; Questions ; Software ; Video</subject><ispartof>arXiv.org, 2024-03</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27924</link.rule.ids><backlink>$$Uhttps://doi.org/10.1145/3613904.3642752$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2403.05213$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Yang, Saelyne</creatorcontrib><creatorcontrib>Vermeulen, Jo</creatorcontrib><creatorcontrib>Fitzmaurice, George</creatorcontrib><creatorcontrib>Matejka, Justin</creatorcontrib><title>AQuA: Automated Question-Answering in Software Tutorial Videos with Visual Anchors</title><title>arXiv.org</title><description>Tutorial videos are a popular help source for learning feature-rich software. However, getting quick answers to questions about tutorial videos is difficult. We present an automated approach for responding to tutorial questions. By analyzing 633 questions found in 5,944 video comments, we identified different question types and observed that users frequently described parts of the video in questions. We then asked participants (N=24) to watch tutorial videos and ask questions while annotating the video with relevant visual anchors. Most visual anchors referred to UI elements and the application workspace. Based on these insights, we built AQuA, a pipeline that generates useful answers to questions with visual anchors. We demonstrate this for Fusion 360, showing that we can recognize UI elements in visual anchors and generate answers using GPT-4 augmented with that visual information and software documentation. An evaluation study (N=16) demonstrates that our approach provides better answers than baseline methods.</description><subject>Computer Science - Human-Computer Interaction</subject><subject>Questions</subject><subject>Software</subject><subject>Video</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotkFFLwzAQx4MgOOY-gE8GfG5NLk3a-FaGOmEg0-JrydrEZWzNTFqr395s8-Xu-PPjuN8hdENJmhWck3vlf-x3ChlhKeFA2QWaAGM0KTKAKzQLYUsIAZED52yC3srVUD7gcujdXvW6xatBh966Lim7MGpvu09sO_zuTD8qr3EVQW_VDn_YVruAR9tv4hyGGJVds3E-XKNLo3ZBz_77FFVPj9V8kSxfn1_m5TJRHFhiDG1ZIYqmNYWCjFINUMhGKWEkzzNCjCGyEU2sRshWrrPIN9GB5ZIAXbMpuj2vPQnXB2_3yv_WR_H6JB6JuzNx8O7rqFVv3eC7eFMNMr5KgBCM_QHkc1rV</recordid><startdate>20240308</startdate><enddate>20240308</enddate><creator>Yang, Saelyne</creator><creator>Vermeulen, Jo</creator><creator>Fitzmaurice, George</creator><creator>Matejka, Justin</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240308</creationdate><title>AQuA: Automated Question-Answering in Software Tutorial Videos with Visual Anchors</title><author>Yang, Saelyne ; Vermeulen, Jo ; Fitzmaurice, George ; Matejka, Justin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a523-ff1d3868cdf8a2411e2289caa6f957400ff09c6cf09f69d9b41d3c422379021b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Human-Computer Interaction</topic><topic>Questions</topic><topic>Software</topic><topic>Video</topic><toplevel>online_resources</toplevel><creatorcontrib>Yang, Saelyne</creatorcontrib><creatorcontrib>Vermeulen, Jo</creatorcontrib><creatorcontrib>Fitzmaurice, George</creatorcontrib><creatorcontrib>Matejka, Justin</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yang, Saelyne</au><au>Vermeulen, Jo</au><au>Fitzmaurice, George</au><au>Matejka, Justin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>AQuA: Automated Question-Answering in Software Tutorial Videos with Visual Anchors</atitle><jtitle>arXiv.org</jtitle><date>2024-03-08</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Tutorial videos are a popular help source for learning feature-rich software. However, getting quick answers to questions about tutorial videos is difficult. We present an automated approach for responding to tutorial questions. By analyzing 633 questions found in 5,944 video comments, we identified different question types and observed that users frequently described parts of the video in questions. We then asked participants (N=24) to watch tutorial videos and ask questions while annotating the video with relevant visual anchors. Most visual anchors referred to UI elements and the application workspace. Based on these insights, we built AQuA, a pipeline that generates useful answers to questions with visual anchors. We demonstrate this for Fusion 360, showing that we can recognize UI elements in visual anchors and generate answers using GPT-4 augmented with that visual information and software documentation. An evaluation study (N=16) demonstrates that our approach provides better answers than baseline methods.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2403.05213</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-03
issn	2331-8422
language	eng
recordid	cdi_arxiv_primary_2403_05213
source	arXiv.org; Free E- Journals
subjects	Computer Science - Human-Computer Interaction Questions Software Video
title	AQuA: Automated Question-Answering in Software Tutorial Videos with Visual Anchors
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T06%3A54%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=AQuA:%20Automated%20Question-Answering%20in%20Software%20Tutorial%20Videos%20with%20Visual%20Anchors&rft.jtitle=arXiv.org&rft.au=Yang,%20Saelyne&rft.date=2024-03-08&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2403.05213&rft_dat=%3Cproquest_arxiv%3E2955062663%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2955062663&rft_id=info:pmid/&rfr_iscdi=true