SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content

We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classificati...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Gandhi, Apurva, Serrao, Ryan, Fang, Biyi, Antonius, Gilbert, Hong, Jenna, Nguyen, Tra My, Yi, Sheng, Nosakhare, Ehi, Shaffer, Irene, Srinivasan, Soundararajan, Gupta, Vivek
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Gandhi, Apurva
Serrao, Ryan
Fang, Biyi
Antonius, Gilbert
Hong, Jenna
Nguyen, Tra My
Yi, Sheng
Nosakhare, Ehi
Shaffer, Irene
Srinivasan, Soundararajan
Gupta, Vivek
description We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence segmentation followed by classification model) approach, achieving a task F1 score of 84.4%, a sentence segmentation (boundary similarity) score of 88.4% and three times lower latency compared to the baseline. Furthermore, we provide insights into tackling challenges of performing NLP on the inking domain. We release both our code and dataset for this novel task.
doi_str_mv 10.48550/arxiv.2211.04454
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2211_04454</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2211_04454</sourcerecordid><originalsourceid>FETCH-LOGICAL-a674-a8e8477a527cfcc59600f2dfb37ec72019a48654d1c0eb3aeb3207b01f3eec1e3</originalsourceid><addsrcrecordid>eNotz7FOwzAUBVAvDKjwAUy8H0iwHTtO2aIohUqRGOo9enGeIWrjBDeg8veU0uHqDle60mHsQfBUFVrzJ4yn4TuVUoiUK6XVLbO7prT1M5Swo88vCo6gwY4OQ3iHcp7jhO4D_BTB4nEP9WmJ6JZhCuDjNMImEiXndYRt2FMP1RQWCssdu_F4ONL9tVfMbmpbvSbN28u2KpsEc6MSLKhQxqCWxnnn9Drn3Mved5khZyQXa1RFrlUvHKcuw3MkNx0XPiNygrIVe_y_vbDaOQ4jxp_2j9deeNkviH5J_A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content</title><source>arXiv.org</source><creator>Gandhi, Apurva ; Serrao, Ryan ; Fang, Biyi ; Antonius, Gilbert ; Hong, Jenna ; Nguyen, Tra My ; Yi, Sheng ; Nosakhare, Ehi ; Shaffer, Irene ; Srinivasan, Soundararajan ; Gupta, Vivek</creator><creatorcontrib>Gandhi, Apurva ; Serrao, Ryan ; Fang, Biyi ; Antonius, Gilbert ; Hong, Jenna ; Nguyen, Tra My ; Yi, Sheng ; Nosakhare, Ehi ; Shaffer, Irene ; Srinivasan, Soundararajan ; Gupta, Vivek</creatorcontrib><description>We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence segmentation followed by classification model) approach, achieving a task F1 score of 84.4%, a sentence segmentation (boundary similarity) score of 88.4% and three times lower latency compared to the baseline. Furthermore, we provide insights into tackling challenges of performing NLP on the inking domain. We release both our code and dataset for this novel task.</description><identifier>DOI: 10.48550/arxiv.2211.04454</identifier><language>eng</language><subject>Computer Science - Computation and Language ; Computer Science - Learning</subject><creationdate>2022-11</creationdate><rights>http://creativecommons.org/licenses/by-nc-nd/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2211.04454$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2211.04454$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Gandhi, Apurva</creatorcontrib><creatorcontrib>Serrao, Ryan</creatorcontrib><creatorcontrib>Fang, Biyi</creatorcontrib><creatorcontrib>Antonius, Gilbert</creatorcontrib><creatorcontrib>Hong, Jenna</creatorcontrib><creatorcontrib>Nguyen, Tra My</creatorcontrib><creatorcontrib>Yi, Sheng</creatorcontrib><creatorcontrib>Nosakhare, Ehi</creatorcontrib><creatorcontrib>Shaffer, Irene</creatorcontrib><creatorcontrib>Srinivasan, Soundararajan</creatorcontrib><creatorcontrib>Gupta, Vivek</creatorcontrib><title>SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content</title><description>We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence segmentation followed by classification model) approach, achieving a task F1 score of 84.4%, a sentence segmentation (boundary similarity) score of 88.4% and three times lower latency compared to the baseline. Furthermore, we provide insights into tackling challenges of performing NLP on the inking domain. We release both our code and dataset for this novel task.</description><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz7FOwzAUBVAvDKjwAUy8H0iwHTtO2aIohUqRGOo9enGeIWrjBDeg8veU0uHqDle60mHsQfBUFVrzJ4yn4TuVUoiUK6XVLbO7prT1M5Swo88vCo6gwY4OQ3iHcp7jhO4D_BTB4nEP9WmJ6JZhCuDjNMImEiXndYRt2FMP1RQWCssdu_F4ONL9tVfMbmpbvSbN28u2KpsEc6MSLKhQxqCWxnnn9Drn3Mved5khZyQXa1RFrlUvHKcuw3MkNx0XPiNygrIVe_y_vbDaOQ4jxp_2j9deeNkviH5J_A</recordid><startdate>20221108</startdate><enddate>20221108</enddate><creator>Gandhi, Apurva</creator><creator>Serrao, Ryan</creator><creator>Fang, Biyi</creator><creator>Antonius, Gilbert</creator><creator>Hong, Jenna</creator><creator>Nguyen, Tra My</creator><creator>Yi, Sheng</creator><creator>Nosakhare, Ehi</creator><creator>Shaffer, Irene</creator><creator>Srinivasan, Soundararajan</creator><creator>Gupta, Vivek</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20221108</creationdate><title>SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content</title><author>Gandhi, Apurva ; Serrao, Ryan ; Fang, Biyi ; Antonius, Gilbert ; Hong, Jenna ; Nguyen, Tra My ; Yi, Sheng ; Nosakhare, Ehi ; Shaffer, Irene ; Srinivasan, Soundararajan ; Gupta, Vivek</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a674-a8e8477a527cfcc59600f2dfb37ec72019a48654d1c0eb3aeb3207b01f3eec1e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Gandhi, Apurva</creatorcontrib><creatorcontrib>Serrao, Ryan</creatorcontrib><creatorcontrib>Fang, Biyi</creatorcontrib><creatorcontrib>Antonius, Gilbert</creatorcontrib><creatorcontrib>Hong, Jenna</creatorcontrib><creatorcontrib>Nguyen, Tra My</creatorcontrib><creatorcontrib>Yi, Sheng</creatorcontrib><creatorcontrib>Nosakhare, Ehi</creatorcontrib><creatorcontrib>Shaffer, Irene</creatorcontrib><creatorcontrib>Srinivasan, Soundararajan</creatorcontrib><creatorcontrib>Gupta, Vivek</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gandhi, Apurva</au><au>Serrao, Ryan</au><au>Fang, Biyi</au><au>Antonius, Gilbert</au><au>Hong, Jenna</au><au>Nguyen, Tra My</au><au>Yi, Sheng</au><au>Nosakhare, Ehi</au><au>Shaffer, Irene</au><au>Srinivasan, Soundararajan</au><au>Gupta, Vivek</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content</atitle><date>2022-11-08</date><risdate>2022</risdate><abstract>We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence segmentation followed by classification model) approach, achieving a task F1 score of 84.4%, a sentence segmentation (boundary similarity) score of 88.4% and three times lower latency compared to the baseline. Furthermore, we provide insights into tackling challenges of performing NLP on the inking domain. We release both our code and dataset for this novel task.</abstract><doi>10.48550/arxiv.2211.04454</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2211.04454
ispartof
issn
language eng
recordid cdi_arxiv_primary_2211_04454
source arXiv.org
subjects Computer Science - Computation and Language
Computer Science - Learning
title SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T17%3A21%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SLATE:%20A%20Sequence%20Labeling%20Approach%20for%20Task%20Extraction%20from%20Free-form%20Inked%20Content&rft.au=Gandhi,%20Apurva&rft.date=2022-11-08&rft_id=info:doi/10.48550/arxiv.2211.04454&rft_dat=%3Carxiv_GOX%3E2211_04454%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true