CX DB8: A queryable extractive summarizer and semantic search engine

Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be perfor...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Roush, Allen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Roush, Allen
description Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8
doi_str_mv 10.48550/arxiv.2012.03942
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2012_03942</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2012_03942</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-d5cb3d2f7be83e81271aadc1d5381e13dc4e39f27fc63b2a2751cc0947c202393</originalsourceid><addsrcrecordid>eNotz71Ow0AQBOBrKFDgAai4F7C52_XlbLrg8CdFoklBZ6331nBSbMHZiRKenhBSzVSj-ZS6sSYvSufMHaV93OVgLOQGqwIu1bJ-18uH8l4v9PdW0oHajWjZT4l4ijvR47bvKcUfSZqGoEfpaZgiHwsl_tQyfMRBrtRFR5tRrs85U-unx3X9kq3enl_rxSqjuYcsOG4xQOdbKVFKC94SBbbBYWnFYuBCsOrAdzzHFgi8s8ymKjyDAaxwpm7_Z0-M5ivF47VD88dpThz8Bdy2RJg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>CX DB8: A queryable extractive summarizer and semantic search engine</title><source>arXiv.org</source><creator>Roush, Allen</creator><creatorcontrib>Roush, Allen</creatorcontrib><description>Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8</description><identifier>DOI: 10.48550/arxiv.2012.03942</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2020-12</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2012.03942$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2012.03942$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Roush, Allen</creatorcontrib><title>CX DB8: A queryable extractive summarizer and semantic search engine</title><description>Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71Ow0AQBOBrKFDgAai4F7C52_XlbLrg8CdFoklBZ6331nBSbMHZiRKenhBSzVSj-ZS6sSYvSufMHaV93OVgLOQGqwIu1bJ-18uH8l4v9PdW0oHajWjZT4l4ijvR47bvKcUfSZqGoEfpaZgiHwsl_tQyfMRBrtRFR5tRrs85U-unx3X9kq3enl_rxSqjuYcsOG4xQOdbKVFKC94SBbbBYWnFYuBCsOrAdzzHFgi8s8ymKjyDAaxwpm7_Z0-M5ivF47VD88dpThz8Bdy2RJg</recordid><startdate>20201207</startdate><enddate>20201207</enddate><creator>Roush, Allen</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20201207</creationdate><title>CX DB8: A queryable extractive summarizer and semantic search engine</title><author>Roush, Allen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-d5cb3d2f7be83e81271aadc1d5381e13dc4e39f27fc63b2a2751cc0947c202393</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Roush, Allen</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Roush, Allen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>CX DB8: A queryable extractive summarizer and semantic search engine</atitle><date>2020-12-07</date><risdate>2020</risdate><abstract>Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8</abstract><doi>10.48550/arxiv.2012.03942</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2012.03942
ispartof
issn
language eng
recordid cdi_arxiv_primary_2012_03942
source arXiv.org
subjects Computer Science - Computation and Language
title CX DB8: A queryable extractive summarizer and semantic search engine
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T09%3A49%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=CX%20DB8:%20A%20queryable%20extractive%20summarizer%20and%20semantic%20search%20engine&rft.au=Roush,%20Allen&rft.date=2020-12-07&rft_id=info:doi/10.48550/arxiv.2012.03942&rft_dat=%3Carxiv_GOX%3E2012_03942%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true