CX DB8: A queryable extractive summarizer and semantic search engine

Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be perfor...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	Roush, Allen
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computation and Language
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Roush, Allen
description	Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8
doi_str_mv	10.48550/arxiv.2012.03942
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2012_03942</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2012_03942</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-d5cb3d2f7be83e81271aadc1d5381e13dc4e39f27fc63b2a2751cc0947c202393</originalsourceid><addsrcrecordid>eNotz71Ow0AQBOBrKFDgAai4F7C52_XlbLrg8CdFoklBZ6331nBSbMHZiRKenhBSzVSj-ZS6sSYvSufMHaV93OVgLOQGqwIu1bJ-18uH8l4v9PdW0oHajWjZT4l4ijvR47bvKcUfSZqGoEfpaZgiHwsl_tQyfMRBrtRFR5tRrs85U-unx3X9kq3enl_rxSqjuYcsOG4xQOdbKVFKC94SBbbBYWnFYuBCsOrAdzzHFgi8s8ymKjyDAaxwpm7_Z0-M5ivF47VD88dpThz8Bdy2RJg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>CX DB8: A queryable extractive summarizer and semantic search engine</title><source>arXiv.org</source><creator>Roush, Allen</creator><creatorcontrib>Roush, Allen</creatorcontrib><description>Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8</description><identifier>DOI: 10.48550/arxiv.2012.03942</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2020-12</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2012.03942$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2012.03942$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Roush, Allen</creatorcontrib><title>CX DB8: A queryable extractive summarizer and semantic search engine</title><description>Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71Ow0AQBOBrKFDgAai4F7C52_XlbLrg8CdFoklBZ6331nBSbMHZiRKenhBSzVSj-ZS6sSYvSufMHaV93OVgLOQGqwIu1bJ-18uH8l4v9PdW0oHajWjZT4l4ijvR47bvKcUfSZqGoEfpaZgiHwsl_tQyfMRBrtRFR5tRrs85U-unx3X9kq3enl_rxSqjuYcsOG4xQOdbKVFKC94SBbbBYWnFYuBCsOrAdzzHFgi8s8ymKjyDAaxwpm7_Z0-M5ivF47VD88dpThz8Bdy2RJg</recordid><startdate>20201207</startdate><enddate>20201207</enddate><creator>Roush, Allen</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20201207</creationdate><title>CX DB8: A queryable extractive summarizer and semantic search engine</title><author>Roush, Allen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-d5cb3d2f7be83e81271aadc1d5381e13dc4e39f27fc63b2a2751cc0947c202393</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Roush, Allen</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Roush, Allen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>CX DB8: A queryable extractive summarizer and semantic search engine</atitle><date>2020-12-07</date><risdate>2020</risdate><abstract>Competitive Debate's increasingly technical nature has left competitors looking for tools to accelerate evidence production. We find that the unique type of extractive summarization performed by competitive debaters - summarization with a bias towards a particular target meaning - can be performed using the latest innovations in unsupervised pre-trained text vectorization models. We introduce CX_DB8, a queryable word-level extractive summarizer and evidence creation framework, which allows for rapid, biasable summarization of arbitarily sized texts. CX_DB8s usage of the embedding framework Flair means that as the underlying models improve, CX_DB8 will also improve. We observe that CX_DB8 also functions as a semantic search engine, and has application as a supplement to traditional "find" functionality in programs and webpages. CX_DB8 is currently used by competitive debaters and is made available to the public at https://github.com/Hellisotherpeople/CX_DB8</abstract><doi>10.48550/arxiv.2012.03942</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2012.03942
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2012_03942
source	arXiv.org
subjects	Computer Science - Computation and Language
title	CX DB8: A queryable extractive summarizer and semantic search engine
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T09%3A49%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=CX%20DB8:%20A%20queryable%20extractive%20summarizer%20and%20semantic%20search%20engine&rft.au=Roush,%20Allen&rft.date=2020-12-07&rft_id=info:doi/10.48550/arxiv.2012.03942&rft_dat=%3Carxiv_GOX%3E2012_03942%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true