From BERT to GPT-3 Codex: harnessing the potential of very large language models for data management

Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context...

Bibliographic details
Published in:	Proceedings of the VLDB Endowment 2022-08, Vol.15 (12), p.3770-3773
Main author:	Trummer, Immanuel
Format:	Article
Language:	eng
Online access:	Full text

container_end_page	3773
container_issue	12
container_start_page	3770
container_title	Proceedings of the VLDB Endowment
container_volume	15
creator	Trummer, Immanuel
description	Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context of data management. The tutorial will introduce participants to basic background on language models, discuss different methods to use language models, and give an overview and short demonstration of available libraries and APIs. Models for generating natural language will be considered as well as models, such as GPT-3 Codex, which complete program code or generate code from natural language instructions. Finally, the tutorial will discuss recent research in the database community that exploits language models in the context of traditional database systems or proposes novel system architectures that are based on them. The tutorial is targeted at database researchers. No prior background on language models is required. The goal of the tutorial is to introduce database researchers to the latest generation of language models, and to their use cases in the domain of data management.
doi_str_mv	10.14778/3554821.3554896
format	Article
fulltext	fulltext
identifier	ISSN: 2150-8097
ispartof	Proceedings of the VLDB Endowment, 2022-08, Vol.15 (12), p.3770-3773
issn	2150-8097 2150-8097
language	eng
recordid	cdi_crossref_primary_10_14778_3554821_3554896
source	ACM Digital Library Complete
title	From BERT to GPT-3 Codex: harnessing the potential of very large language models for data management
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T16%3A07%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=From%20BERT%20to%20GPT-3%20codex:%20harnessing%20the%20potential%20of%20very%20large%20language%20models%20for%20data%20management&rft.jtitle=Proceedings%20of%20the%20VLDB%20Endowment&rft.au=Trummer,%20Immanuel&rft.date=2022-08-01&rft.volume=15&rft.issue=12&rft.spage=3770&rft.epage=3773&rft.pages=3770-3773&rft.issn=2150-8097&rft.eissn=2150-8097&rft_id=info:doi/10.14778/3554821.3554896&rft_dat=%3Ccrossref%3E10_14778_3554821_3554896%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true