From BERT to GPT-3 codex: harnessing the potential of very large language models for data management

Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the VLDB Endowment 2022-08, Vol.15 (12), p.3770-3773
1. Verfasser: Trummer, Immanuel
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3773
container_issue 12
container_start_page 3770
container_title Proceedings of the VLDB Endowment
container_volume 15
creator Trummer, Immanuel
description Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context of data management. The tutorial will introduce participants to basic background on language models, discuss different methods to use language models, and give an overview and short demonstration of available libraries and APIs. Models for generating natural language will be considered as well as models, such as GPT-3 Codex, which complete program code or generate code from natural language instructions. Finally, the tutorial will discuss recent research in the database community that exploits language models in the context of traditional database systems or proposes novel system architectures that are based on them. The tutorial is targeted at database researchers. No prior background on language models is required. The goal of the tutorial is to introduce database researchers to the latest generation of language models, and to their use cases in the domain of data management.
doi_str_mv 10.14778/3554821.3554896
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_14778_3554821_3554896</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_14778_3554821_3554896</sourcerecordid><originalsourceid>FETCH-LOGICAL-c196t-23bf2a46c1e7df891359172b220f2a6baece0acaee65206c21704508646671683</originalsourceid><addsrcrecordid>eNpNj8FKAzEURYNYsLbuu5wfSPveS_KSLLW0VSgoMq5DJs2AYhlJutC_V-osujqXuzhwhFggLFFb61bKGO0Il2d6vhJTQgPSgbfXF_tG3Nb6AcCO0U3FYluGY_OweW2b09DsXlqpmjQc8vdcTPr4WfPdyJl4227a9aPcP--e1vd7mdDzSZLqeoqaE2Z76J1HZTxa6ojg7-cu5pQhppgzGwJOhBa0Acea2SI7NRPw701lqLXkPnyV92MsPwEhnMvCWBbGMvULEhM8xA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>From BERT to GPT-3 codex: harnessing the potential of very large language models for data management</title><source>ACM Digital Library Complete</source><creator>Trummer, Immanuel</creator><creatorcontrib>Trummer, Immanuel</creatorcontrib><description>Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context of data management. The tutorial will introduce participants to basic background on language models, discuss different methods to use language models, and give an overview and short demonstration of available libraries and APIs. Models for generating natural language will be considered as well as models, such as GPT-3 Codex, which complete program code or generate code from natural language instructions. Finally, the tutorial will discuss recent research in the database community that exploits language models in the context of traditional database systems or proposes novel system architectures that are based on them. The tutorial is targeted at database researchers. No prior background on language models is required. The goal of the tutorial is to introduce database researchers to the latest generation of language models, and to their use cases in the domain of data management.</description><identifier>ISSN: 2150-8097</identifier><identifier>EISSN: 2150-8097</identifier><identifier>DOI: 10.14778/3554821.3554896</identifier><language>eng</language><ispartof>Proceedings of the VLDB Endowment, 2022-08, Vol.15 (12), p.3770-3773</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c196t-23bf2a46c1e7df891359172b220f2a6baece0acaee65206c21704508646671683</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,778,782,27911,27912</link.rule.ids></links><search><creatorcontrib>Trummer, Immanuel</creatorcontrib><title>From BERT to GPT-3 codex: harnessing the potential of very large language models for data management</title><title>Proceedings of the VLDB Endowment</title><description>Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context of data management. The tutorial will introduce participants to basic background on language models, discuss different methods to use language models, and give an overview and short demonstration of available libraries and APIs. Models for generating natural language will be considered as well as models, such as GPT-3 Codex, which complete program code or generate code from natural language instructions. Finally, the tutorial will discuss recent research in the database community that exploits language models in the context of traditional database systems or proposes novel system architectures that are based on them. The tutorial is targeted at database researchers. No prior background on language models is required. The goal of the tutorial is to introduce database researchers to the latest generation of language models, and to their use cases in the domain of data management.</description><issn>2150-8097</issn><issn>2150-8097</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNpNj8FKAzEURYNYsLbuu5wfSPveS_KSLLW0VSgoMq5DJs2AYhlJutC_V-osujqXuzhwhFggLFFb61bKGO0Il2d6vhJTQgPSgbfXF_tG3Nb6AcCO0U3FYluGY_OweW2b09DsXlqpmjQc8vdcTPr4WfPdyJl4227a9aPcP--e1vd7mdDzSZLqeoqaE2Z76J1HZTxa6ojg7-cu5pQhppgzGwJOhBa0Acea2SI7NRPw701lqLXkPnyV92MsPwEhnMvCWBbGMvULEhM8xA</recordid><startdate>20220801</startdate><enddate>20220801</enddate><creator>Trummer, Immanuel</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20220801</creationdate><title>From BERT to GPT-3 codex</title><author>Trummer, Immanuel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c196t-23bf2a46c1e7df891359172b220f2a6baece0acaee65206c21704508646671683</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Trummer, Immanuel</creatorcontrib><collection>CrossRef</collection><jtitle>Proceedings of the VLDB Endowment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Trummer, Immanuel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>From BERT to GPT-3 codex: harnessing the potential of very large language models for data management</atitle><jtitle>Proceedings of the VLDB Endowment</jtitle><date>2022-08-01</date><risdate>2022</risdate><volume>15</volume><issue>12</issue><spage>3770</spage><epage>3773</epage><pages>3770-3773</pages><issn>2150-8097</issn><eissn>2150-8097</eissn><abstract>Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context of data management. The tutorial will introduce participants to basic background on language models, discuss different methods to use language models, and give an overview and short demonstration of available libraries and APIs. Models for generating natural language will be considered as well as models, such as GPT-3 Codex, which complete program code or generate code from natural language instructions. Finally, the tutorial will discuss recent research in the database community that exploits language models in the context of traditional database systems or proposes novel system architectures that are based on them. The tutorial is targeted at database researchers. No prior background on language models is required. The goal of the tutorial is to introduce database researchers to the latest generation of language models, and to their use cases in the domain of data management.</abstract><doi>10.14778/3554821.3554896</doi><tpages>4</tpages></addata></record>
fulltext fulltext
identifier ISSN: 2150-8097
ispartof Proceedings of the VLDB Endowment, 2022-08, Vol.15 (12), p.3770-3773
issn 2150-8097
2150-8097
language eng
recordid cdi_crossref_primary_10_14778_3554821_3554896
source ACM Digital Library Complete
title From BERT to GPT-3 codex: harnessing the potential of very large language models for data management
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T16%3A07%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=From%20BERT%20to%20GPT-3%20codex:%20harnessing%20the%20potential%20of%20very%20large%20language%20models%20for%20data%20management&rft.jtitle=Proceedings%20of%20the%20VLDB%20Endowment&rft.au=Trummer,%20Immanuel&rft.date=2022-08-01&rft.volume=15&rft.issue=12&rft.spage=3770&rft.epage=3773&rft.pages=3770-3773&rft.issn=2150-8097&rft.eissn=2150-8097&rft_id=info:doi/10.14778/3554821.3554896&rft_dat=%3Ccrossref%3E10_14778_3554821_3554896%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true