CELI: Controller-Embedded Language Model Interactions
We introduce Controller-Embedded Language Model Interactions (CELI), a framework that integrates control logic directly within language model (LM) prompts, facilitating complex, multi-stage task execution. CELI addresses limitations of existing prompt engineering and workflow optimization techniques...
Main authors: | Wagner, Jan-Samuel; DeCaprio, Dave; Raja, Abishek Chiffon Muthu; Holman, Jonathan M; Brady, Lauren K; Cheung, Sky C; Barzekar, Hosein; Yang, Eric; MartinezII, Mark Anthony; Soong, David; Sridhar, Sriram; Si, Han; Higgs, Brandon W; Hamadeh, Hisham; Ogden, Scott |
---|---|
Format: | Article |
Language: | eng |

container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Wagner, Jan-Samuel; DeCaprio, Dave; Raja, Abishek Chiffon Muthu; Holman, Jonathan M; Brady, Lauren K; Cheung, Sky C; Barzekar, Hosein; Yang, Eric; MartinezII, Mark Anthony; Soong, David; Sridhar, Sriram; Si, Han; Higgs, Brandon W; Hamadeh, Hisham; Ogden, Scott |
description | We introduce Controller-Embedded Language Model Interactions (CELI), a
framework that integrates control logic directly within language model (LM)
prompts, facilitating complex, multi-stage task execution. CELI addresses
limitations of existing prompt engineering and workflow optimization techniques
by embedding control logic directly within the operational context of language
models, enabling dynamic adaptation to evolving task requirements. Our
framework transfers control from the traditional programming execution
environment to the LMs, allowing them to autonomously manage computational
workflows while maintaining seamless interaction with external systems and
functions. CELI supports arbitrary function calls with variable arguments,
bridging the gap between LMs' adaptive reasoning capabilities and conventional
software paradigms' structured control mechanisms. To evaluate CELI's
versatility and effectiveness, we conducted case studies in two distinct
domains: code generation (HumanEval benchmark) and multi-stage content
generation (Wikipedia-style articles). The results demonstrate notable
performance improvements across a range of domains. CELI achieved a 4.9
percentage point improvement over the best reported score of the baseline GPT-4
model on the HumanEval code generation benchmark. In multi-stage content
generation, 94.4% of CELI-produced Wikipedia-style articles met or exceeded
first draft quality when optimally configured, with 44.4% achieving high
quality. These outcomes underscore CELI's potential for optimizing AI-driven
workflows across diverse computational domains. |
doi_str_mv | 10.48550/arxiv.2410.14627 |
format | Article |
backlink | https://doi.org/10.48550/arXiv.2410.14627 |
collection | arXiv Computer Science; arXiv.org |
date | 2024-10-18 |
linktorsrc | https://arxiv.org/abs/2410.14627 |
oa | free_for_read |
rights | http://creativecommons.org/licenses/by/4.0 |
sourcetype | Open Access Repository |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2410.14627 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2410_14627 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence; Computer Science - Computation and Language; Computer Science - Software Engineering |
title | CELI: Controller-Embedded Language Model Interactions |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T01%3A48%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=CELI:%20Controller-Embedded%20Language%20Model%20Interactions&rft.au=Wagner,%20Jan-Samuel&rft.date=2024-10-18&rft_id=info:doi/10.48550/arxiv.2410.14627&rft_dat=%3Carxiv_GOX%3E2410_14627%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |