CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology
Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as aut...
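Published in: | arXiv.org 2024-09 |
---|---|
Main authors: | Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka |
Format: | Article |
Language: | eng |
Subjects: | Automation ; Benchmarks ; Large language models ; Multiagent systems ; Performance evaluation ; Scale models ; Software ; Software development ; Software engineering ; Systems design ; Task complexity |
Online access: | Full text |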
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka |
description | Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. To assess the proposed system's performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluating the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, we conclude that the proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2922281751</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2922281751</sourcerecordid><originalsourceid>FETCH-proquest_journals_29222817513</originalsourceid><addsrcrecordid>eNqNiksKwjAUAIMgKOodAq4L7au11Z34wYWC0LoUCfU1tqR5mo_i7VXwAK4GZqbD-hDHUZBNAHpsZG0ThiFMU0iSuM9OS7rggUw95zthJAZ5KRTy_GUdtrwiwxfekaaWvOU5Ve4pDPIVPlDRrUXt-NHWWvK9V64OFvJrCiyvmhTJ15B1K6Esjn4csPFmXSy3wc3Q3aN154a80Z90hhkAZFGaRPF_1xvw80O8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2922281751</pqid></control><display><type>article</type><title>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</title><source>Free E- Journals</source><creator>Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka</creator><creatorcontrib>Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka</creatorcontrib><description>Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. 
Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. To assess the proposed system performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluated the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, our conclusion is that proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Automation ; Benchmarks ; Large language models ; Multiagent systems ; Performance evaluation ; Scale models ; Software ; Software development ; Software engineering ; Systems design ; Task complexity</subject><ispartof>arXiv.org, 2024-09</ispartof><rights>2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Rasheed, Zeeshan</creatorcontrib><creatorcontrib>Malik, Abdul Sami</creatorcontrib><creatorcontrib>Kai-Kristian Kemell</creatorcontrib><creatorcontrib>Waseem, Muhammad</creatorcontrib><creatorcontrib>Saari, Mika</creatorcontrib><creatorcontrib>Systä, Kari</creatorcontrib><creatorcontrib>Abrahamsson, Pekka</creatorcontrib><title>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</title><title>arXiv.org</title><description>Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. 
To assess the proposed system performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluated the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, our conclusion is that proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori.</description><subject>Automation</subject><subject>Benchmarks</subject><subject>Large language models</subject><subject>Multiagent systems</subject><subject>Performance evaluation</subject><subject>Scale models</subject><subject>Software</subject><subject>Software development</subject><subject>Software engineering</subject><subject>Systems design</subject><subject>Task complexity</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNiksKwjAUAIMgKOodAq4L7au11Z34wYWC0LoUCfU1tqR5mo_i7VXwAK4GZqbD-hDHUZBNAHpsZG0ThiFMU0iSuM9OS7rggUw95zthJAZ5KRTy_GUdtrwiwxfekaaWvOU5Ve4pDPIVPlDRrUXt-NHWWvK9V64OFvJrCiyvmhTJ15B1K6Esjn4csPFmXSy3wc3Q3aN154a80Z90hhkAZFGaRPF_1xvw80O8</recordid><startdate>20240917</startdate><enddate>20240917</enddate><creator>Rasheed, Zeeshan</creator><creator>Malik, Abdul 
Sami</creator><creator>Kai-Kristian Kemell</creator><creator>Waseem, Muhammad</creator><creator>Saari, Mika</creator><creator>Systä, Kari</creator><creator>Abrahamsson, Pekka</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240917</creationdate><title>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</title><author>Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_29222817513</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Automation</topic><topic>Benchmarks</topic><topic>Large language models</topic><topic>Multiagent systems</topic><topic>Performance evaluation</topic><topic>Scale models</topic><topic>Software</topic><topic>Software development</topic><topic>Software engineering</topic><topic>Systems design</topic><topic>Task complexity</topic><toplevel>online_resources</toplevel><creatorcontrib>Rasheed, Zeeshan</creatorcontrib><creatorcontrib>Malik, Abdul Sami</creatorcontrib><creatorcontrib>Kai-Kristian Kemell</creatorcontrib><creatorcontrib>Waseem, Muhammad</creatorcontrib><creatorcontrib>Saari, Mika</creatorcontrib><creatorcontrib>Systä, Kari</creatorcontrib><creatorcontrib>Abrahamsson, Pekka</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering 
Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rasheed, Zeeshan</au><au>Malik, Abdul Sami</au><au>Kai-Kristian Kemell</au><au>Waseem, Muhammad</au><au>Saari, Mika</au><au>Systä, Kari</au><au>Abrahamsson, Pekka</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</atitle><jtitle>arXiv.org</jtitle><date>2024-09-17</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. 
Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. To assess the proposed system performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluated the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, our conclusion is that proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2024-09 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2922281751 |
source | Free E- Journals |
subjects | Automation ; Benchmarks ; Large language models ; Multiagent systems ; Performance evaluation ; Scale models ; Software ; Software development ; Software engineering ; Systems design ; Task complexity |
title | CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T14%3A46%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=CodePori:%20Large-Scale%20System%20for%20Autonomous%20Software%20Development%20Using%20Multi-Agent%20Technology&rft.jtitle=arXiv.org&rft.au=Rasheed,%20Zeeshan&rft.date=2024-09-17&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2922281751%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2922281751&rft_id=info:pmid/&rfr_iscdi=true |
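The url field above appears to be an OpenURL (Z39.88-2004) link whose `rft.*` query parameters repeat the record's bibliographic data in percent-encoded form. As a minimal sketch, assuming only Python's standard library, those parameters could be decoded as follows. The `OPENURL` constant is a shortened copy of the field that keeps only a few of its parameters, and the helper name `referent_fields` is hypothetical, not part of any catalog API.

```python
from urllib.parse import urlsplit, parse_qs

# Shortened copy of the record's url field (assumption: only a subset of
# the original query parameters is reproduced here for readability).
OPENURL = (
    "https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004"
    "&rft.genre=document"
    "&rft.atitle=CodePori:%20Large-Scale%20System%20for%20Autonomous%20Software"
    "%20Development%20Using%20Multi-Agent%20Technology"
    "&rft.jtitle=arXiv.org"
    "&rft.au=Rasheed,%20Zeeshan"
    "&rft.date=2024-09-17"
    "&rft.eissn=2331-8422"
)


def referent_fields(url: str) -> dict[str, str]:
    """Return the rft.* (referent) keys of an OpenURL query as a flat dict.

    Hypothetical helper for illustration only.
    """
    query = parse_qs(urlsplit(url).query)
    # parse_qs maps each key to a list of decoded values; keep the first one
    # and strip the "rft." prefix from the key.
    return {
        key[len("rft."):]: values[0]
        for key, values in query.items()
        if key.startswith("rft.")
    }


fields = referent_fields(OPENURL)
for name, value in fields.items():
    print(f"{name}: {value}")
```

Note that `parse_qs` drops parameters with empty values by default, so empty entries in the full link would not appear in the result unless `keep_blank_values=True` is passed.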