CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology

Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as aut...

Bibliographic details
Published in:	arXiv.org 2024-09
Main authors:	Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka
Format:	Article
Language:	eng
Subjects:	Automation ; Benchmarks ; Large language models ; Multiagent systems ; Performance evaluation ; Scale models ; Software ; Software development ; Software engineering ; Systems design ; Task complexity
Online access:	Full text

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka
description	Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. To assess the proposed system's performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluating the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, we conclude that the proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2922281751</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2922281751</sourcerecordid><originalsourceid>FETCH-proquest_journals_29222817513</originalsourceid><addsrcrecordid>eNqNiksKwjAUAIMgKOodAq4L7au11Z34wYWC0LoUCfU1tqR5mo_i7VXwAK4GZqbD-hDHUZBNAHpsZG0ThiFMU0iSuM9OS7rggUw95zthJAZ5KRTy_GUdtrwiwxfekaaWvOU5Ve4pDPIVPlDRrUXt-NHWWvK9V64OFvJrCiyvmhTJ15B1K6Esjn4csPFmXSy3wc3Q3aN154a80Z90hhkAZFGaRPF_1xvw80O8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2922281751</pqid></control><display><type>article</type><title>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</title><source>Free E- Journals</source><creator>Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka</creator><creatorcontrib>Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka</creatorcontrib><description>Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. 
Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. To assess the proposed system performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluated the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, our conclusion is that proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Automation ; Benchmarks ; Large language models ; Multiagent systems ; Performance evaluation ; Scale models ; Software ; Software development ; Software engineering ; Systems design ; Task complexity</subject><ispartof>arXiv.org, 2024-09</ispartof><rights>2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Rasheed, Zeeshan</creatorcontrib><creatorcontrib>Malik, Abdul Sami</creatorcontrib><creatorcontrib>Kai-Kristian Kemell</creatorcontrib><creatorcontrib>Waseem, Muhammad</creatorcontrib><creatorcontrib>Saari, Mika</creatorcontrib><creatorcontrib>Systä, Kari</creatorcontrib><creatorcontrib>Abrahamsson, Pekka</creatorcontrib><title>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</title><title>arXiv.org</title><description>Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. 
To assess the proposed system performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluated the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, our conclusion is that proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori.</description><subject>Automation</subject><subject>Benchmarks</subject><subject>Large language models</subject><subject>Multiagent systems</subject><subject>Performance evaluation</subject><subject>Scale models</subject><subject>Software</subject><subject>Software development</subject><subject>Software engineering</subject><subject>Systems design</subject><subject>Task complexity</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNiksKwjAUAIMgKOodAq4L7au11Z34wYWC0LoUCfU1tqR5mo_i7VXwAK4GZqbD-hDHUZBNAHpsZG0ThiFMU0iSuM9OS7rggUw95zthJAZ5KRTy_GUdtrwiwxfekaaWvOU5Ve4pDPIVPlDRrUXt-NHWWvK9V64OFvJrCiyvmhTJ15B1K6Esjn4csPFmXSy3wc3Q3aN154a80Z90hhkAZFGaRPF_1xvw80O8</recordid><startdate>20240917</startdate><enddate>20240917</enddate><creator>Rasheed, Zeeshan</creator><creator>Malik, Abdul 
Sami</creator><creator>Kai-Kristian Kemell</creator><creator>Waseem, Muhammad</creator><creator>Saari, Mika</creator><creator>Systä, Kari</creator><creator>Abrahamsson, Pekka</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240917</creationdate><title>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</title><author>Rasheed, Zeeshan ; Malik, Abdul Sami ; Kai-Kristian Kemell ; Waseem, Muhammad ; Saari, Mika ; Systä, Kari ; Abrahamsson, Pekka</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_29222817513</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Automation</topic><topic>Benchmarks</topic><topic>Large language models</topic><topic>Multiagent systems</topic><topic>Performance evaluation</topic><topic>Scale models</topic><topic>Software</topic><topic>Software development</topic><topic>Software engineering</topic><topic>Systems design</topic><topic>Task complexity</topic><toplevel>online_resources</toplevel><creatorcontrib>Rasheed, Zeeshan</creatorcontrib><creatorcontrib>Malik, Abdul Sami</creatorcontrib><creatorcontrib>Kai-Kristian Kemell</creatorcontrib><creatorcontrib>Waseem, Muhammad</creatorcontrib><creatorcontrib>Saari, Mika</creatorcontrib><creatorcontrib>Systä, Kari</creatorcontrib><creatorcontrib>Abrahamsson, Pekka</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering 
Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rasheed, Zeeshan</au><au>Malik, Abdul Sami</au><au>Kai-Kristian Kemell</au><au>Waseem, Muhammad</au><au>Saari, Mika</au><au>Systä, Kari</au><au>Abrahamsson, Pekka</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology</atitle><jtitle>arXiv.org</jtitle><date>2024-09-17</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Context: Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) have transformed the field of Software Engineering (SE). Existing LLM-based multi-agent models have successfully addressed basic dialogue tasks. However, the potential of LLMs for more challenging tasks, such as automated code generation for large and complex projects, has been investigated in only a few existing works. 
Objective: This paper aims to investigate the potential of LLM-based agents in the software industry, particularly in enhancing productivity and reducing time-to-market for complex software solutions. Our primary objective is to gain insights into how these agents can fundamentally transform the development of large-scale software. Methods: We introduce CodePori, a novel system designed to automate code generation for large and complex software projects based on functional and non-functional requirements defined by stakeholders. To assess the proposed system performance, we utilized the HumanEval benchmark and manually tested the CodePori model, providing 20 different project descriptions as input and then evaluated the code accuracy by manually executing the code. Results: CodePori is able to generate running code for large-scale projects, aligned with the typical software development process. The HumanEval benchmark results indicate that CodePori improves code accuracy by 89%. A manual assessment conducted by the first author shows that the CodePori system achieved an accuracy rate of 85%. Conclusion: Based on the results, our conclusion is that proposed system demonstrates the transformative potential of LLM-based agents in SE, highlighting their practical applications and opening new opportunities for broader adoption in both industry and academia. Our project is publicly available at https://github.com/GPT-Laboratory/CodePori.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-09
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2922281751
source	Free E- Journals
subjects	Automation ; Benchmarks ; Large language models ; Multiagent systems ; Performance evaluation ; Scale models ; Software ; Software development ; Software engineering ; Systems design ; Task complexity
title	CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T14%3A46%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=CodePori:%20Large-Scale%20System%20for%20Autonomous%20Software%20Development%20Using%20Multi-Agent%20Technology&rft.jtitle=arXiv.org&rft.au=Rasheed,%20Zeeshan&rft.date=2024-09-17&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2922281751%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2922281751&rft_id=info:pmid/&rfr_iscdi=true