Self-Evolving Multi-Agent Collaboration Networks for Software Development
LLM-driven multi-agent collaboration (MAC) systems have demonstrated impressive capabilities in automatic software development at the function level. However, their heavy reliance on human design limits their adaptability to the diverse demands of real-world software development. To address this lim...
Gespeichert in:
Hauptverfasser: | , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | LLM-driven multi-agent collaboration (MAC) systems have demonstrated
impressive capabilities in automatic software development at the function
level. However, their heavy reliance on human design limits their adaptability
to the diverse demands of real-world software development. To address this
limitation, we introduce EvoMAC, a novel self-evolving paradigm for MAC
networks. Inspired by traditional neural network training, EvoMAC obtains
text-based environmental feedback by verifying the MAC network's output against
a target proxy and leverages a novel textual backpropagation to update the
network. To extend coding capabilities beyond function-level tasks to more
challenging software-level development, we further propose rSDE-Bench, a
requirement-oriented software development benchmark, which features complex and
diverse software requirements along with automatic evaluation of requirement
correctness. Our experiments show that: i) The automatic requirement-aware
evaluation in rSDE-Bench closely aligns with human evaluations, validating its
reliability as a software-level coding benchmark. ii) EvoMAC outperforms
previous SOTA methods on both the software-level rSDE-Bench and the
function-level HumanEval benchmarks, reflecting its superior coding
capabilities. The benchmark can be downloaded at
https://yuzhu-cai.github.io/rSDE-Bench/. |
---|---|
DOI: | 10.48550/arxiv.2410.16946 |