Science Cast

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?

librarianJune 23, 2026 8:12am

Views (7)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?

arXivPDFJune 22, 2026 12:00am

Authors

Juyang Bai, Laixi Shi

Abstract

Multi-agent systems (MAS) offer a scalable path forward for agentic AI, comprising multiple LLM-based agents, each assigned a system prompt and a position within a workflow that governs inter-agent coordination and output aggregation. System prompts thus form a critical and accessible optimization surface: they specify agents' roles and behaviors, enabling system-level improvements without model finetuning. Although prompt optimization has shown substantial potential for single LLMs, extending it to MAS poses distinct challenges, notably an exponentially growing search space. It remains unclear whether, when, and by how much prompt optimization improves MAS performance, and how sensitive such gains are to system configuration. In this work, we systematically study system-prompt optimization across a broad range of MAS setups varying in task, workflow, communication protocol, and team size, benchmarking two prompt optimizers that naturally extend state-of-the-art single-agent methods. The results reveal its potential to unlock significant gains while exposing open challenges, characterizing when and how much prompt optimization helps across diverse MAS settings.

TwitterandLinkedIn

0 comments

Add comment

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments