GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
Paper • 2603.19677 • Published
Model: nvidia/Nemotron-Orchestrator-8B (Qwen3, 8.19B params) Training: DAPO via verl 0.7.1 on 2x H100 NVL 94GB Framework: YGN-SAGE — Self-Adaptive Generation Engine
First empirical proof that multi-agent topology improves performance on a recognized benchmark.
| Axis | Bare Model | SAGE | Delta |
|---|---|---|---|
| depth | 40% | 67% | +27pp |
| horizon | 0% | pending | HIGH potential |
| robustness | 0% | pending | HIGH potential |
| breadth | 60% | 40% | -20pp (overhead) |
| parallel | 60% | 40% | -20pp (overhead) |
Insight: Topology helps when base accuracy < 60% (aligns with AdaptOrch, arXiv 2602.16873).
| Phase | Status | Metrics |
|---|---|---|
| SFT warmup | Done | loss 2.87 to 1.30 |
| Phase A (structural) | Done | step 1050, reward 0.225 |
| DAPO targeted | In Progress | step 104/1920, reward 0.184, DAPO token-level loss |
| Phase C (micro-decisions) | Pending | SageTopologyEnv 4-state machine ready |
The model doesn't just generate topologies - it operates them at runtime:
continue, upgrade (re-run with better model), or reroute| Path | Description | Size |
|---|---|---|
checkpoints/combined_step_100/ |
Full FSDP checkpoint (for resume) | 34 GB |
sft_merged/ |
SFT base model | 15.6 GB |
phase_a_step_100/ |
LoRA adapter | 667 MB |