arXiv · cs.LG· atomEN04:00 · 05·14
→Finding the Weakest Link: Adversarial Attack against Multi-Agent Communications
The paper proposes Jacobian-gradient methods to select vulnerable messages, agents, and timesteps for single-victim communication attacks, testing two multi-agent communication methods across navigation, PredatorPrey, and TrafficJunction environments, with victim selection, message selection, tempo, and adversarial losses improving attack effectiveness in 15 of 30 scenarios.
#Agent#Safety#Alignment#Research release
why featured
HKR-H/K/R pass, but this is an arXiv technical paper tested on simulated tasks such as Navigation, PredatorPrey, and TrafficJunction, not a product or major lab release, so it stays in the 60–71 band.
editor take
Jacobian targeting improved 15 of 30 scenarios; multi-agent comms safety needs better baselines than random perturbations.
HKR breakdown
hook ✓knowledge ✓resonance ✓