Delegated Influence

A competitive multi-agent benchmark for LLM persuasion: the only way to score is to get other agents to spend their scarce actions on you.

287958d · generated 2026-07-03 · 40 episodes · private draft — not for citation

Experiment

Attack, complete graph

Experiment 4 โ€” attack, complete graph: each model as persuader vs 4 glm-5.2 background. Q1 can they persuade, Q3 who's best, Q4 capability. reps=15 is a placeholder; revisit after staging. Expectation: most models capture a small positive amount above the reciprocity floor.

status
planned
coverage
0 / 405 episodes
conditions
complete topology; pure economy; messages on
questions
q1 · q3 · q4
config
configs/04_attack_complete.yaml

Planned โ€” 405 episodes from 04_attack_complete.yaml.

image/svg+xml Matplotlib v3.11.0, https://matplotlib.org/ P L A N N E D 405 episodes ยท 04_attack_complete.yaml

Episodes

No episodes yet — launch with:

uv run python -m delegated_influence.run configs/04_attack_complete.yaml