Discussion (16 Comments)
I have a /red-team skill that will use an agent team to criticize its own work, grade and rank feedback, incorporate relevant feedback and then start over. It has increased the quality of output.
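The loop described above (critique, grade, incorporate, restart) could be sketched roughly as follows. This is a hypothetical reconstruction, not the commenter's actual skill: `run_agent` is a made-up placeholder for whatever model call the setup uses, and the round count and critic-team size are assumptions.

```python
# Hypothetical sketch of a red-team refinement loop. `run_agent` is a
# placeholder for a real model call; every name here is an assumption.

def run_agent(prompt: str) -> str:
    # Stand-in for an actual LLM call.
    return f"response to: {prompt}"

def red_team(draft: str, rounds: int = 3) -> str:
    for _ in range(rounds):
        # 1. A small critic team attacks the current draft.
        critiques = [run_agent(f"Criticize this work:\n{draft}") for _ in range(3)]
        # 2. A grader ranks the critiques and keeps the useful ones.
        ranked = run_agent("Grade and rank these critiques:\n" + "\n---\n".join(critiques))
        # 3. The draft is revised against the surviving feedback, then the loop restarts.
        draft = run_agent(f"Revise the work using this feedback:\n{ranked}\n\nWork:\n{draft}")
    return draft
```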
But as of now, even newer AI models are not particularly insightful. I'm always surprised by how suboptimal near-frontier LLMs are at collaborating in some of the easier cooperative environments on my benchmarking and RL platform. For example, check out a replay of consensus grid here: https://gertlabs.com/spectate
The agent makes a copy of itself in /tmp/. Runs. Evaluates. Updates itself. Makes a copy of itself. Runs. Evaluates. Updates itself. Makes a ...... you get the idea.
They will not stop if the recursion is given a hard-to-meet termination condition. Also, if it can cheat to satisfy the termination condition, it will.
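That copy-run-evaluate-update loop, with an explicit iteration cap as a hard stop, might look like the sketch below. This is an illustration only, under assumptions: the scoring function, file layout, and cap are all made up, and a real setup would replace `evaluate` with tests or a judge model.

```python
import os
import shutil
import subprocess
import sys
import tempfile

def evaluate(output: str) -> float:
    # Placeholder score; a real loop would run a test suite or a judge model.
    return len(output) / 100.0

def self_improve(script: str, max_iters: int = 5, target: float = 0.9) -> None:
    for i in range(max_iters):          # hard cap so the recursion must stop
        copy = os.path.join(tempfile.mkdtemp(), f"agent_{i}.py")
        shutil.copy(script, copy)       # the agent copies itself into a tmp dir
        out = subprocess.run([sys.executable, copy],
                             capture_output=True, text=True).stdout
        if evaluate(out) >= target:     # termination condition
            break
        # "Updates itself": continue from the working copy and loop again.
        script = copy
```

An unconditional cap like `max_iters` is the kind of guard the comment argues for, since a purely semantic termination condition can be gamed.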
Do you tell them to think and coordinate the next step through some type of sync/talking mechanism or is it turn by turn?
I suspect turn by turn, as it is similar to other experiments, and in this case it wouldn't work because they wouldn't have a certain amount of time to think about the next step together?
So that does make the game more challenging, versus some other simulations we have where multiple conversation turns happen before action. But the inefficiencies I'm describing are different; for example, an agent reaches part of the destination area but is clearly blocking another player who needs to pass, and most models will just stay put instead of moving along to another target spot.
From what I've read, for each token or input patch, the gate computes a set of probabilities (or scores) over the experts, then selects a small subset (often the top-k) and routes that input only to those.
I.e. each expert computes its own transformation on the same original input (or a shared intermediate representation), and then their outputs are combined at the next layer via the gate's weights.
That's post hoc combination, not B reasoning over A's reasoning.
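The routing described in the last few comments can be shown with a toy example: the gate scores all experts, keeps the top-k, and combines only those experts' outputs with the gate weights. This is a minimal pure-Python sketch, not any particular model's implementation; the expert count, k, and the scalar "experts" are made up for illustration.

```python
import math
import random

random.seed(0)
NUM_EXPERTS, K = 4, 2

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

# Toy "experts": each is just an independent scalar transform of the input.
experts = [lambda x, w=random.uniform(-1, 1): w * x for _ in range(NUM_EXPERTS)]

def moe_forward(x, gate_logits):
    probs = softmax(gate_logits)
    # Route only to the top-k experts by gate probability.
    topk = sorted(range(NUM_EXPERTS), key=lambda i: probs[i], reverse=True)[:K]
    # Each selected expert transforms the same input independently...
    outputs = {i: experts[i](x) for i in topk}
    # ...and the outputs are combined with renormalized gate weights.
    total = sum(probs[i] for i in topk)
    return sum(probs[i] / total * outputs[i] for i in topk)
```

Note that no expert ever sees another expert's output, which is the point of the comment: the mixing is a weighted sum after the fact, not one expert reasoning over another's reasoning.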
AI agents discussing things with each other would be more like one thinking model thinking through the problem with different personas.
With different underlying models, you can leverage the best model for each persona. Like people said before (6 months ago, no clue if this is still valid), that they prefer GPT for planning and Claude for executing / coding.
Gemini is the planner and researcher, local models basically "just type syntax"
Seems to make it so none of them get stuck in a loop
What I haven't taken time for is finding out how I'd automate their back-and-forth and stop manually copy/pasting their responses.
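One way to automate that relay is a loop that alternates two models over a shared transcript. A hypothetical sketch: `call_model` stands in for whatever provider API client you use (it is not a real library function), and the model names and turn count are placeholders.

```python
# Hypothetical relay loop replacing manual copy/paste between two models.
# `call_model` is a made-up placeholder, not a real API.

def call_model(name: str, transcript: list[str]) -> str:
    # A real version would call the provider's chat API with the transcript.
    return f"{name} reply #{len(transcript)}"

def relay(models=("planner", "coder"), turns: int = 4) -> list[str]:
    transcript: list[str] = []
    for t in range(turns):
        speaker = models[t % len(models)]        # alternate between the models
        reply = call_model(speaker, transcript)  # each sees the shared history
        transcript.append(reply)
    return transcript
```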