arbitrIQ's architecture is grounded in established research on adversarial collaboration, ensemble cognition, and structured analytic techniques. This page explains the principles, the protocol, and why each design choice matters.
arbitrIQ does not generate a single answer. It instantiates a structured adversarial process in which propositions are defended, attacked, and evaluated under governed conditions.
This is not prompting a model to "think of counterarguments." It is a multi-agent protocol where independent models occupy distinct epistemic roles, are forbidden from converging prematurely, and produce a complete evidentiary record.
arbitrIQ's architecture draws on convergent findings from decision science, intelligence analysis, epistemology, and machine learning research.
When researchers with opposing hypotheses design experiments together, they produce stronger evidence than either would alone. The disagreement becomes a methodological asset, not a social liability.
arbitrIQ operationalizes this principle: the Advocate and Opposition are structurally prevented from converging, forcing genuine engagement with the strongest form of each position.
Intelligence agencies developed formalized methods — devil's advocacy, Team A/Team B analysis, Analysis of Competing Hypotheses — precisely because unstructured expert judgment systematically underperforms structured dissent under conditions of complexity and uncertainty.
arbitrIQ's dimension-by-dimension debate is a computational implementation of these techniques, applied at a scale and speed that human teams cannot sustain consistently.
Collective squared error equals average individual squared error minus prediction diversity. This is not a heuristic; it is a mathematical identity, the diversity prediction theorem. Aggregate accuracy improves with diversity of judgment, even when individual judges are imperfect.
arbitrIQ exploits this directly: independently-trained models with different architectures, training data, and reasoning patterns produce structurally diverse assessments. The ensemble is provably at least as accurate as its average member, and strictly more accurate whenever the members disagree.
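The identity is easy to verify numerically. This minimal sketch (illustrative only, not arbitrIQ code) computes both sides for a small panel of imperfect judges:

```python
from statistics import mean

def collective_error_identity(predictions, truth):
    """Compute both sides of the diversity prediction theorem:
    collective squared error = average individual squared error
                               - prediction diversity (variance around the mean).
    """
    c = mean(predictions)  # the ensemble's prediction is the simple average
    collective_err = (c - truth) ** 2
    avg_individual_err = mean((p - truth) ** 2 for p in predictions)
    diversity = mean((p - c) ** 2 for p in predictions)
    return collective_err, avg_individual_err - diversity

# Three imperfect judges estimate a true value of 100.
lhs, rhs = collective_error_identity([90, 105, 118], truth=100)
# both sides agree (≈ 18.78), even though every individual error is larger
```

Because the diversity term is subtracted, the collective can never be worse than the average judge; the more the judges disagree, the larger the improvement.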
Large language models exhibit systematic sycophantic behavior: they tend to agree with the user's stated position, even when that position is wrong. Single-model interactions amplify the user's existing priors rather than challenging them.
arbitrIQ's adversarial structure is specifically designed to defeat this failure mode. The Opposition agent has no access to the user's preferred outcome — its mandate is to find weaknesses regardless of the user's expectations.
Irving, Christiano, and Amodei (2018) proposed that AI systems can be made more truthful by having them debate each other under human judgment. Their key insight: it is easier for a human to judge a debate than to find the truth independently. A strong debater cannot win by lying if the opponent can expose the lie.
arbitrIQ applies this principle to strategic decision-making. The decision-maker doesn't need to be an expert in every dimension — they need to see the strongest arguments from both sides and assess which survived challenge.
Four specialized roles, each with a distinct epistemic mandate. No agent has a complete view — the architecture enforces the division of cognitive labor.
Director: Ingests uploaded documents and contextual data. Decomposes the strategic question into specific dimensions, each representing a distinct analytical axis that requires independent examination. After all debates conclude, the Director synthesizes the evaluator reports and debate transcripts into a unified executive report.
Advocate: Constructs the strongest possible case in favor of the proposition, drawing on uploaded evidence, web research, and structured reasoning. Must respond substantively to every challenge from the Opposition; it cannot concede without providing counter-evidence.
Opposition: Systematically attacks the proposition. Surfaces counter-evidence, identifies hidden assumptions, stress-tests financial projections, and exposes risks the Advocate has not addressed. Structurally prevented from agreeing to disagree.
Evaluator: Intervenes once per dimension, after all debate turns are complete. Assesses argument quality, evidence strength, logical coherence, and the degree of genuine engagement between sides. Produces a structured score and identifies what remains unresolved.
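The division of labor can be pictured as a small role registry. The sketch below is illustrative only; the field names and the provider-to-role assignments are assumptions for the example, not arbitrIQ's actual configuration:

```python
from dataclasses import dataclass
from enum import Enum

class Role(Enum):
    DIRECTOR = "director"      # decomposes the question, synthesizes the final report
    ADVOCATE = "advocate"      # builds the strongest case for the proposition
    OPPOSITION = "opposition"  # attacks the proposition; cannot agree to disagree
    EVALUATOR = "evaluator"    # scores each debate once, after all turns conclude

@dataclass(frozen=True)
class AgentSpec:
    role: Role
    provider: str  # hypothetical: which model family backs this role
    mandate: str

# Hypothetical assignment: each role backed by an independently-trained model,
# per the multi-model design described below.
AGENTS = [
    AgentSpec(Role.DIRECTOR, "anthropic", "decompose and synthesize"),
    AgentSpec(Role.ADVOCATE, "openai", "defend with evidence"),
    AgentSpec(Role.OPPOSITION, "google", "surface counter-evidence and hidden assumptions"),
    AgentSpec(Role.EVALUATOR, "anthropic", "score argument quality and engagement"),
]
```

The point of the structure is that no single agent holds all four mandates: each spec deliberately excludes the others' responsibilities.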
For each dimension, the protocol proceeds in three phases. The iterative cycle is between Advocate and Opposition only — the Evaluator intervenes once, after the debate concludes.
The critical design choice: the Advocate and Opposition iterate without premature adjudication. The Evaluator's assessment is based on the full exchange, not on partial snapshots — ensuring that late-emerging arguments and concessions are weighted appropriately.
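The three phases can be sketched as a loop. This is a hedged illustration of the protocol as described above; the function signatures, the exact turn structure, and the `turns` parameter are assumptions, not arbitrIQ's real interface:

```python
def run_dimension(dimension, advocate, opposition, evaluator, turns=3):
    """Illustrative per-dimension debate protocol.

    Phase 1: the Advocate opens with its strongest case.
    Phase 2: Opposition and Advocate alternate challenge/response turns,
             with no adjudication during the exchange.
    Phase 3: the Evaluator scores the complete transcript, exactly once.
    """
    transcript = [("advocate", advocate(dimension, history=[]))]
    for _ in range(turns):
        transcript.append(("opposition", opposition(dimension, history=transcript)))
        transcript.append(("advocate", advocate(dimension, history=transcript)))
    # The Evaluator sees the full exchange, so late-emerging arguments
    # and concessions carry their proper weight.
    return evaluator(dimension, transcript)
```

Note that `evaluator` appears only once, outside the loop: that single call is the structural guarantee that adjudication cannot happen mid-debate.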
arbitrIQ assigns different independently-trained models to each agent role. This is not a cosmetic choice — it is a direct application of the diversity prediction theorem: ensemble accuracy improves with genuine diversity of reasoning, even when individual reasoners are imperfect.
Models from Anthropic, OpenAI, and Google differ in training data, RLHF processes, architectural choices, and failure modes. When forced into adversarial interaction, these differences produce debates of substantially higher quality than any single-model self-critique.
LLMs are trained to agree with users. In arbitrIQ, the Opposition has no access to the user's preferred outcome — its mandate is structural, not social. The debate framework defeats pleaser bias by design.
Models trained on different data with different objectives produce different errors. Under adversarial pressure, errors that would survive single-model review are exposed by the opposing model's distinct failure profile.
Different model families exhibit distinct reasoning patterns — some favor quantitative analysis, others narrative coherence, others risk-centric framing. Structured opposition surfaces these differences as analytical assets rather than noise.
When challenged by an adversary drawing on live web search, models cannot rely on training-data priors alone. The iterative cycle forces progressively deeper engagement with current evidence.
Not all decisions require the same depth of examination. arbitrIQ allows leaders to calibrate the breadth, depth, and evidentiary standard of each analysis — matching governance effort to decision stakes.
A rapid triage — 2 dimensions, 3 debate turns — takes under 10 minutes and is appropriate for early-stage scoping. A full governance-grade analysis — 7 dimensions, 10 debate turns with web grounding — produces 60+ pages of structured analysis suitable for board presentation or client deliverables. The decision-maker chooses the appropriate level.
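The two tiers could be captured in a configuration object along these lines. This is a sketch under assumptions: the class and field names are hypothetical, and only the numbers (2 dimensions/3 turns, 7 dimensions/10 turns with web grounding) come from the description above:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class AnalysisTier:
    """Hypothetical tier configuration matching the options described above."""
    name: str
    dimensions: int      # distinct analytical axes the Director opens
    debate_turns: int    # Advocate/Opposition exchanges per dimension
    web_grounding: bool  # whether agents may draw on live web research

RAPID_TRIAGE = AnalysisTier("rapid-triage", dimensions=2, debate_turns=3,
                            web_grounding=False)
GOVERNANCE_GRADE = AnalysisTier("governance-grade", dimensions=7, debate_turns=10,
                                web_grounding=True)
```

Treating depth as configuration rather than a fixed property is what lets governance effort scale with decision stakes.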
The most common question we receive is "which model do you use?" The answer is that the question is underdetermined — the architecture of the interaction matters more than the choice of any single model.
A frontier model in a single-turn interaction will produce sycophantic, overconfident output. The same model, placed in a structured adversarial role with a different model challenging its claims, produces qualitatively different reasoning.
Run your first strategic decision through arbitrIQ.
Launch arbitrIQ

Analysis informs decisions. Governance protects them.