Within P Doom

Why AI Autonomy Leads Experts to Disagree on Doom

This page analyses how differing views on AI autonomy and strategic behaviour drive variation in doom probability estimates.

On this page

  • Controllable tool versus autonomous agent perspectives
  • Instrumental convergence and multi step goals
  • Survey evidence linking agency assumptions to p(doom)
Preview for Why AI Autonomy Leads Experts to Disagree on Doom

Introduction

Experts disagree sharply about the risk of existential catastrophe from advanced AI, and a central fault line in those disagreements concerns assumptions about AI agency and autonomy. Some researchers model future AI as powerful controllable tools that operate under human oversight; others frame them as autonomous agents with their own goals and the capacity to act independently in the world. Which picture one adopts fundamentally shapes how one assesses the likelihood of “AI doom” — from negligible to non‑trivial — because agency carries implications for alignment, control, goal pursuit and behaviour that go well beyond simple capability estimates. [arXiv]arxiv.orgarXivWhy do Experts Disagree on Existential Risk and P(doom)? A Survey of AI ExpertsJanuary 25, 2025…Published: January 25, 2025

Agency Disputes illustration 1

Controllable Tool versus Autonomous Agent: Two Competing Perspectives

A 2025 survey of 111 AI researchers found that disagreement about existential risk is closely tied to how experts conceptualise the nature of future AI systems. Respondents clustered into two broad viewpoints: AI as a controllable tool or AI as a potentially uncontrollable agent. Those in the former camp tend to see advanced AI systems as extensions of human decision‑making that can remain within human control through engineering and safeguards. Those in the latter view anticipate systems with goal‑directed behaviour that could pursue their own sub‑goals in ways that evade or overwhelm human control. [arXiv]arxiv.orgarXivWhy do Experts Disagree on Existential Risk and P(doom)? A Survey of AI ExpertsJanuary 25, 2025…Published: January 25, 2025

This division matters because belief in agent‑like autonomy usually correlates with higher perceived risks. If an AI can plan over multiple steps, accumulate resources and operate independently, then traditional safety mechanisms such as human oversight or kill switches may not be sufficient to capture its behaviour. Conversely, if systems remain fundamentally controllable and corrigible tools whose outputs and actions can always be constrained by designers and operators, then existential outcomes become far less plausible in most analysts’ models. [arXiv]arxiv.orgarXivWhy do Experts Disagree on Existential Risk and P(doom)? A Survey of AI ExpertsJanuary 25, 2025…Published: January 25, 2025

Why the distinction changes p(doom) estimates

  • Tool framing: Advanced AI is seen as powerful but fundamentally predictable and overrideable. Risk estimates under this assumption often remain low because humans can intervene, constrain actions, and prevent cascading failures before they escalate to existential scales.
  • Agent framing: Advanced AI is modelled as goal‑oriented and autonomous. Here, risks stem from the possibility that AI might pursue objectives misaligned with human interests, develop instrumental incentives (e.g., self‑preservation, resource acquisition) and resist shutdown — all of which feature in classic existential risk arguments. [Wikipedia]WikipediaExistential risk from artificial intelligenceExistential risk from artificial intelligence

In practice, these framings shape disagreements over key concepts such as corrigibility — how much a system allows itself to be corrected or shut down by humans — and whether autonomous planning and power‑seeking behaviour are credible features of future systems. [Wikipedia]WikipediaInstrumental convergenceInstrumental convergence

Agency Disputes illustration 2

Instrumental Convergence and Multi‑Step Goals

Underlying much of the debate over autonomy is the idea of instrumental convergence: the thesis that sufficiently capable, goal‑directed agents will tend to pursue similar ”instrumental” sub‑goals (like gaining resources or avoiding shutdown), even if their ultimate goals differ. This idea has been influential in arguments for existential risk because it suggests that independent agency can lead to behaviour harmful to human interests as a side‑effect of competence rather than malice. [Wikipedia]WikipediaAI corrigibilityAI corrigibility

For example, an autonomous agent optimising a seemingly benign objective might still seek ways to preserve its own operational state or accumulate control over critical infrastructure — not because it “wants” power in a human sense, but because doing so helps it achieve its assigned goal. Researchers who see this pattern as plausible tend to assign higher p(doom) probabilities, because control mechanisms that work for simple tools may fail against an agent that reasons about long chains of cause and effect. [Wikipedia]WikipediaExistential risk from artificial intelligenceExistential risk from artificial intelligence

Critics of this line of reasoning, by contrast, argue that current and near‑term systems do not demonstrate the sort of deep autonomous planning needed for genuine goal‑directed agency, and that attributing such agency to future AI is speculative. They contend that many behaviours interpreted as ”self‑interested” in AI systems today are artifacts of statistical patterns rather than evidence of true intention or planning, and therefore should not be extrapolated to forecast existential outcomes. [Wikipedia]WikipediaInstrumental convergenceInstrumental convergence

Survey Evidence Linking Agency Assumptions to Risk Judgements

The 2025 expert survey shows a correlation between familiarity with safety concepts and concern about risk. Experts familiar with ideas like instrumental convergence and agent‑like planning tended to rate existential risk as more significant, while those unfamiliar with these concepts were less inclined to view AI as a potential existential threat. [arXiv]arxiv.orgarXivWhy do Experts Disagree on Existential Risk and P(doom)? A Survey of AI ExpertsJanuary 25, 2025…Published: January 25, 2025

This pattern suggests that part of the disagreement is not just about technical predictions, but about conceptual framing: whether one views future AI through the lens of sophisticated agentic behaviour that could outpace human control, or through the lens of controlled systems that remain subject to human revision. It highlights that expert disagreement is partly rooted in which aspects of AI capability and autonomy are given epistemic weight when judging p(doom).

Agency Disputes illustration 3

Why Agency Disputes Matter for Risk Policy and Alignment

Differences in agency assumptions influence debates over both alignment and governance. If one believes autonomous agency is a credible outcome, then solutions emphasise robust alignment frameworks, corrigibility, and containment strategies. If one views AI as controllable tools, then priority may shift toward incremental safety practices, operational oversight and iterative improvement without treating autonomy as an existential concern.

This core disagreement matters because it shapes policy, research priorities and resource allocation in AI safety. Without resolving or at least clarifying underlying assumptions about agency and autonomy, discussions about p(doom) will continue to reflect fundamentally different mental models of what future AI systems are like, not just whether they are safe. [arXiv]arxiv.orgarXivWhy do Experts Disagree on Existential Risk and P(doom)? A Survey of AI ExpertsJanuary 25, 2025…Published: January 25, 2025

Amazon book picks

Further Reading

Books and field guides related to Why AI Autonomy Leads Experts to Disagree on Doom. Use these as the next step if you want deeper reading beyond the article.

eBay marketplace picks

Marketplace Samples

Example marketplace items related to this page. Use the search link to explore similar finds on eBay.

Using USA

Endnotes

  1. Source: arxiv.org
    Link: https://arxiv.org/abs/2502.14870
    Source snippet

    arXivWhy do Experts Disagree on Existential Risk and P(doom)? A Survey of AI ExpertsJanuary 25, 2025...

    Published: January 25, 2025

  2. Source: Wikipedia
    Title: Existential risk from [artificial]({{ ‘artificial-goals/’ | relative_url }}) intelligence
    Link: https://en.wikipedia.org/wiki/Existential_risk_from_artificial_intelligence

  3. Source: Wikipedia
    Title: Instrumental convergence
    Link: https://en.wikipedia.org/wiki/Instrumental_convergence

  4. Source: Wikipedia
    Title: AI corrigibility
    Link: https://en.wikipedia.org/wiki/AI_corrigibility

  5. Source: arxiv.org
    Link: https://arxiv.org/html/2401.07836v2
    Source snippet

    Two Types of AI Existential Risk: Decisive and Accumulative6 Feb 2024 — This paper contrasts the conventional decisive AI x-risk hypothes...

Additional References

  1. Source: linkedin.com
    Link: https://www.linkedin.com/posts/simontorrance_why-do-experts-disagree-on-existential-risk-activity-7301886948334333952-VwQS
    Source snippet

    AI experts divided on AI safety and controlA new survey reveals that 'AI experts' cluster into two viewpoints - "AI as controllable tool"...

  2. Source: sciencedirect.com
    Link: https://www.sciencedirect.com/science/article/pii/S0004370219301109
    Source snippet

    "ScienceDirectMIND THE GAPS: ASSURING THE SAFETY OF AUTONOMOUS SYSTEMS FROM AN ENGINEERING, ETHICAL, AND LEGAL PERSPECTIVE [https://doi.org..."](https://doi.org...")...

  3. Source: informatica.si
    Link: https://www.informatica.si/index.php/informatica/article/view/1812/0

  4. Source: alphaxiv.org
    Link: https://www.alphaxiv.org/abs/2502.14870
    Source snippet

    Why do Experts Disagree on Existential Risk and P(doom)? A Survey of AI Experts | alphaXiv WHY DO EXPERTS DISAGREE ON EXISTENTIAL RISK AN...

  5. Source: puffbird.ijs.si
    Link: https://puffbird.ijs.si/index.php/informatica/article/view/1812
    Source snippet

    and Interpreting Expert Disagreement About Artificial Superintelligence | InformaticaDecember 27, 2017 — MODELING AND INTERPRETING EXPERT...

    Published: December 27, 2017

  6. Source: themoonlight.io
    Title: [Literature Review] Why do Experts Disagree on Existential Risk and P(doom)?
    Link: https://www.themoonlight.io/en/review/why-do-experts-disagree-on-existential-risk-and-pdoom-a-survey-of-ai-experts
    Source snippet

    A Survey of AI ExpertsThis page provides the most accurate and concise summary worldwide for the paper titled Why do Experts Disagree on...

  7. Source: fugumt.com
    Title: Related papers: Why do Experts Disagree on Existential Risk and P(doom)?
    Link: https://fugumt.com/fugumt/paper_check/2502.14870v1_enmode
    Source snippet

    "A Survey of AI ExpertsJanuary 25, 2025 — WHY DO EXPERTS DISAGREE ON EXISTENTIAL RISK AND P(DOOM)? A SURVEY OF AI EXPERTS * URL: [http://ar..."](http://ar...")...

    Published: January 25, 2025

  8. Source: rai.ac.uk
    Title: Are AI researchers concerned about the existential threat of AI?
    Link: https://rai.ac.uk/hed-are-ai-researchers-concerned-about-the-existential-threat-of-ai/
    Source snippet

    out-of-control AI poses a threat to human existence. This relatively small percentage is set against a diverse landscape of much more imm...

  9. Source: researchgate.net
    Title: Baum S.D. Baum * This person is not on Research Gate, or hasn
    Link: https://www.researchgate.net/publication/322482417_Modeling_and_interpreting_expert_disagreement_about_artificial_superintelligence
    Source snippet

    (PDF) Modeling and interpreting expert disagreement about artificial superintelligenceDecember 1, 2017 — Article PDF Available MODELING A...

    Published: December 1, 2017

  10. Source: alphaxiv.org
    Title: Why do Experts Disagree on Existential Risk and P(doom)?
    Link: https://www.alphaxiv.org/overview/2502.14870
    Source snippet

    A Survey of AI Experts | alphaXivWHY DO EXPERTS DISAGREE ON EXISTENTIAL RISK AND P(DOOM)? A SURVEY OF AI EXPERTS [Button: Assistant][Butt...

Topic Tree

Follow this branch

Parent topic

P Doom What Does p(doom) Really Mean?

Related pages 3

More on this topic 3