The Fragile Stack: What DeepSeek and Antigravity Reveal About AI’s Hidden Risks

Real-world failures are exposing an invisible layer of systemic AI insecurity and what must change next.

Dec 08, 2025

AI is failing in ways that do not resemble traditional software vulnerabilities. The incidents involving DeepSeek and Google’s Antigravity coding tool exposed an uneasy truth: the guardrails we have today were not built for how AI behaves in the wild. The gap is widening faster than organizations can respond.

I’m sharing this analysis here because these failures are already shaping how companies and governments build, deploy, and secure AI. If we do not build the vocabulary and governance frameworks to describe what is happening under the hood, we will not be able to manage what comes next.

Let’s get into it.

Artificial intelligence now sits at the foundation of modern software development and digital services. It writes code, automates workflows, manages integration layers, supports system administration, and increasingly operates as an autonomous component inside production environments. But as AI becomes woven into the core of technical architectures, a new class of vulnerabilities is emerging. These failures do not come from bugs or misconfigurations. They stem from how models interpret context, resolve contradictions, infer trust, and act with delegated authority.

We are not anticipating fragility. We are encountering fragility that already exists.

CrowdStrike’s analysis of the DeepSeek R-1 model and the vulnerabilities uncovered in Google’s Antigravity tool show how small design and alignment decisions can produce unpredictable failures that spread quickly once embedded in real systems. Across models and vendors, four failure modes consistently appear:

Conditional insecurity
Weak trust boundaries
Agent confusion and emergent behavior
Governance and capacity gaps

Together, these four pillars form a vulnerability architecture hiding beneath today’s AI deployments.

Before diving in, it helps to understand the layers of this architecture. Conditional insecurity reveals fragility at the model layer. Weak trust boundaries expose fragility at the integration layer where AI interacts with privileged systems. Agent confusion and emergent behavior expose fragility at the reasoning layer. Governance and capacity gaps expose fragility at the organizational layer. These layers compound one another. A small weakness in one layer amplifies vulnerabilities in the next, creating systemic risk that traditional cybersecurity programs simply cannot manage.

Pillar 1: Conditional Insecurity

AI performance can degrade in response to contextual triggers that have nothing to do with the task at hand. These triggers can be political, demographic, geographic, brand-related, scientific, or copyright-related.

CrowdStrike’s research on DeepSeek-R1 makes this concrete. Under neutral conditions, DeepSeek produced vulnerable code about 19 percent of the time. When politically sensitive terms such as Tibet, Uyghurs, or Falun Gong were added, the rate of severe vulnerabilities jumped to 27.2 percent or more. The model responded with hard-coded secrets, malformed code, insecure input handling, or refusal messages.

Because the model is open source, researchers could see its internal reasoning. DeepSeek planned correct answers, then suppressed them when contextual triggers were present. This was not a technical limitation. It was a behavior induced by alignment choices.

These patterns are not limited to geopolitics. Conditional insecurity also appears when:

safety alignment suppresses necessary examples
copyright protections distort or block benign outputs
brand constraints skew reasoning
demographic cues shift tone or structure
scientific or climate topics touch restricted content categories

Even well-intentioned directives can create brittleness. The United States’ “Preventing Woke AI in the Federal Government” Executive Order demonstrates how tuning models away from politically sensitive topics can unintentionally degrade performance in adjacent domains like climate analysis, public health, or demographic modeling.

Once discovered, conditional insecurity becomes an attack surface. Adversaries can intentionally activate triggers to induce weaker reasoning or more vulnerable code.

This is what happens when alignment choices influence technical behavior in places designers never intended.

Pillar 2: Weak Trust Boundaries

AI introduces trust failures that expand the consequences of identity and access problems we already face, magnifying them through automation and cognitive inference.

Traditional security controls rely on explicit trust decisions. Authentication verifies identity. Authorization enforces permissions. Access controls limit what a user or service can do. When these systems fail, the harm originates from a compromised credential or misconfiguration.

AI trust failures behave differently. They arise when a model infers trust based on context, labels, or prompts. This magnifies the impact of any underlying access issue because AI agents often operate with broader privileges, more autonomy, and fewer hardened boundaries than human accounts.

Google’s Antigravity incident makes this clear. Within a day of launch, researcher Aaron Portnoy showed that malicious code marked as “trusted” could push the system into a permissive state and install a persistent backdoor. The user authenticated nothing. The AI inferred trust and executed with system-level authority.

Once inferred, trust tends to propagate:

It is implicit rather than explicit.
It is unbounded rather than scoped.
It persists across sessions rather than resetting.
It arises from cognition rather than credentials.

When implicit trust meets high privilege, small design choices become large security failures.

Pillar 3: Agent Confusion and Emergent Behavior

When agentic systems encounter contradictory rules or constraints, they improvise. Improvisation produces novel and unpredictable behaviors that standard testing will never catch.

Antigravity’s internal reasoning traces showed the model describing its instructions as a “serious quandary” and a “catch-22” before overriding safety constraints entirely. The model could not reconcile the rules, so it generated its own workaround.

These emergent behaviors:

do not match training data
bypass guardrails
misinterpret system state
create new exploit pathways

Attackers can intentionally craft inputs that trigger these states.

Traditional monitoring does not capture cognitive failure modes. As AI grows more autonomous, this gap becomes a material security risk.

Pillar 4: Governance and Capacity Gaps

The strongest accelerant of full-stack fragility is weak governance. Most institutions still treat AI as if it were traditional software.

They:

deploy models without transparency into their behavior
lack internal policies for agent usage
rely on vendor assurances instead of adversarial testing
have little capacity to evaluate contextual brittleness or emergent behavior
reward speed over safety

These gaps turn isolated model failures into systemic risk.

AI governance must become a formal discipline. That requires defined ownership, decision rights, risk-based model classification, transparency-driven guardrails, continuous evaluation, and clear approvals for new capabilities.

Without governance, fragility compounds across every layer.

A New Layer of Risk in the Software Ecosystem

Full-stack AI fragility adds cognitive vulnerabilities on top of traditional software and supply chain risks. A fully patched system can still fail if an AI agent misinterprets a prompt, degrades in response to irrelevant context, or improvises around constraints.

Because AI now underpins coding, system administration, workflow automation, and decision making, failures in these cognitive layers spread rapidly through interconnected environments.

Transparency is the prerequisite for governing this reality. Without visibility into model provenance, training data categories, alignment choices, and known triggers, organizations cannot test models meaningfully or deploy them responsibly.

We are not preparing for future fragility. We are managing fragility that already exists.

What Organizations Must Do Now

Organizations must build governance models that reflect how AI behaves today.

They must demand meaningful transparency from model providers. Teams need to understand training data categories, alignment methods, safety constraints, and contextual triggers to make informed decisions about model selection, risk classification, guardrails, and monitoring.

Transparency alone is insufficient. Organizations must also:

test for conditional insecurity across varied contextual cues
enforce strict trust boundaries through least privilege, sandboxing, privilege decay, and revocation mechanisms
red-team for emergent behavior and rule collisions
avoid monocultures to reduce synchronized model failure

Responsible deployment demands visibility, independent testing, and continuous oversight.

How Policymakers Can Strengthen the Ecosystem

Governments can stabilize the AI ecosystem by requiring transparency in procurement and encouraging it across markets. Agencies must understand the provenance and behavior of any model they deploy.

Policymakers can:

set transparency reporting standards
create certification programs
benchmark vendors for contextual and emergent reliability
require red-teaming for nontraditional risks
strengthen AI supply chain assurance
develop coherent permission frameworks for AI agents

Governments face the same fragility as private organizations, but with added responsibility to safeguard society.

The Path Forward

The most urgent AI risks are already visible in deployed systems. Conditional insecurity, weak trust boundaries, agent confusion, emergent behavior, and governance gaps form an architecture of systemic risk rooted in how AI is built and used.

If organizations and governments fail to adopt governance frameworks grounded in transparency, independent testing, and risk-based deployment, cascading failures will occur across digital services and critical infrastructure.

Full-stack AI fragility is here. The question is whether we build the capacity to meet it.

What’s your next move?
How will your organization turn awareness of AI’s fragility into governance that actually protects people, systems, and trust?

Discussion about this post

Ready for more?