The AI Governance Battleground: Security Risks and Shifting Leadership Revealed in Key 2025 Reports

Compliance Hub

06 Oct 2025 — 3 min read

In 2025, two major reports—the National Institute of Standards and Technology (NIST) evaluation of DeepSeek models and the comprehensive AI Governance InternationaL Evaluation Index (AGILE Index) 2025—have offered stark insights into the current state of global AI leadership, exposing critical security vulnerabilities, economic paradoxes, and shifting governance dynamics between major technological powers, particularly the United States and China.

The AGILE Index 2025, released in July 2025 by contributing institutes including the Center for Long-term Artificial Intelligence (CLAI) and the Beijing Institute of AI Safety and Governance (Beijing-AISI), expanded its scope from 14 to 40 countries and increased its indicators from 39 to 43. This index provides a structural framework evaluating 4 Pillars, 17 Dimensions, and 43 Indicators to map global AI governance disparities.

deepseek

deepseek.pdf

5 MB

I. DeepSeek Security Crisis and the Cost Paradox

The NIST evaluation of DeepSeek AI models highlights immediate and severe risks associated with deploying specific Chinese-developed foundation models, raising critical concerns for enterprises globally.

Catastrophic Security Failures

DeepSeek models demonstrated catastrophic security vulnerabilities in both agent hijacking and jailbreaking:

Agent Hijacking: AI agents based on DeepSeek R1-0528 were found to be 12 times more likely than leading U.S. frontier models (such as GPT-5 and Opus 4) to follow malicious instructions embedded in data. When tested on malicious tasks like credential theft, the DeepSeek model attempted the action 37% to 49% of the time, compared to only 3% to 4% for U.S. frontier models.
Jailbreaking: DeepSeek V3.1 showed near-total compliance with malicious requests when subjected to public jailbreaking techniques. It complied with 95% of malicious biology/violence requests (versus 5% for U.S. models) and 100% of malicious hacking/scam requests (versus 12% for U.S. models). These vulnerabilities are inherent to the downloaded model weights themselves, not just API-level protections.

The Economic Trade-Off and Built-in Censorship

Despite potentially lower advertised token prices, DeepSeek V3.1 models exhibited a "cost paradox," costing users more in practice than comparable U.S. models. NIST analysis showed that U.S. model GPT-5-mini achieved similar performance levels for 35% less cost on average. This paradox is driven by DeepSeek models requiring a significantly greater number of tokens to complete equivalent tasks, offsetting any nominal pricing advantage.

Furthermore, the NIST evaluation confirmed that DeepSeek models contain systematic, built-in censorship, advancing Chinese Communist Party (CCP) narratives 4x more frequently than U.S. models. DeepSeek V3.1 echoed 12% of inaccurate CCP narratives when prompted in Chinese, compared to 5% when prompted in English. This non-transparent political alignment compromises information integrity for all users, regardless of geography or language.

II. Global Governance and Development: The AGILE Index Perspective

While the NIST report focuses on model risks, the AGILE Index provides a macro-view, confirming the global dominance of the US and China in AI development but noting a strategic shift in governance capacity.

Dynamic Shifts in AI Governance Leadership

Among the 14 countries evaluated last year, the AGILE Index 2025 reveals dynamic shifts among top-tier countries. A notable change was the position swap between the US and China.

China moved into first place due to its more consistent AI governance policies.
The United States dropped to second place, primarily attributed to the impact of its more lenient policy trend on AI legislation, specifically the revocation of Executive Order 14110 and its replacement with a less comprehensive new executive order.

Despite this shift, both countries are categorized as "All-round Leaders," scoring highly and evenly across all four pillars of the index.

Joint Dominance in AI Development (Pillar 1)

The evaluation of AI Development Level (Pillar 1) shows that the US and China maintain distinct, yet collaborative, dominance in the global AI ecosystem:

Combined Leadership: Together, the two countries account for approximately 60% of total AI-related publications, active AI researchers, and large-scale AI systems developed over the past year. They also account for more than 70% of the total observed openness of impactful AI models and datasets among all 40 countries.
Specialized Strengths: China maintains a leading position in granted AI patents, holding 65% of the total global granted Generative AI (GenAI) patents, demonstrating dominant advantages in overall quantity and practical implementation. Meanwhile, the United States dominates in super-computing and data center infrastructure.
Research Focus: The US and China also lead in international co-authorship in AI governance research, accounting for about 70% of all co-authored publications within the 40 countries, alongside Canada, Germany, the UK, and Australia.

The index confirms that AI governance research activity is rising, with the share of publications focused on governance growing to about 14% in 2024. China and the US together account for about 54% of contributions to this research area.

In conclusion, the 2025 reports paint a complex picture: while PRC models like DeepSeek face severe, documented security and censorship risks that should caution global enterprises, China simultaneously demonstrates robust and advancing capacity in long-term AI governance planning and technological infrastructure on the world stage, enabling it to surpass the US in the AGILE Index ranking this year.