Positioning Paper定位论文

From AI Assistants to
Trusted Execution Infrastructure
从 AI 助手到
可信执行基础设施

Why enterprise AI requires governance, authorization, and execution control. 为什么企业部署 AI 需要治理、授权与执行控制。

Aiegis · Positioning Paper · 2026Aiegis · 定位论文 · 2026

Executive Summary

For the past three years, enterprises have adopted artificial intelligence at remarkable speed. Models summarize documents, answer questions, retrieve knowledge, draft content, and support decisions. These systems have created real value. Yet for all the investment and enthusiasm, the AI deployed inside most large organizations today remains, fundamentally, an advisor. It recommends. A human decides. A human acts.

The next phase of enterprise AI is different in kind, not merely in degree. Organizations increasingly expect AI not only to advise but to execute — to access systems, invoke tools, coordinate workflows, operate business applications, and collaborate with other agents. The moment an AI system begins to act rather than merely speak, the defining question changes. It is no longer can the AI do this? It is should the AI be allowed to do this — under what conditions, with which permissions, and under whose accountability?

That is not an intelligence question. It is an authority question. And it is the question the AI industry has, so far, largely declined to answer.

This paper makes a simple argument with significant consequences. The bottleneck to enterprise AI adoption is no longer model capability. It is the absence of a layer that can translate an organization's existing governance — its approval chains, its separations of duty, its information barriers, its compliance controls — into constraints that an autonomous system cannot bypass at the moment of execution. Identity systems answered who can access this resource? Agentic AI demands a new layer that answers why is this action occurring, is it authorized, does it comply, and can it be audited or revoked?

Policy Enforcement Architecture (PEA) was built to fill that gap. PEA is not a new model, not an agent framework, and not a governance policy framework. It is an AI security and authorization architecture that operationalizes enterprise governance inside the AI execution path. Its purpose is to transform AI from an advisory tool into trusted execution infrastructure — and, in doing so, to make possible what enterprises actually want but cannot yet safely have: the ability to delegate real authority to AI without surrendering control.

The chapters that follow develop this argument in five movements. Part I shows why the next enterprise AI crisis is one of authority, not intelligence. Part II identifies the missing layer between enterprise governance and AI execution. Part III explains the principle of trusted execution and why authorization must become infrastructure. Part IV reframes the entire problem as one of trusted delegation and introduces the PEA Trust Pyramid. Part V explains why existing solutions do not close the gap, and why enterprises in finance, healthcare, and government need PEA specifically.

Part I — The Enterprise AI Problem

1. The Next Enterprise AI Crisis Is Not Intelligence

The AI industry is organized around a single objective: making AI more intelligent. Every headline advance is measured in capability — better reasoning, longer context, stronger planning, more autonomous agents, more fluent tool use. This progress is real and important. But it addresses only one side of the enterprise equation, and not the side that is actually blocking adoption.

For most large organizations, intelligence is not the barrier. Trust is. A global bank does not refuse to deploy an AI system because the system cannot reason. It refuses because it cannot guarantee the system will stay inside organizational boundaries once it is allowed to act. The relevant uncertainty is not about the quality of the model's thinking. It is about the consequences of the model's doing.

Consider the distinction carefully, because the entire argument of this paper rests on it. Intelligence determines what an AI system is capable of doing. Governance determines what it is allowed to do. These are different properties, secured by different mechanisms, and they do not improve together. A more capable model is not a more governable one. In fact, the relationship often runs the other way: the more capable and autonomous the system, the more urgent the governance question becomes, because the same capability that creates value also expands the blast radius of an unauthorized action.

The challenge is not whether AI can access a database; it is whether AI should access this database in this context for this purpose. The challenge is not whether AI can invoke a workflow; it is whether that invocation complies with the organization's risk, compliance, and business-policy requirements. Today the industry has poured enormous effort into the first question in each pair and comparatively little into the second. As autonomy increases, that imbalance stops being an academic gap and starts being an operational hazard.

The uncomfortable truth is that an organization can possess the most capable AI in the world and still be unable to deploy it where it matters most — not because the AI is not smart enough, but because the enterprise cannot establish, verify, and bound the authority under which it would act. Capability has outrun control. Closing that distance is the work of the next decade of enterprise AI.

2. From Assistants to Executors

The first generation of enterprise AI can be described as the Assistant Era, and its operating model is straightforward: a human poses a question or a task, the AI produces a recommendation, and the human decides whether and how to act on it. The AI proposes; the person disposes. In this model, organizational accountability is essentially unchanged. The AI may be wrong, may hallucinate, may give incomplete advice — and the enterprise tolerates this, because the AI holds no execution authority. The last word, and the last responsibility, belong to a human.

This is precisely why enterprises have been comfortable deploying assistants at scale. ChatGPT Enterprise, Copilot, Gemini for Workspace, Claude for enterprise — all sit in the same structural position. Human → AI → Suggestion. The blast radius of an error is bounded by the human reviewer standing between the AI and the world.

The next generation changes that relationship at its root. Organizations increasingly want systems that can execute workflows, coordinate business processes, reach into enterprise applications, invoke APIs, perform operational tasks, and collaborate with other agents to pursue an objective with minimal human intervention. The operating model shifts to Human → AI → Action. At that moment the AI stops being a voice in the room and becomes a participant in the world. It begins to produce consequences directly.

This is not merely a technical evolution; it is a governance transition. When the human reviewer is removed from the inner loop — when the AI's output is no longer a suggestion to a person but a command to a system — the question the enterprise must answer changes entirely. Under the assistant model, the operative question was is the AI smart enough to be useful? Under the executor model, it becomes who authorized the AI to do this, and within what limits?

Existing AI architectures were never designed to answer that second question. They were designed to make the AI capable and to keep its language safe. They assume, implicitly, that a human stands at the boundary deciding what to do with the output. Remove that human, hand the output directly to systems of record, payment rails, customer data, and trading infrastructure, and the assumption silently fails. The enterprise has not gained a smarter employee. It has gained an actor whose authority no one has actually defined.

3. Why AI Safety Is Not Enough

Over the past several years the AI field has invested heavily and admirably in safety. Alignment, content filtering, prompt-injection defenses, jailbreak prevention, guardrails, constitutional methods, and model monitoring have all matured. These mechanisms address genuine risks; they help prevent harmful outputs, reduce misuse, and improve reliability. Nothing in this paper diminishes their importance. But they share an assumption that becomes insufficient the moment AI begins to act: that the primary risk originates in what the AI says.

For an advisory system, that assumption is reasonable. The dangerous output of an assistant is a sentence — a wrong fact, a toxic statement, a leaked secret rendered as text. So the safety stack is built to govern language: filter the output, constrain the generation, refuse the unsafe request. As long as a human stands between the sentence and any consequence, governing the sentence is enough.

For an executing system, the assumption breaks. As AI gains access to tools, applications, APIs, databases, and workflows, the most significant risks migrate away from language and toward execution. A model that produces an incorrect answer is, at worst, an inconvenience that a reviewer can catch. A model that performs an unauthorized action can create financial loss, a compliance violation, operational disruption, or regulatory exposure — and it can do so at machine speed, thousands of times, before any human review process begins.

The two regimes ask fundamentally different questions. Traditional AI safety asks: is the output harmful, is the response compliant, is the content appropriate? Enterprise AI execution must ask: is this action authorized, does it comply with governance policy, does it exceed delegated authority, can it be independently verified? The first set is about content. The second is about authorization and governance. No amount of progress on the first set answers the second.

This leads to the observation that reorganizes the entire security model: a perfectly aligned AI can still perform an unauthorized action. An AI does not have to be jailbroken, manipulated, or malicious to create catastrophic risk. It only has to act outside the organization's authority boundaries while behaving exactly as designed. A flawlessly aligned agent that exports a client list because nothing told it not to has caused a governance failure, not a safety failure — and the safety stack, by construction, never had the information needed to stop it. The challenge is no longer preventing harmful outputs. It is governing execution.

Part II — The Missing Layer

4. The Missing Layer: Enterprise Governance

A modern enterprise is not simply a collection of systems. It is a governance structure. Every large organization runs on a dense network of policies, responsibilities, approvals, restrictions, and controls: separation of duties, approval workflows, risk-management controls, compliance requirements, information-classification policies, data-access restrictions, regulatory obligations, information barriers — the "Chinese walls" that keep one part of a firm from seeing what another part knows — and audit requirements layered on top of all of it.

These mechanisms are not bureaucratic friction. They exist because organizations have learned, often expensively, that unrestricted authority creates unacceptable risk. So enterprises deliberately separate power. As a rule, no single actor is permitted to initiate, approve, execute, and audit the same action; those capabilities are split across people and departments precisely so that no one can act without a check. Governance, in this sense, is not an operational inconvenience sitting beside the security system. Governance is a core security mechanism.

Human employees operate inside these structures by default. Their authority is bounded by their role, their business processes, their management chain, and the accountability mechanisms that attach to a named person. A research analyst does not need to be reminded, every morning, that she may not move material non-public information across the wall into the banking side of the house — the constraint is woven into her role, her training, her supervision, and the consequences she would face.

Agentic AI breaks this default. An AI system may possess extraordinary technical capability, hold access to many tools, coordinate across systems, and interact with other agents — and yet none of those capabilities carries any inherent awareness of governance. The AI may know precisely how to perform an action without having any representation of whether it should. It does not read the policy manual. It did not attend the compliance training. It does not intuit the organizational boundary that every human employee absorbs as a condition of employment.

This produces the central structural fact of enterprise AI. Governance exists. Execution exists. But the connection between them is missing. Most systems on the market today connect AI to enterprise resources — they make the database reachable, the API callable, the workflow triggerable. Very few connect AI to enterprise governance — to the rules that determine when reaching, calling, and triggering are actually permitted. That missing connection is the gap this paper is about, and it is the gap that will determine which organizations can safely move AI from advice to action.

5. Governance That Cannot Be Enforced Is Not Governance

Enterprises pour enormous effort into governance. Policies are written, standards defined, controls documented, approval processes established, compliance requirements mandated, risk frameworks maintained. For decades these artifacts have served as the foundation of enterprise operations. But the arrival of agentic AI exposes a weakness that was always present and rarely tested: most governance exists as documentation, and organizations have quietly assumed that documented governance translates automatically into operational control.

For human participants, that assumption mostly holds. Employees can be trained on the policy. Managers can supervise adherence. Auditors can review after the fact. Compliance teams can investigate violations and impose consequences. Human governance leans heavily on behavioral enforcement — on the fact that people internalize rules, fear sanctions, and can be corrected. The written policy works because a socialized, accountable human stands behind it.

Autonomous systems dissolve that backing. An AI does not internalize a policy manual or fear a sanction. More to the point, it operates at a speed and scale that defeats after-the-fact control: an autonomous agent can execute thousands of actions before a human review process even begins, let alone concludes. Governance that depends entirely on post-hoc review — catch it in the quarterly audit, flag it in the monthly report — becomes structurally ineffective the moment the actor is a machine running continuously.

The result is a precise and dangerous mismatch. The organization possesses governance. The AI possesses execution capability. But no mechanism ensures that the governance actually shapes the execution. This is the governance gap, and it is worth stating its nature exactly: the problem is not the absence of governance. The problem is the absence of governance enforcement. A governance rule that cannot influence runtime behavior is merely documentation. A policy that cannot constrain execution is merely guidance. An approval requirement that automation can bypass is not, in any meaningful sense, a control.

The conclusion follows directly. As organizations adopt increasingly autonomous systems, governance can no longer live only in documents and human habits. It must become executable, enforceable, and part of runtime decision-making — present at the exact moment an action is about to occur, with the power to permit or deny it. Only then does autonomous execution remain tethered to organizational intent. This principle — that governance which cannot be enforced at runtime is not really governance — is one of the foundational ideas behind PEA. PEA does not create governance. It operationalizes it, converting documentation into runtime control.

6. Operationalizing Governance

If governance is to shape AI execution, the architecture itself must change, because historically governance and execution have lived in separate domains. Governance defined what should happen. Systems determined what could happen. Human beings stood in the middle, manually bridging the two — reading the policy, interpreting the situation, and choosing the compliant action. That arrangement is workable while humans remain in the loop. It becomes fragile in direct proportion to how much authority is delegated to machines, because the more an autonomous system does on its own, the less realistic it is to rely on a human to enforce the policy at each step.

From this a new requirement emerges, and it is worth stating plainly: governance must become machine-enforceable. Note what the requirement is not. It is not "write more policies." Most enterprises already have extensive, mature policy frameworks; the documents are not the deficiency. The deficiency is that those policies lose their grip the instant decisions and actions are taken by autonomous systems rather than by the trained, supervised humans the policies were written for.

PEA addresses this by introducing a governance-aware execution model whose central move is a reversal of the default. Conventional systems assume execution is permitted unless something stops it; the AI can act, has access, and therefore proceeds. PEA assumes the opposite: execution must be justified before it is permitted. Every action is evaluated against governance requirements before authority is granted, not flagged for review after the fact. The burden of proof inverts. Access is no longer self-justifying.

This changes the role governance plays inside the enterprise. Governance stops being an external auditing function that inspects the wreckage afterward and becomes an active participant in execution itself — a party at the table at the moment of action. Architecturally, the path shifts from Intent → Execution to Intent → Authorization → Execution. The change looks subtle on the page and is profound in practice. Authorization becomes the bridge between organizational governance and operational execution. Execution is no longer determined solely by technical capability; it is determined by organizational authority.

The strategic payoff is the resolution of a fear that has quietly limited AI adoption: that automation must mean a loss of governance. The opposite becomes possible. Rather than weakening governance through automation — letting the machine outrun the controls — organizations can strengthen governance through automation, because a rule enforced by the runtime is enforced uniformly, instantly, and without exception, in a way human diligence never achieves. That is the operational promise of PEA, and the bridge to the principle that follows.

Part III — Trusted Execution

7. The Principle of Trusted Execution

Most AI architectures are organized around capability. PEA is organized around authority, and that single reorientation changes the entire design philosophy. Conventional systems operate under an implicit assumption so familiar it is rarely examined: if an AI can perform an action and has technical access to the required resources, execution is generally permitted. Access is treated as its own justification. PEA rejects this assumption at the root. Capability does not imply authority. Access does not imply authorization. Execution does not imply legitimacy.

In its place PEA installs a single principle: every execution must be justified before it is permitted. This is not an exotic idea imported into the enterprise from computer science; it is exactly how modern organizations already operate with their own people. An employee may hold access to many systems, and yet that access does not, by itself, authorize every possible action those systems could perform. Real authorization depends on purpose, context, responsibility, policy, and governance requirements — the why and the under what conditions, not merely the can.

A worked example makes the principle concrete. Consider an equity research analyst at a large bank. As an individual, she legitimately holds access to a market-data terminal, parts of the CRM, the internal research repository, and corporate email. Now suppose an AI agent acts on her behalf to carry out the intent generate a market research report. Today's prevailing assumption is dangerous in its simplicity: if the user has the permissions, the AI inherits them. But the organization never intended the task "generate a research report" to carry the power to export the client list, reach into the investment-banking deal database, or email material non-public information outside the wall — even though the analyst, as a person, can touch some of those systems for other legitimate reasons.

This is the heart of the matter, and it can be written as a chain of inequalities: user permissions ≠ intent permissions ≠ execution permissions. What a person is allowed to do is not what a given task is allowed to do, and neither is automatically what a specific execution in a specific context is allowed to do. Most enterprise security architectures have never separated these three. PEA's notion of a minimal capability set — granting an intent only the narrow, bounded authority it actually requires, and no more — exists precisely to keep them apart.

The same discipline that governs the analyst must therefore govern the AI. An AI system should not act merely because it can; it should act only because it has been authorized to, for this purpose, within these limits. This reframing turns execution from a technical operation into a governed one, and it yields the mandatory ordering — Intent → Authorization → Execution — in which authorization is not a secondary check bolted on afterward but a prerequisite for action. That seemingly simple shift is the foundation for AI execution that is accountable, auditable, and governable by design rather than by hope.

8. Authorization as Infrastructure

Every major transition in enterprise computing has demanded new infrastructure, and the pattern is consistent enough to be predictive. The rise of the internet demanded network security infrastructure. The rise of distributed computing demanded identity infrastructure. The rise of cloud computing demanded trust infrastructure. Agentic AI demands the next entry in that lineage: authorization infrastructure. The requirement is not a marketing posture; it follows from the fact that agentic AI changes the nature of software itself. Traditional software executes predefined instructions. Agentic AI executes delegated objectives — it decides, mid-flight, which actions to take in pursuit of a goal.

That difference is decisive for security architecture. Traditional systems primarily needed to answer one question — who is requesting access? — and so enterprise security spent two decades maturing around identity. Identity and Access Management, Single Sign-On, Privileged Access Management, Public Key Infrastructure, and Zero Trust all emerged to solve variations on the same theme: who are you, may you access this resource, should this identity be trusted? These technologies genuinely transformed enterprise computing. But they were designed for a world in which humans were the primary actors and identity was a reliable proxy for intent — where knowing who was acting told you most of what you needed to know about why.

Agentic AI severs that proxy. In an AI-driven environment, the identity of the requester is frequently no longer sufficient to judge an action. A human may initiate an objective; an AI system may plan the execution; multiple agents may coordinate; tools may invoke further systems; workflows may cross organizational and even legal-entity boundaries. By the time an action reaches a system of record, the original human identity has been refracted through several layers of autonomous decision-making. Knowing who started the chain tells you little about whether this particular action, several steps later, is one the organization meant to permit.

So the question the infrastructure must answer changes. It is no longer only who is this? but why is this action occurring, for what purpose, under whose authority, and within which limits? Identity alone cannot answer those questions. An AI system may present valid credentials and hold legitimate access and still take an action that is, in substance, unauthorized. The challenge has moved from authentication to authorization — from establishing identity to establishing the legitimacy of a specific act.

This is the role PEA was designed to play. It treats authorization as a first-class architectural concern rather than an implementation detail buried inside each application. Instead of assuming that access implies authority, PEA requires authority to be independently established before execution occurs. In this model the layers compose cleanly: identity answers who is acting, authorization answers why the action is permitted, governance answers whether the action complies with organizational policy, and execution answers whether the action should proceed. Authority becomes explicit, verifiable, auditable, and revocable. Just as IAM became indispensable infrastructure for the internet era, authorization infrastructure will become indispensable for the agentic-AI era — and PEA is built to be that foundation. Under it, authority is not inherited; it is granted, constrained, monitored, accountable, and, above all, governed. That is the difference between an intelligent system and a trusted one.

9. Trusted Execution Infrastructure

The long-term future of enterprise AI is not conversational. It is operational. Organizations do not ultimately invest in AI to generate paragraphs; they invest in AI to improve decisions, automate processes, coordinate workflows, and carry out business functions. As AI becomes embedded in operations, execution becomes the primary source of value — and, inseparably, the primary source of risk. The same action that compresses a three-day process into three minutes is the action that, taken wrongly or without authority, moves money, exposes data, or breaches a regulation.

This is why model alignment, however good, cannot be the whole answer. Alignment shapes the disposition of the model; it cannot, on its own, guarantee that a specific action in a specific context falls within the authority the organization actually granted. Guaranteeing that requires infrastructure — a layer outside the model that establishes, checks, and bounds authority independently of whatever the model happens to decide. Just as enterprise computing evolved from isolated applications to identity-aware systems, agentic AI must evolve from capability-centric systems to governance-aware ones.

That evolution defines a new architectural category, which this paper calls Trusted Execution Infrastructure: the layer that ensures AI systems operate within authorized boundaries, remain accountable to enterprise governance, and maintain verifiable control over execution authority. PEA was created to occupy this category. Its purpose is not to make AI more intelligent; it is to make intelligent systems trustworthy enough to become operational infrastructure — dependable enough that an organization will route a real business process through them.

The distinction matters because it relocates the competitive frontier. The future of enterprise AI will not be decided solely by model capability, where the major labs are already converging and where further gains, while real, are increasingly commoditized. It will be decided by whether organizations can trust AI with authority — and that is a property of the surrounding architecture, not of the model weights. The enterprise that can safely delegate authority to a capable model will outrun the enterprise that merely possesses one. Trusted Execution Infrastructure is what makes that delegation safe, and the next part of this paper argues that trusted delegation, properly understood, is the real prize.

Part IV — Trusted Delegation

10. The Future of Agentic AI: Trusted Delegation

The history of enterprise technology can be read as a long history of delegation. Organizations grow by finding ways to hand responsibility down and out — safely and efficiently. Managers delegate to employees. Organizations delegate to departments. Businesses delegate to software systems. Each successful form of delegation rests on a single condition: trust. Delegation without trust creates risk; trust without control creates vulnerability. The whole apparatus of enterprise governance — assigned authority, defined responsibilities, required approvals, audited actions, preserved accountability — exists to make trusted delegation possible at scale. It is the machinery that lets a firm grow beyond the span of any one person's direct control.

Agentic AI introduces a genuinely new form of delegation, and this is what makes the present moment different from every prior wave of automation. For the first time, organizations are contemplating delegating not merely computation but execution — not just running a predetermined procedure, but handing over the latitude to decide which actions to take. Traditional software executes predefined instructions; you know in advance everything it can do. Agentic AI makes decisions, selects actions, coordinates resources, and pursues objectives in ways not fully specified ahead of time. The question this forces is unavoidable: can an organization safely delegate operational authority to a system whose specific actions it cannot fully predict?

That question is not, at its core, technical. It is a governance question, and it resolves into the conditions under which delegation remains safe. Without governance, delegation becomes dangerous. Without authorization, it becomes uncontrolled. Without accountability, it becomes unacceptable. This is why the future of enterprise AI depends on trusted delegation specifically — not on delegation alone, which is easy, but on delegation an organization can stand behind.

To delegate with confidence, an organization must be able to answer a precise set of questions about any authority it hands to AI: what authority has been delegated, under what conditions, for which purpose, within which boundaries, under whose accountability, and how it can be revoked. These are not peripheral governance niceties; they are prerequisites for enterprise-scale adoption. An organization that cannot answer them will not — and should not — route its core operations through an autonomous system, no matter how capable.

From this follows the thesis at the center of this paper. The next generation of AI systems will not be judged solely by intelligence. They will be judged by their ability to operate within delegated authority — to do exactly what they were entrusted to do, and nothing they were not. Stated as plainly as possible: the next frontier of AI is not intelligence; it is trusted delegation. The first decade of AI was about making machines capable. The decade now beginning is about making it safe to give those capable machines real authority — and that is a problem of governance, not of intelligence.

11. The PEA Trust Pyramid

The arc of this argument can be compressed into a single structure that organizes everything above. Call it the PEA Trust Pyramid. It has four layers, each resting on the one beneath it, ascending from a technical foundation to a business outcome:

Trusted Delegation

Governance

Authorization

Security

At the base sits Security, which answers can the AI operate safely? This is the layer the industry already understands well: protection against manipulation, prompt injection, jailbreaks, and the corruption of the system itself. It is necessary and it is foundational — but it is only the ground floor. A system can be perfectly secure in this sense and still do something the organization never authorized.

Above it sits Authorization, which answers does the AI hold legitimate authority to act? This is where capability stops being self-justifying. Authorization establishes that a specific action, in a specific context, falls within power that was actually granted — separating what the AI can do from what it is permitted to do. It is the layer most conspicuously missing from today's agent architectures, and the one PEA treats as a first-class concern.

Above that sits Governance, which answers does the AI's action comply with the organization's rules? Authorization establishes that authority was granted; governance establishes that exercising it here, now, in this way, is consistent with the enterprise's separations of duty, information barriers, compliance obligations, and risk controls. This is where the organization's decades of accumulated rules are brought to bear on the individual action — at runtime, with the power to permit or deny.

At the apex sits Trusted Delegation, which answers the only question an executive ultimately cares about: is the enterprise willing to hand real work and real authority to AI? This is not a technical property; it is a business outcome, and it is the outcome every layer beneath exists to produce. Security, authorization, and governance are means. Trusted delegation is the end. Enterprises do not ultimately buy security, or authorization, or even governance for their own sake — they buy the ability to say, with confidence, we are willing to let the AI do this.

The pyramid clarifies what PEA is and is not. PEA is not a governance policy framework; it does not write the organization's rules. It is the structure that carries an organization up the pyramid — taking the security the industry already provides, adding the authorization layer that is missing, enforcing the governance the organization already has, and thereby delivering the trusted delegation the enterprise actually wants. Each layer is a precondition for the one above; remove any and the apex collapses. That is why piecemeal solutions, however good at their own layer, cannot deliver the summit alone — the subject of the next part.

Part V — Why PEA

12. Why Existing Solutions Are Not Enough

It would be a mistake to read this paper as a critique of the existing AI safety and security stack. The mechanisms the field has built are valuable, and a serious enterprise deployment will use many of them. The point is narrower and more important: each of them solves a different problem, and none of them, alone or in combination, closes the gap between enterprise governance and AI execution. The following comparison is offered not to diminish these technologies but to locate them precisely.

SolutionWhat it primarily governs
Model alignmentThe disposition and behavior of the model
GuardrailsThe content of model outputs
AI firewall / gatewayThe traffic flowing to and from the model
IAM / SSOThe identity of the requester
Zero TrustAccess to resources and networks
PEAThe authority under which an action executes

Alignment shapes how a model is disposed to behave, but a perfectly aligned model can still take an action outside organizational authority. Guardrails govern what the model says, which is the wrong target once the risk lives in what the model does. AI firewalls and gateways govern the traffic between the model and the outside world — valuable for inspection and rate-limiting, but they reason about requests and responses, not about whether a given action is authorized for a given purpose under organizational policy. IAM and SSO govern identity, answering who is acting — indispensable, and exactly the layer this paper argues is no longer sufficient on its own, because in an agentic chain the identity at the start does not determine the legitimacy of the action at the end. Zero Trust governs access, continuously verifying that a requester should reach a resource — but reaching a resource is not the same as being authorized to perform a specific consequential action with it.

The pattern is consistent. Every existing layer reasons about something adjacent to the authorization question — the model, the output, the traffic, the identity, the access — and each is genuinely necessary. But the question should this specific action, with this purpose, in this context, proceed under the organization's governance? falls between all of them. It is nobody's job in the conventional stack. PEA exists to make it somebody's job — to treat execution authority as a distinct architectural concern with its own enforcement point. PEA does not replace the other layers; it completes the stack by adding the one that agentic AI made necessary and that nothing else provides.

13. Why Enterprises Need PEA

The abstract argument lands hardest when set inside a real organization, because the governance gap is not evenly distributed. It grows with complexity, with the sensitivity of the data, and with the severity of what a wrong action costs. The institutions where agentic AI promises the most value are precisely the ones where ungoverned execution is least tolerable — which is exactly why these institutions, despite abundant access to capable models, have been the slowest to grant them real authority.

Consider a global bank. It is not, in governance terms, a single company; it is an enormous governance system spanning dozens of lines of business, multiple countries, multiple legal entities, multiple regulatory regimes, and several identity sources at once. Its defining controls are not conveniences but obligations: material non-public information must be isolated; the information barrier between research and banking must hold; activity must remain auditable across jurisdictions. When such an institution imagines a research agent, a compliance agent, and a risk agent collaborating across these boundaries, its first question is not whether the agents are clever. It is whether each agent's authority can be bounded so that collaboration never becomes a path around the wall. The permissions an AI accumulates here are a cross-domain combination problem, not a simple role lookup — and that is the native problem PEA was built to solve.

Healthcare presents the same structure in a different vocabulary. Protected health information, HIPAA obligations, and the requirement that clinical actions trace to legitimate clinical authority mean that an agent able to read records, schedule procedures, or touch a clinical system cannot be governed by identity alone. The question is always whether this action, for this purpose, is authorized under the rules that protect the patient — exactly the question PEA forces to be answered before execution rather than discovered afterward.

Government and the public sector raise the stakes again: classified data, mission authority, and strict accountability mean that an autonomous system operating outside its delegated authority is not merely a compliance event but a security one. Here the ability to grant bounded authority, enforce it at runtime, audit it completely, and revoke it instantly is not a feature; it is a precondition for deployment at all.

Across all three, the conclusion is the same, and it is the conclusion of this entire paper. The largest barrier to enterprise AI adoption in these institutions is not the intelligence of the model. It is the absence of a way to grant AI execution authority without breaking the governance structures the organization has spent decades building. These organizations have never lacked intelligence. What they lack — what they have always lacked, and what PEA supplies — is intelligence they can trust to act.

14. Alignment Is Evidence, Not Authority

Section 3 argued that a perfectly aligned AI can still act outside its authority. A natural objection follows: won’t better alignment eventually close that gap? In June 2026, OpenAI published Reinforcement Learning Towards Broadly and Persistently Beneficial Models — the strongest empirical answer yet, and it points the other way.

The paper is real progress, and we say so plainly. Training models on beneficial traits — honesty, corrigibility, concern for human welfare — produced improvements that generalize across domains and persist under adversarial pressure. But note what it carefully does not claim. Its results are improvements, not guarantees: the model improved on 44 of 53 benchmarks — so nine did not; under adversarial pressure it was “harder to push,” not impossible; under harmful fine-tuning, “somewhat more resistant.” That the authors study persistence at all is an honest admission that alignment can drift under pressure. Every result is probabilistic.

Anyone who has worked in banking, insurance, or audit already thinks in one equation: Risk = Probability × Impact. Alignment attacks the first term, lowering the probability of harmful intent; it does not bound the second. Regulated, high-stakes deployment needs a worst-case bound, not an expected-case improvement — a bank cannot underwrite a fiduciary system on “44 of 53 benchmarks improved.” Lowering probability is necessary and valuable, but it is a different kind of object than bounding impact, and bounding impact is precisely what an authorization layer does.

The paper’s deeper finding cuts in governance’s favor. It builds on the result that models have personas — behavioral dispositions that are real, generalize across tasks, and can drift. Some read this as license to simply train the persona to be good and stop worrying. We read it the opposite way: the more real and generalizable a persona is, the less verifiable it becomes from the outside, and the less it can be trusted at the moment of execution. The paper does not weaken the assumption that a model should be untrusted at the boundary where it takes real-world action — it strengthens it.

This yields a principle worth stating plainly: Alignment is evidence, not authority. A model’s measured traits — its honesty scores, persona profiles, drift telemetry — are legitimate, useful evidence. They belong to observation: audit, monitoring, trust analysis, forensics. They should inform how closely a human watches an agent. They must never silently raise or lower what an agent is permitted to do; a bank does not skip an approval because a customer “seems honest.” Even the tempting move — a drift monitor that detects a model going off the rails and automatically tightens its permissions — must remain an open loop: raise an alert, trigger human review, pull an out-of-band kill-switch. The moment a model-derived score is wired into the authorization engine, an adversary who controls the model can spoof it, and the gate moves with it. Observation feeds humans; it does not feed the gate.

None of this diminishes alignment; it is a division of labor. Probability is the model builder’s to lower, and they are lowering it. Impact is governance’s to bound, and that task does not get easier as models improve — it gets more important, because more capable agents are trusted with more consequential actions. Alignment shapes what an agent wants to do; governance determines what it is allowed to do. Enterprises need both, and they must control them independently.

15. Conclusion

The AI industry has made extraordinary progress on capability. Models reason, agents plan, systems coordinate, tools execute. Yet one challenge remains unresolved, and it is the one that now governs whether all that capability can reach the places where it would matter most. Enterprises do not merely need intelligent systems. They need governable ones.

The transition from AI assistants to AI executors changes the fundamental requirements of enterprise architecture. The moment AI gains the ability to act, governance can no longer remain external to execution, reviewing the record after the fact. Governance must become operational. Authorization must become enforceable. Execution must become accountable. These are not refinements of the safety agenda; they are a different agenda, and the safety stack was never designed to carry it.

This is the work PEA was built to do. PEA does not replace enterprise governance, compliance frameworks, or organizational controls; it provides the missing connection between governance and execution. It transforms governance from policy into runtime control, authorization from documentation into enforcement, and execution from a raw technical capability into a trusted organizational function. In doing so it introduces a principle for the agentic-AI era that the preceding chapters have built toward step by step: intelligence alone is not enough; authority must be governed, execution must be justified, and trust must be engineered rather than assumed.

The future of enterprise AI will not be defined solely by how intelligent AI becomes. It will be defined by whether organizations can safely entrust AI with authority. That is the transition named in this paper's title — from AI assistants to trusted execution infrastructure — and it is the transition PEA exists to make possible.

PEA transforms enterprise governance from policy into runtime control, enabling the trusted delegation of authority to AI — and with it, the move from AI assistants to trusted execution infrastructure.

摘要

过去三年,企业部署 AI 的速度令人惊叹。模型可以总结文档、回答问题、检索知识、起草内容、辅助决策,并已创造出真实的价值。然而,尽管投入巨大、热情高涨,今天绝大多数大型组织内部部署的 AI,本质上仍然只是一个顾问:它提出建议,由人来决策,由人来执行。

企业级 AI 的下一阶段,与今天有着本质区别,而不仅仅是程度上的不同。组织越来越希望 AI 不只是建议,而是能够执行——访问系统、调用工具、编排工作流、操作业务应用,并与其他 Agent 协作。当一个 AI 系统开始"行动"而不只是"说话"的那一刻,最关键的问题就变了。问题不再是"AI 能不能做到?",而是"AI 是否应该被允许这么做——在什么条件下、拥有哪些权限、由谁来承担责任?"

这不是一个智能问题,而是一个权力问题。而这恰恰是迄今为止整个 AI 行业基本回避的问题。

本文提出一个简单却影响深远的论点:企业采用 AI 的瓶颈,已经不再是模型能力,而是缺少一个层——一个能够把组织已有的治理(审批链、职责分离、信息隔离墙、合规控制)转化为自治系统在执行那一刻无法绕过的约束的层。身份系统回答的是"谁可以访问这个资源?";而 Agentic AI 需要一个新的层,去回答"这个行为为什么发生、是否被授权、是否合规、能否被审计或撤销?"

策略执行架构(Policy Enforcement Architecture,PEA)正是为填补这一空白而生。PEA 不是一个新的大模型,不是一个 Agent 框架,也不是一个治理政策框架。它是一种 AI 安全与授权架构,把企业治理操作化(operationalize)地嵌入到 AI 的执行路径之中。它的目标,是把 AI 从一个辅助工具升级为可信执行基础设施(Trusted Execution Infrastructure)——并由此让企业真正想要、却尚无法安全获得的东西成为可能:在不丧失控制权的前提下,把真正的权力委托给 AI。

后续章节将分五个部分展开这一论点。第一部分论证:企业级 AI 的下一场危机,关乎权力,而非智能。第二部分指出:企业治理与 AI 执行之间,存在一个缺失的层。第三部分阐释"可信执行"原则,以及为什么授权必须成为一种基础设施。第四部分把整个问题重构为"可信委托",并提出 PEA 信任金字塔。第五部分说明现有方案为何无法弥合这一缺口,以及金融、医疗与政府机构为何尤其需要 PEA。

第一部分 —— 企业级 AI 的真正难题

1. 企业级 AI 的下一场危机不是智能

整个 AI 行业都围绕一个目标运转:让 AI 更聪明。每一次被大书特书的进步,衡量的都是能力——更强的推理、更长的上下文、更强的规划、更自治的 Agent、更流畅的工具调用。这些进步是真实而重要的。但它们只回应了企业问题的一面,而且不是真正在阻碍落地的那一面。

对大多数大型组织而言,智能并不是障碍,信任才是。一家全球性银行拒绝部署某个 AI 系统,不是因为它不会推理,而是因为它无法保证:一旦允许这个系统行动,它会始终待在组织边界之内。真正的不确定性,与模型思考的质量无关,而与模型"动手"的后果有关。

请仔细体会这个区别,因为本文的整个论证都建立在它之上。智能决定一个 AI 系统"能够"做什么;治理决定它"被允许"做什么。这是两种不同的属性,由不同的机制保障,而且并不会同步提升。一个能力更强的模型,并不会因此成为一个更可治理的模型。事实上,二者的关系往往相反:系统越强大、越自治,治理问题就越迫切——因为创造价值的那份能力,也同样放大了一次越权行为的破坏半径。

难题不在于 AI 能否访问一个数据库,而在于在"这个上下文、为这个目的"之下,它是否应该访问"这个"数据库;不在于 AI 能否调用一个工作流,而在于这次调用是否符合组织的风险、合规与业务政策要求。今天,行业把巨大的精力投向每一组问题中的前者,对后者的投入却相形见绌。随着自治程度提升,这种失衡就不再是一个学术上的空白,而会变成一种现实的运营风险。

一个令人不安的事实是:一个组织可以拥有全世界最强的 AI,却依然无法把它部署到最关键的地方——不是因为 AI 不够聪明,而是因为企业无法确立、验证并约束它行动所依据的权力。能力已经跑在了控制的前面。弥合这段距离,正是企业级 AI 下一个十年的工作。

2. 从助手到执行者

第一代企业级 AI 可以被称为"助手时代",它的运作模式十分简单:人提出一个问题或任务,AI 给出建议,由人决定是否以及如何据此行动。AI 提议,人裁决。在这种模式下,组织的问责结构基本没有改变。AI 可能出错、可能产生幻觉、可能给出不完整的建议——而企业能够容忍这些,因为 AI 不掌握执行权。最终的发言权和最终的责任,都归于人。

正因如此,企业才愿意大规模部署助手。ChatGPT Enterprise、Copilot、Gemini for Workspace、面向企业的 Claude——它们在结构上处于同一个位置:人 → AI → 建议。错误的破坏半径,被夹在 AI 与现实世界之间的那个人类审阅者牢牢限制住。

下一代则从根本上改变了这种关系。组织越来越希望系统能够执行工作流、编排业务流程、深入企业应用、调用 API、完成运营任务,并与其他 Agent 协作,以最少的人为介入去追求一个目标。运作模式于是变成:人 → AI → 行动。在这一刻,AI 不再只是房间里的一个声音,而成为现实世界中的一个参与者,开始直接产生后果。

这不仅是一次技术演进,更是一次治理转折。当人类审阅者被移出内层闭环——当 AI 的输出不再是给某个人的建议,而是给某个系统的指令——企业必须回答的问题就彻底变了。在助手模式下,关键问题是"AI 是否足够聪明、足够有用?";在执行者模式下,问题变成了"谁授权 AI 这么做,边界又在哪里?"

现有的 AI 架构从未被设计来回答这第二个问题。它们被设计用来让 AI 更有能力、让它的语言更安全;它们隐含地假设:有一个人站在边界处,决定如何处理这些输出。一旦移除这个人,把输出直接交给业务系统、支付通道、客户数据和交易基础设施,这个假设就悄无声息地失效了。企业并没有获得一个更聪明的员工,而是获得了一个谁都没有真正定义过其权力边界的行动者。

3. 为什么 AI 安全还不够

过去几年,AI 领域在安全上投入巨大,也成果斐然。对齐(alignment)、内容过滤、提示注入防御、越狱防护、护栏(guardrails)、宪法式方法、模型监控,都已日趋成熟。这些机制应对的是真实的风险,它们有助于防止有害输出、减少滥用、提升可靠性。本文丝毫无意贬低它们的重要性。但它们共享一个假设——这个假设在 AI 开始行动的那一刻就变得不再充分:风险主要来自 AI"说了什么"。

对一个顾问型系统而言,这个假设是合理的。助手的危险输出是一句话——一个错误的事实、一句有害的言论、一段以文本形式泄露的机密。于是整套安全机制都建立在治理"语言"之上:过滤输出、约束生成、拒绝不安全的请求。只要有人站在"那句话"与任何后果之间,治理"那句话"就足够了。

但对一个执行型系统而言,这个假设就崩塌了。当 AI 获得了对工具、应用、API、数据库和工作流的访问权,最重大的风险便从语言转向了执行。一个给出错误答案的模型,最坏不过是个能被审阅者拦下的麻烦;而一个执行了越权操作的模型,可能造成财务损失、合规违规、运营中断或监管风险——而且它能以机器的速度,在任何人类审阅流程开始之前,重复这样做成千上万次。

两种范式提出的是根本不同的问题。传统 AI 安全问的是:输出是否有害?回应是否合规?内容是否得体?而企业级 AI 执行必须问:这个行为是否被授权?它是否符合治理政策?它是否超出了被委托的权力?它能否被独立验证?前一组问题关乎内容,后一组关乎授权与治理。在前一组问题上取得再多进展,也回答不了后一组。

这引出了那个足以重组整个安全模型的论断:一个完美对齐的 AI,依然可能执行一次越权操作。AI 无需被越狱、被操纵或心怀恶意,就能制造灾难性风险。它只需在完全按设计行事的同时,越出了组织的权力边界。一个对齐得无可挑剔的 Agent,仅仅因为没有任何东西告诉它"不要这么做",就导出了一份客户名单——它造成的是一次治理失败,而不是一次安全失败;而安全机制,从其构造上看,根本就没有拦下它所需的信息。难题已经不再是防止有害输出,而是治理执行。

第二部分 —— 那个缺失的层

4. 缺失的层:企业治理

一家现代企业,并不只是一堆系统的集合,它是一个治理结构。每一个大型组织,都运行在一张由政策、职责、审批、限制和控制交织而成的密网之上:职责分离、审批流程、风险管理控制、合规要求、信息分级政策、数据访问限制、监管义务、信息隔离墙("中国墙",让公司一部分看不到另一部分所知的信息),以及覆盖于这一切之上的审计要求。

这些机制不是官僚式的摩擦。它们之所以存在,是因为组织往往以高昂代价学到了一个教训:不受约束的权力会带来无法接受的风险。于是企业刻意地分割权力。原则上,没有任何一个行为者被允许同时发起、审批、执行并审计同一个行为;这些能力被拆分到不同的人和部门,正是为了让任何人都无法在没有制衡的情况下行动。从这个意义上说,治理并不是安全系统旁边的运营负担——治理本身就是一种核心安全机制。

人类员工天然在这些结构内部运作。他们的权力被角色、业务流程、管理链,以及附着在一个具名个人身上的问责机制所约束。一位研究分析师,不需要每天早上被提醒她不得把重大非公开信息(MNPI)越过隔离墙带到投行一侧——这条约束已经被编织进她的角色、培训、监管,以及她将要承担的后果之中。

Agentic AI 打破了这种默认状态。一个 AI 系统可能拥有非凡的技术能力、握有多种工具、能跨系统编排、能与其他 Agent 交互——然而这些能力之中,没有任何一项天然携带对治理的认知。AI 可能精确地知道"如何"执行一个动作,却对自己"是否应该"执行毫无表征。它不读政策手册,它没参加合规培训,它无法直觉地感知那条每个人类员工作为受雇前提而吸收的组织边界。

这就构成了企业级 AI 最核心的结构性事实:治理存在,执行存在,但二者之间的连接缺失了。今天市场上绝大多数系统,连接的是 AI 与企业"资源"——让数据库可达、让 API 可调、让工作流可触发;极少有系统连接 AI 与企业"治理"——连接那些决定"何时可达、可调、可触发"的规则。这条缺失的连接,正是本文所关注的空白,也是决定哪些组织能够安全地把 AI 从建议推向行动的关键所在。

5. 无法被强制执行的治理,不是治理

企业在治理上投入巨大。政策被写下,标准被定义,控制被记录,审批流程被建立,合规要求被强制,风险框架被维护。数十年来,这些产物一直是企业运营的基石。但 Agentic AI 的到来,暴露了一个一直存在、却很少被真正检验的弱点:大多数治理是以"文档"的形式存在的,而组织默默假设,写下来的治理会自动转化为运营层面的控制。

对人类参与者,这个假设大体成立。员工可以被培训去执行政策,管理者可以监督其遵守,审计师可以事后审查,合规团队可以调查违规并施加后果。人类治理在很大程度上依赖于"行为层面"的强制——依赖于人会内化规则、会畏惧惩罚、会被纠正。书面政策之所以管用,是因为有一个被社会化、可被问责的人站在它背后。

自治系统抽走了这个支撑。AI 不会内化政策手册,也不会畏惧惩罚。更关键的是,它以一种击穿事后控制的速度和规模运行:一个自治 Agent 可以在人类审阅流程开始之前——更不用说结束之前——就执行成千上万次操作。完全依赖事后审查的治理(季度审计时抓出来、月报里标记出来),在行为者变成一台持续运转的机器的那一刻,就在结构上失效了。

结果是一种精确而危险的错配:组织拥有治理,AI 拥有执行能力,但没有任何机制确保治理真正塑造执行。这就是治理鸿沟(governance gap)。它的本质值得被精确陈述:问题不在于缺少治理,而在于缺少治理的"强制执行"。一条无法影响运行时行为的治理规则,只是文档;一项无法约束执行的政策,只是指引;一个可以被自动化绕过的审批要求,在任何有意义的层面上都算不上一种控制。

结论由此直接得出:随着组织部署越来越自治的系统,治理不能再只活在文档和人的习惯里。它必须成为可执行、可强制的,并成为运行时决策的一部分——就在一个动作即将发生的那一刻在场,并拥有许可或拒绝它的权力。唯有如此,自治执行才能始终系于组织的意图。这条原则——无法在运行时被强制执行的治理,就不是真正的治理——正是 PEA 背后的奠基性思想之一。PEA 不创造治理,它把治理操作化,把文档转化为运行时控制。

6. 把治理操作化

如果治理要去塑造 AI 的执行,架构本身就必须改变,因为在历史上,治理与执行一直分处两个领域。治理定义"应该"发生什么,系统决定"能够"发生什么,而人站在中间,手动地弥合二者——读政策、解读情境、选择合规的行动。只要人还在闭环里,这套安排就行得通;但它会随着越来越多的权力被委托给机器而变得脆弱,因为一个自治系统自主完成的越多,依赖人在每一步去强制执行政策就越不现实。

由此浮现出一个新的要求,值得直白地讲出来:治理必须变得可被机器强制执行。请注意这个要求"不是"什么——它不是"多写些政策"。大多数企业早已拥有庞大而成熟的政策框架,文档并不是短板。短板在于:当决策与行动由自治系统、而非由那些政策当初为之而写的、受过训练并被监管的人类来做出时,这些政策就失去了抓地力。

PEA 通过引入一种"治理感知的执行模型"来应对这一挑战,其核心动作是对默认状态的反转。常规系统假设:除非有东西阻止,否则执行就是被允许的——AI 能行动、有访问权,于是就继续下去。PEA 假设相反:执行必须先被"证成"(justified),才能被允许。每一个动作,都要在权力被授予"之前"对照治理要求接受评估,而不是事后被标记待审。举证责任发生了反转,访问不再自我证成。

这改变了治理在企业系统内部所扮演的角色。治理不再是一个事后查看残局的外部审计职能,而成为执行本身的一个主动参与者——在行动发生的那一刻就坐在桌前的一方。在架构上,路径从 意图 → 执行 转变为 意图 → 授权 → 执行。这个改变在纸面上看似细微,在实践中却影响深远。授权成为组织治理与运营执行之间的桥梁。执行不再仅由技术能力决定,而是由组织权力决定。

其战略回报,是化解了一个长期悄悄限制 AI 落地的恐惧:自动化必然意味着治理的失控。相反的可能性出现了:组织不必通过自动化"削弱"治理(让机器跑过控制),而可以通过自动化"强化"治理——因为一条由运行时强制执行的规则,会被一致地、即时地、无例外地执行,其程度是人类的勤勉永远达不到的。这正是 PEA 的运营承诺,也通向下一章的原则。

第三部分 —— 可信执行

7. 可信执行原则

大多数 AI 架构以能力为中心来组织,PEA 则以权力为中心,而正是这一次重新定向,改变了整个设计哲学。常规系统运行在一个熟悉到几乎无人审视的隐含假设之上:只要 AI 能够执行某个动作,并且对所需资源拥有技术访问权,执行通常就被允许。访问被当作了它自身的证成。PEA 从根上拒绝这个假设:能力不等于权力,访问不等于授权,执行不等于正当。

取而代之,PEA 安装了一条单一原则:每一次执行,都必须在被允许之前先被证成。这不是一个从计算机科学引入企业的异质想法;它恰恰就是现代组织管理自己员工的方式。一名员工可能拥有对多个系统的访问权,但这份访问权本身,并不授权这些系统所能执行的每一个可能动作。真正的授权取决于目的、上下文、职责、政策与治理要求——取决于"为什么"和"在什么条件下",而不仅仅是"能不能"。

一个具体的例子能让这条原则落地。设想一家大型银行的股票研究分析师。作为个人,她合法地拥有对行情终端、CRM 部分模块、内部研究库以及公司邮箱的访问权。现在假设有一个 AI Agent 代表她去执行"生成一份市场研究报告"这个意图。今天流行的假设,危险就危险在它的简单:用户有权限,AI 就继承这些权限。但组织从未打算让"生成研究报告"这个"任务"携带导出客户名单、深入投行交易数据库、或把 MNPI 越过隔离墙发送出去的权力——尽管这位分析师作为个人,出于其他正当理由可以触及其中某些系统。

这正是问题的核心,它可以写成一串不等式:用户权限 ≠ 意图权限 ≠ 执行权限。一个人被允许做什么,不等于某个特定任务被允许做什么,二者也都不自动等于某个特定上下文中的某次具体执行被允许做什么。大多数企业安全架构从未把这三者分开。PEA 的"最小能力集"(minimal capability set)思想——只授予一个意图它真正所需的、狭窄而有界的权力,绝不多给——正是为了把它们隔开而存在。

因此,那条约束分析师的纪律,也必须约束 AI。一个 AI 系统不应仅仅因为"能"就行动;它只应因为"已被授权"——为这个目的、在这些边界之内——而行动。这次重构,把执行从一个技术操作变成了一个被治理的操作,并由此得出那个强制性的次序:意图 → 授权 → 执行,其中授权不是事后加装的次级检查,而是行动的前提。这个看似简单的转变,正是让 AI 执行从设计上就可问责、可审计、可治理(而不是靠运气)的基础。

8. 把授权变成基础设施

企业计算的每一次重大转型,都要求新的基础设施,而这个规律一致到足以用来预测。互联网的兴起,要求网络安全基础设施;分布式计算的兴起,要求身份基础设施;云计算的兴起,要求信任基础设施。Agentic AI 要求的,是这条谱系上的下一项:授权基础设施。这个要求不是营销姿态,而是源于一个事实——Agentic AI 改变了软件本身的性质。传统软件执行预先定义好的指令;Agentic AI 执行被委托的目标——它在过程之中,临机决定为达成目标该采取哪些动作。

这个区别对安全架构是决定性的。传统系统主要只需回答一个问题——"是谁在请求访问?"——于是企业安全用了二十年围绕"身份"走向成熟。身份与访问管理(IAM)、单点登录(SSO)、特权访问管理(PAM)、公钥基础设施(PKI)、零信任(Zero Trust),都是为解决同一个主题的变体而出现的:你是谁,你可否访问这个资源,这个身份是否可信。这些技术确实改造了企业计算。但它们是为一个"人是主要行动者、身份是意图的可靠代理"的世界而设计的——在那个世界里,知道"谁"在行动,就基本知道了关于"为什么"你所需了解的大部分。

Agentic AI 切断了这种代理关系。在一个由 AI 驱动的环境里,请求者的身份常常已不足以判断一个行为。人可能发起一个目标,AI 系统可能规划其执行,多个 Agent 可能协同,工具可能调用更多系统,工作流可能跨越组织乃至法人实体的边界。等到一个动作抵达业务系统时,最初那个人类身份已经穿过了好几层自治决策的折射。知道是"谁"开启了这条链,几乎无法告诉你:几步之后的"这个具体动作",是不是组织本意要允许的那一个。

于是,基础设施必须回答的问题变了。它不再仅仅是"这是谁?",而是"这个行为为什么发生、为了什么目的、由谁授权、在哪些边界之内?"。仅靠身份回答不了这些问题。一个 AI 系统可能出示有效凭证、握有合法访问权,却依然实施了一个在实质上未被授权的动作。挑战已经从认证(authentication)转向了授权(authorization)——从确立身份,转向确立某个具体行为的正当性。

这正是 PEA 被设计来扮演的角色。它把授权当作一等的架构关切,而不是埋在每个应用内部的实现细节。PEA 不假设访问意味着权力,而是要求权力在执行发生之前被独立确立。在这个模型里,各层清晰地组合在一起:身份回答"谁在行动",授权回答"为什么这个行为被允许",治理回答"这个行为是否符合组织政策",执行回答"这个行为是否应当进行"。权力因此变得显式、可验证、可审计、可撤销。正如 IAM 成为互联网时代不可或缺的基础设施,授权基础设施也将成为 Agentic AI 时代不可或缺的基础设施——而 PEA 正是为成为这一基石而建。在它之下,权力不被继承,而是被授予、被约束、被监控、被问责,并且首要地,被治理。这就是一个"聪明的系统"与一个"可信的系统"之间的区别。

9. 可信执行基础设施

企业级 AI 的长期未来不是对话式的,而是运营式的。组织最终投资 AI,不是为了生成段落,而是为了改进决策、自动化流程、编排工作流、执行业务职能。当 AI 嵌入运营,"执行"就成为价值的首要来源——并且不可分割地,成为风险的首要来源。那个把三天流程压缩到三分钟的动作,正是那个一旦做错、或在无权之下做出,就会动用资金、暴露数据、违反监管的动作。

正因如此,模型对齐——无论多么出色——都不可能是全部答案。对齐塑造的是模型的倾向;它本身无法保证某个特定上下文中的某个特定动作,落在组织真正授予的权力之内。要保证这一点,需要基础设施——一个在模型之外、独立于模型临机所做任何决定来确立、检查并约束权力的层。正如企业计算从孤立的应用演进为身份感知的系统,Agentic AI 也必须从以能力为中心的系统,演进为以治理为中心的系统。

这次演进定义了一个新的架构范畴,本文称之为可信执行基础设施(Trusted Execution Infrastructure):确保 AI 系统在被授权的边界内运行、对企业治理保持可问责、并对执行权保持可验证控制的那个层。PEA 正是为占据这一范畴而生。它的目的不是让 AI 更聪明,而是让聪明的系统变得足够可信,从而能够成为运营基础设施——可靠到一个组织愿意让一条真实的业务流程从它身上经过。

这个区别之所以重要,是因为它重置了竞争的前沿。企业级 AI 的未来,不会仅由模型能力决定——在那条赛道上,头部实验室已经趋同,进一步的提升虽真实却日益商品化。它将由组织能否"信任"AI 去掌握权力来决定——而这是一种关乎周边架构、而非模型权重的属性。那个能够安全地把权力委托给一个强大模型的企业,会跑赢那个仅仅"拥有"一个强大模型的企业。可信执行基础设施,正是让这种委托变得安全的东西;而本文下一部分将论证:被正确理解的"可信委托",才是真正的奖品。

第四部分 —— 可信委托

10. Agentic AI 的未来:可信委托

企业技术的历史,可以被读作一部漫长的委托史。组织通过不断找到把责任向下、向外安全而高效地传递的方法来成长。管理者把责任委托给员工,组织把责任委托给部门,企业把责任委托给软件系统。每一种成功的委托,都建立在同一个条件之上:信任。没有信任的委托制造风险,没有控制的信任制造脆弱。整套企业治理机器——被指派的权力、被定义的职责、被要求的审批、被审计的行为、被保全的问责——之所以存在,正是为了让"可信委托"在规模上成为可能。它正是让一家公司能够超越任何一个人直接掌控范围而成长的那套装置。

Agentic AI 引入了一种真正全新的委托形态,而这正是当下这一刻与以往每一波自动化都不同的地方。组织第一次开始考虑:委托给机器的,不再只是"计算",而是"执行"——不只是运行一段预定的程序,而是交出"决定采取哪些动作"的余地。传统软件执行预先定义好的指令,你事先就知道它所能做的一切;Agentic AI 则以并未被事先完全规定的方式做决策、选动作、调资源、追目标。这逼出了一个无法回避的问题:一个组织,能否安全地把运营权力委托给一个它无法完全预测其具体动作的系统?

这个问题在其核心处并不是技术问题,而是治理问题,并最终归结为"委托在什么条件下仍然安全"。没有治理,委托变得危险;没有授权,委托变得失控;没有问责,委托变得无法接受。这正是为什么企业级 AI 的未来,特别地取决于"可信"委托——而不只是委托本身(那很容易),而是一个组织敢于为之背书的委托。

要有信心地委托,一个组织必须能够就它交给 AI 的任何权力,回答一组精确的问题:被委托的是什么权力、在什么条件下、为哪个目的、在哪些边界之内、由谁来问责、又如何被撤销。这些不是治理上无关紧要的修饰,而是企业规模化采用的前提。一个回答不了这些问题的组织,不会——也不应——让它的核心运营从一个自治系统身上经过,无论那个系统多么强大。

由此引出本文核心的论点:下一代 AI 系统,不会仅以智能被评判,而会以它们"在被委托的权力之内行动"的能力被评判——恰好做它们被托付去做的事,绝不做它们未被托付的事。用最直白的话说:AI 的下一个前沿不是智能,而是可信委托。AI 的第一个十年,是关于让机器有能力;现在开启的这个十年,是关于让"把真正的权力交给这些有能力的机器"变得安全——而这是一个治理问题,不是一个智能问题。

11. PEA 信任金字塔

以上整个论证的脉络,可以被压缩进一个统领全局的结构。我们称之为 PEA 信任金字塔。它有四层,每一层都坐落在下一层之上,从一个技术地基拾级而上,直至一个商业成果:

可信委托(Trusted Delegation)

治理(Governance)

授权(Authorization)

安全(Security)

最底层是安全,它回答"AI 能否安全地运行?"。这是行业已经理解得相当透彻的一层:抵御操纵、提示注入、越狱,以及系统本身被污染。它是必要的,是地基——但它只是底楼。一个系统可以在这个意义上完美安全,却依然做出某件组织从未授权的事。

其上是授权,它回答"AI 是否拥有行动的合法权力?"。这是能力不再自我证成的地方。授权确立:某个特定上下文中的某个特定动作,落在真正被授予的权力之内——把 AI"能"做的,与它"被允许"做的分开。这是今天 Agent 架构中最显眼缺失的一层,也是 PEA 当作一等关切来对待的一层。

再上是治理,它回答"AI 的行为是否符合组织的规则?"。授权确立了权力已被授予;治理则确立:在此时、此地、以这种方式行使它,是否与企业的职责分离、信息隔离墙、合规义务和风险控制相一致。这是组织数十年累积的规则被作用于具体动作之处——在运行时,拥有许可或拒绝的权力。

顶端是可信委托,它回答那个高管最终唯一在意的问题:"企业是否愿意把真正的工作和真正的权力交给 AI?"。这不是一种技术属性,而是一个商业成果,也是其下每一层之所以存在所要产出的成果。安全、授权、治理都是手段,可信委托才是目的。企业最终买的,不是安全、不是授权、甚至不是治理本身——它们买的,是能够有信心地说出"我们愿意让 AI 去做这件事"的那份能力。

这座金字塔厘清了 PEA 是什么、不是什么。PEA 不是一个治理政策框架,它不书写组织的规则。它是把一个组织托举着拾级而上的那个结构——接过行业已经提供的安全,补上缺失的授权层,强制执行组织已经拥有的治理,并由此交付企业真正想要的可信委托。每一层都是其上一层的前提;抽掉任何一层,顶端就坍塌。这正是为什么各自为政的方案,无论在自己那一层多么出色,都无法独自交付这座顶峰——而这正是下一部分的主题。

第五部分 —— 为什么是 PEA

12. 为什么现有方案还不够

把本文读成对现有 AI 安全与安全堆栈的批判,将是一个误读。这个领域所构建的机制是有价值的,一次严肃的企业部署会用到其中许多。本文的论点更窄、也更重要:它们各自解决的是不同的问题,而无论单独还是组合,它们都无法弥合企业治理与 AI 执行之间的那道鸿沟。下面这张对照表,不是为了贬低这些技术,而是为了把它们精确地定位。

方案它主要治理的对象
模型对齐(Alignment)模型的倾向与行为
护栏(Guardrails)模型输出的内容
AI 防火墙 / 网关流向与流出模型的流量
IAM / SSO请求者的身份
零信任(Zero Trust)对资源与网络的访问
PEA一个行为据以执行的权力

对齐塑造模型倾向于如何行事,但一个完美对齐的模型,依然可能做出越出组织权力的动作。护栏治理模型"说什么",可一旦风险落在模型"做什么"之上,这就瞄错了目标。AI 防火墙与网关治理模型与外界之间的流量——对于检查和限流很有价值,但它们推理的是请求与响应,而不是某个具体动作是否在组织政策下、为某个目的而被授权。IAM 与 SSO 治理身份,回答"谁在行动"——不可或缺,也正是本文论证其已不再单独充分的那一层,因为在一条 Agentic 链中,起点的身份并不决定终点那个动作的正当性。零信任治理访问,持续验证一个请求者是否应当抵达一个资源——但"抵达一个资源",与"被授权用它去实施某个有后果的具体动作",并不是一回事。

规律是一致的。每一个现有的层,推理的都是某个"邻近于"授权问题的东西——模型、输出、流量、身份、访问——而每一个都确实必要。但"这个具体动作,带着这个目的、在这个上下文里,是否应当在组织治理之下进行?"这个问题,落在了它们所有人之间的缝隙里。在常规堆栈中,它不是任何人的职责。PEA 的存在,就是让它成为某个人的职责——把执行权当作一个独立的架构关切,配以它自己的强制执行点。PEA 不取代其他各层,它补全了这个堆栈,加上了那个被 Agentic AI 变得必要、却无人提供的层。

13. 为什么企业需要 PEA

把抽象论证放进一个真实组织里,它的落点最为有力,因为治理鸿沟并非均匀分布。它随复杂度、随数据的敏感度、随一次错误动作代价的严重程度而增长。那些 Agentic AI 最有希望创造价值的机构,恰恰是最无法容忍"无治理执行"的机构——这也正是为什么这些机构尽管能够大量获得强大模型,却在授予它们真正的权力上最为迟缓。

设想一家全球性银行。在治理意义上,它不是一家公司,而是一个庞大的治理系统,横跨数十个业务条线、多个国家、多个法人实体、多套监管体系,以及同时存在的多个身份来源。它最核心的那些控制不是便利,而是义务:重大非公开信息必须被隔离,研究与投行之间的信息隔离墙必须站得住,活动必须跨司法辖区保持可审计。当这样一家机构设想一个研究 Agent、一个合规 Agent、一个风险 Agent 跨越这些边界协作时,它的第一个问题不是这些 Agent 聪不聪明,而是:每个 Agent 的权力能否被约束,使得"协作"永远不会变成一条绕过隔离墙的路径。在这里,一个 AI 累积起来的权限,是一个跨域组合问题,而不是一次简单的角色查询——而这正是 PEA 被构建来解决的原生问题。

医疗以另一套词汇呈现出同样的结构。受保护的健康信息(PHI)、HIPAA 义务,以及"临床动作必须追溯到合法临床权力"的要求,意味着一个能够读取病历、安排诊疗、或触及临床系统的 Agent,无法仅靠身份来治理。问题永远是:在保护患者的规则之下,"这个"动作、为"这个"目的,是否被授权——而这恰恰是 PEA 强制要在执行之前回答、而非事后才发现的那个问题。

政府与公共部门把赌注又抬高了一层:机密数据、任务权力与严格问责,意味着一个在其被委托权力之外运行的自治系统,不只是一桩合规事件,更是一桩安全事件。在这里,授予有界权力、在运行时强制执行、完整审计、并即时撤销的能力,不是一项功能,而是能否部署的前提。

跨越这三者,结论是同一个,也正是本文的总结论。这些机构采用企业级 AI 的最大障碍,不是模型的智能,而是缺少一种方法——在不破坏组织数十年来构建的治理结构的前提下,授予 AI 执行权。这些组织从不缺智能。它们所缺的——它们一直缺的、也正是 PEA 所提供的——是一种它们能够信任其去行动的智能。

14. 对齐是证据,不是权力

本文第 3 节已经指出:一个完全对齐的 AI 依然可能做出越权的行为。随之而来一个自然的反问——把模型对齐做得更好,是不是终将弥合这道缝隙?2026 年 6 月,OpenAI 发表《Reinforcement Learning Towards Broadly and Persistently Beneficial Models》,给出了迄今最有力的实证回答,而结论恰恰指向相反的方向。

这篇论文本身是真正的进展,我们愿意先把这句话讲清楚:它把诚实、可纠正性、对人类福祉的关切等“有益特征”经强化学习训练后,发现这些特征能跨领域泛化、并在对抗压力下保持。但请注意论文谨慎地没有声称的那部分——它的结果是“改善”,不是“保证”:模型在 53 项基准中改善了 44 项,也就是有 9 项没有;对抗压力下是“更难被推”,而不是不能;有害微调下是“略微更耐”。论文甚至专门研究“持续性(persistence)”——这本身就是一种诚实的承认:对齐会在压力与微调下漂移。每一个结论,都是概率性的。

而做过银行、保险或审计的人,脑子里只有一个等式:风险 = 概率 × 后果。对齐攻的是第一项,降低坏意图产生的概率;它不限定第二项。受监管的高风险部署需要的是最坏情况边界,而不是平均情况的改善——银行无法用“53 项基准改善 44 项”去为一个受托系统承保。降低概率必要且有价值,但它和“限定后果”不是同一类东西,而限定后果正是授权层所做的事。

更深一层,论文建立在一个发现之上:模型拥有人格(persona)——真实存在、能跨任务泛化、且会漂移的行为倾向。有人会把它读成“那就把人格训好、不必再担心”;我们的读法正相反:人格越真实、越能泛化,从外部就越无法验证,在执行的那一刻就越不能被信任。论文没有削弱“模型在执行真实动作的边界上应被视为不可信”这一前提,反而强化了它。

由此得到一条愿意署名的原则:Alignment is evidence, not authority.(对齐是证据,不是权力。)模型被测得的特征——诚实分、人格画像、漂移遥测——都是正当且有用的证据,属于观测:审计、监测、信任分析与取证;它们应当影响人类盯一个 Agent 盯得多紧,但绝不能悄悄抬高或压低 AI 被允许做的事。银行不会因为客户“看起来诚实”就跳过审批。哪怕是最有诱惑的做法——一个发现模型跑偏就自动收紧权限的“漂移监测器”——也必须保持开环:发告警、触发人工复核、拉带外 kill-switch。一旦模型派生的分数被直接接进授权引擎,控制了模型的攻击者就能伪造它,门就跟着动。观测喂给人,不喂给门。

这一切都不是对对齐工作的贬低,而是一种分工:概率归模型构建者去降,后果归治理去限——而这件事不会因为模型变好而变容易,只会更重要,因为更强的 Agent 被托付了更有后果的动作。对齐塑造一个 Agent 想做什么,治理决定它被允许做什么。企业两者都需要,并且必须各自独立地控制。

15. 结语

AI 行业在能力上取得了非凡进步。模型会推理,Agent 会规划,系统会协同,工具会执行。然而有一个挑战仍未解决,而它恰恰是如今决定"这一切能力能否抵达它们最该发挥作用之处"的那个挑战。企业需要的不只是聪明的系统,而是可被治理的系统。

从 AI 助手到 AI 执行者的转变,改变了企业架构的根本要求。AI 一旦获得行动的能力,治理就不能再外在于执行、只在事后查看记录。治理必须变得可操作,授权必须变得可强制,执行必须变得可问责。这些不是安全议程的微调,而是另一套议程——一套安全堆栈从未被设计来承载的议程。

这正是 PEA 被构建来做的工作。PEA 不取代企业治理、合规框架或组织控制;它提供治理与执行之间那条缺失的连接。它把治理从政策转化为运行时控制,把授权从文档转化为强制执行,把执行从一种原始的技术能力转化为一种可信的组织职能。由此,它为 Agentic AI 时代引入了一条前面各章一步步铺垫至此的原则:仅有智能是不够的;权力必须被治理,执行必须被证成,信任必须被工程化,而不是被假定。

企业级 AI 的未来,不会仅由 AI 变得多聪明来定义,而会由组织能否安全地把权力托付给 AI 来定义。这正是本文标题所点明的那次转变——从 AI 助手,到可信执行基础设施——也正是 PEA 之所以存在、要去使其成为可能的那次转变。

PEA 把企业治理从政策转化为运行时控制,从而使向 AI 的可信权力委托成为可能——并随之,实现从 AI 助手到可信执行基础设施的跨越。

Aiegis builds execution governance for AI agents — bounding what agents are allowed to do, independent of what they want to do.Aiegis 构建面向 AI 智能体的执行治理——限定智能体被允许做什么,独立于它们想做什么。 aiegisafety.com