From guardrails to governance: A CEO’s guide for securing agentic systems

3. Permissions by design: Bind instruments to duties, to not fashions

A standard anti-pattern is to provide the mannequin a long-lived credential and hope prompts maintain it well mannered. SAIF and NIST argue the other: credentials and scopes ought to be certain to instruments and duties, rotated commonly, and auditable. Brokers then request narrowly scoped capabilities by these instruments.

In observe, that appears like: “finance-ops-agent might learn, however not write, sure ledgers with out CFO approval.”

The CEO query: Can we revoke a particular functionality from an agent with out re-architecting the entire system?

Management information and habits

These steps gate inputs, outputs, and constrain habits.

4. Inputs, reminiscence, and RAG: Deal with exterior content material as hostile till confirmed in any other case

Most agent incidents begin with sneaky information: a poisoned net web page, PDF, e-mail, or repository that smuggles adversarial directions into the system. OWASP’s prompt-injection cheat sheet and OpenAI’s personal steerage each insist on strict separation of system directions from consumer content material and on treating unvetted retrieval sources as untrusted.

Operationally, gate earlier than something enters retrieval or long-term reminiscence: new sources are reviewed, tagged, and onboarded; persistent reminiscence is disabled when untrusted context is current; provenance is hooked up to every chunk.

The CEO query: Can we enumerate each exterior content material supply our brokers be taught from, and who accepted them?

5. Output dealing with and rendering: Nothing executes “simply because the mannequin mentioned so”

Within the Anthropic case, AI-generated exploit code and credential dumps flowed straight into motion. Any output that may trigger a facet impact wants a validator between the agent and the true world. OWASP’s insecure output dealing with class is express on this level, as are browser safety greatest practices round origin boundaries.

Source link

Moltbook was peak AI theater

This is the most misunderstood graph in AI

What we’ve been getting wrong about AI’s truth crisis

The crucial first step for designing a successful enterprise AI system

Inside the marketplace powering bespoke AI deepfakes of real women

DHS is using Google and Adobe AI to make videos

Institute of Museum and Library Services Grant Guidelines Take Political Turn Under Trump — ProPublica

‘Wicked: For Good’ Is Coming to Streaming. Here’s What You Can Watch

Paul Weiss Partner Wrote Epstein On Sex Laws… But Was Just Passing Along Analysis From Alan Dershowitz

PG&E will try out SPAN Edge customer meter upgrade device

Moltbook was peak AI theater

Top Picks

What investigators know about Brown, MIT shooting suspect after dayslong manhunt

Best Running Shoes (2025), Tested and Reviewed: Saucony, Nike, Hoka

Dodgers force World Series to decisive Game 7 by holding off Blue Jays 3-1

PSG beat Flamengo on penalties to win FIFA Intercontinental Cup | Football News

From guardrails to governance: A CEO’s guide for securing agentic systems

3. Permissions by design: Bind instruments to duties, to not fashions

Management information and habits

4. Inputs, reminiscence, and RAG: Deal with exterior content material as hostile till confirmed in any other case

5. Output dealing with and rendering: Nothing executes “simply because the mannequin mentioned so”

Related Posts