Skip to content

Trust Center

Triggerfish enforces security in deterministic code below the LLM layer — not in prompts the model might ignore. Every policy decision is made by code that cannot be influenced by prompt injection, social engineering, or model misbehavior. See the full Security-First Design page for the deep technical explanation.

Security Controls

These controls are active in the current release. Each is enforced in code, tested in CI, and auditable in the open-source repository.

ControlStatusDescription
Sub-LLM Policy Enforcement ACTIVEEight deterministic hooks intercept every action before and after LLM processing. The model cannot bypass, modify, or influence security decisions.
Data Classification System ACTIVEFour-level hierarchy (PUBLIC, INTERNAL, CONFIDENTIAL, RESTRICTED) with mandatory no-write-down enforcement.
Session Taint Tracking ACTIVEEvery session tracks the highest classification of data accessed. Taint only escalates, never decreases.
Immutable Audit Logging ACTIVEAll policy decisions logged with full context. Audit logging cannot be disabled by any component of the system.
Secrets Isolation ACTIVECredentials stored in OS keychain or vault. Never in config files, storage, logs, or LLM context.
Plugin Sandboxing ACTIVEThird-party plugins run in a Deno + WASM double sandbox (Pyodide). No undeclared network access, no data exfiltration.
Dependency Scanning ACTIVEAutomated vulnerability scanning via GitHub Dependabot. PRs opened automatically for upstream CVEs.
Open Source Codebase ACTIVEFull security architecture is Apache 2.0 licensed and publicly auditable.
On-Premises Deployment ACTIVERuns entirely on your infrastructure. No cloud dependency, no telemetry, no external data processing.
Encryption ACTIVETLS for all data in transit. OS-level encryption at rest. Enterprise vault integration available.
Responsible Disclosure Program ACTIVEDocumented vulnerability reporting process with defined response timelines. See disclosure policy.
Hardened Container Image PLANNEDDocker images on Google Distroless base with near-zero CVEs. Automated Trivy scanning in CI.

Defense in Depth — 13 Independent Layers

No single layer is sufficient alone. If one layer is compromised, the remaining layers continue to protect the system.

LayerNameEnforcement
01Channel AuthenticationCode-verified identity at session establishment
02Permission-Aware Data AccessSource system permissions, not system credentials
03Session Taint TrackingAutomatic, mandatory, escalation-only
04Data LineageFull provenance chain for every data element
05Policy Enforcement HooksDeterministic, non-bypassable, logged
06MCP GatewayPer-tool permissions, server classification
07Plugin SandboxDeno + WASM double sandbox (Pyodide)
08Secrets IsolationOS keychain or vault, below LLM layer
09Filesystem Tool SandboxPath jail, path classification, taint-scoped I/O
10Agent Identity & DelegationCryptographic delegation chains
11Audit LoggingCannot be disabled
12SSRF PreventionIP denylist + DNS resolution checks
13Memory Classification GatingWrite at own level, read down only

Read the full Defense in Depth architecture documentation.

Why Sub-LLM Enforcement Matters

Most AI agent platforms enforce security through system prompts —

instructions to the LLM saying "do not share sensitive data." Prompt injection attacks can override these instructions.

Triggerfish takes a different approach: the LLM has zero authority over security decisions. All enforcement happens in deterministic code below the LLM layer. There is no pathway from LLM output to security configuration. :::

Compliance Roadmap

Triggerfish is pre-certification. Our security posture is architectural and verifiable in source code today. Formal certifications are on the roadmap.

CertificationStatusNotes
SOC 2 Type I PLANNEDSecurity + Confidentiality trust services criteria
SOC 2 Type II PLANNEDSustained control effectiveness over observation period
HIPAA BAA PLANNEDBusiness associate agreement for healthcare customers
ISO 27001 PLANNEDInformation security management system
Third-Party Penetration Test PLANNEDIndependent security assessment
GDPR Compliance PLANNEDSelf-hosted architecture with configurable retention and deletion

A Note on Trust

The security core is open source under Apache 2.0. You can read every

line of policy enforcement code, run the test suite, and verify claims yourself. Certifications are on the roadmap. :::

Audit the Source

The full Triggerfish codebase is available at github.com/greghavens/triggerfish — Apache 2.0 licensed.

Vulnerability Reporting

If you discover a security vulnerability, please report it through our Responsible Disclosure Policy. Do not open public GitHub issues for security vulnerabilities.