Self-owned dogfood audit report

jackjin1997/sentinel

No-execution dogfood sample for a public autonomous incident-response agent. The review is based on commit 2ebf5a5363db4ee95483ef3ecbae8e0842550131. It is not a commissioned audit, private vulnerability disclosure, or certification.

TargetSentinel AI incident agent

ValidationStatic source review + scanner

Scanner Score54/100 heuristic score

ExecutionNo target code run

Start AI agent audit Rerun public scan Markdown report

Scope

Next.js POST /api/agent request and SSE streaming boundary
Multi-vendor LLM orchestration in lib/agent.ts
Agent tool registration and tool output flow
Bright Data backed external web tools
Model and Bright Data credential configuration
Vultr and Cloudflare demo deployment scripts
Test, lint, typecheck, and CI readiness

Out of scope: live Sentinel deployments, model provider accounts, Bright Data account behavior, private telemetry, Vultr host state, Cloudflare account state, and unpublished branches.

Executive Summary

sentinel is a useful dogfood target because it is a real autonomous incident-response agent: a browser UI starts a server-side run, multiple model vendors participate in four phases, tools can query internal mock telemetry and external web data, and results stream back to the operator over SSE.

The repo already includes several good defensive choices for a demo-grade agent: request bodies are capped, client disconnects abort provider calls, slow consumers are cut off, tool schemas restrict common inputs, phase output is bounded, and the vendor-status tool uses a fixed vendor allowlist.

The main production risk is the missing operator boundary around a cost-bearing, credential-backed agent endpoint. Before Sentinel is reused outside a controlled demo, it needs authentication, rate/concurrency limits, output redaction, deployment hardening, and CI gates that prove those controls do not regress.

Boundary Map

Area	Evidence	Risk Notes
Browser entry	`app/page.tsx:119-124`	The UI posts `{ incidentId }` directly to `/api/agent` and reads streamed events.
Agent API	`app/api/agent/route.ts:19-73`	The route validates body size and JSON, then starts `runIncidentAgent`. No auth, rate limit, origin policy, or concurrency cap is visible.
SSE output	`app/api/agent/route.ts:52-56`, `lib/agent.ts:8-15`	Tool calls, tool results, text deltas, errors, and final report objects are serialized to the client.
Model credentials	`lib/agent.ts:31-55`, `.env.local.example`	Qwen, Anthropic, Google, and Bright Data credentials are environment backed.
External web tools	`lib/tools/brightdata.ts:83-200`	Vendor status is allowlisted; public postmortem search and GitHub commit lookups can still spend quota and return untrusted web content.
Deployment	`scripts/deploy-vultr.sh:47-88`	The demo script copies `.env.local`, creates a root-managed systemd service, and exposes HTTP `:80`.
Validation gates	`package.json`	`dev`, `build`, and `start` exist, but no `test`, `lint`, `typecheck`, or CI workflow was visible.

Findings

HighPublic agent run endpoint needs auth, quota, and concurrency boundaries

Evidence: app/page.tsx:119-124 starts a run with a plain POST to /api/agent. app/api/agent/route.ts:19-73 accepts any request with a small JSON body and invokes runIncidentAgent, which can call multiple LLM providers and tools across phases.

Recommended fix: add an auth gate before runIncidentAgent, then enforce per-IP or per-token rate limits, a global concurrency cap, and a per-run budget ceiling.

MediumTool results and errors need centralized redaction before streaming

Evidence: AgentEvent includes tool-result and error payloads with unknown content, and the API route streams JSON.stringify(event) directly to the client.

Recommended fix: add a single sanitizeAgentEvent(event) layer before SSE serialization and test redaction for tokens, cookies, signed URLs, query strings, session IDs, and provider error text.

MediumExternal web tools need a stricter tool policy for production telemetry

Evidence: fetchVendorStatus uses a vendor enum, while searchPublicPostmortems accepts an LLM-provided query and fetchGithubRecentCommits accepts an LLM-provided public repo name.

Recommended fix: mark each tool as internal, external, read-only, write-capable, cost-bearing, and prompt-injection exposed. Include source URL, fetch time, fallback status, and freshness in external tool results.

MediumDemo deploy scripts need production hardening notes

Evidence: scripts/deploy-vultr.sh:47-88 syncs code to /opt/sentinel, copies .env.local, creates a systemd service, and exposes HTTP :80. scripts/add-cf-tunnel.sh can publish a trycloudflare URL.

Recommended fix: add demo-only warnings, run as a non-root service user, use a secret manager or locked-down env file, require HTTPS/auth proxy for public deployments, and document log retention.

LowRelease gates are too thin for a security-sensitive agent

Evidence: package.json exposes only dev, build, and start; no test, lint, or typecheck scripts were visible.

Recommended fix: add CI for install, typecheck, lint, unit tests, build, and scanner output generation.

Positive Signals

POST /api/agent caps honest and post-read request bodies at 1 KB.
Client disconnect aborts the agent run so the server stops spending provider tokens.
The SSE route tracks slow consumers and aborts after repeated backpressure misses.
runIncidentAgent rejects unknown incident IDs before running the model chain.
Tool schemas use Zod enums, minimums, maximums, and runtime validation for the GitHub repo shape.
The external vendor-status tool uses a fixed vendor allowlist.
Phase outputs are locally capped and provider output tokens are bounded.
.env.local.example uses placeholders rather than committed secrets.

Priority Fix Plan

Put /api/agent behind auth and add rate, concurrency, and spend controls.
Add centralized SSE event redaction with tests for tool results and provider errors.
Add a tool policy table for internal/external, cost-bearing, prompt-exposed, and fallback-capable tools.
Harden deploy docs and scripts for non-root service execution, HTTPS, firewall, auth proxy, and secret handling.
Add CI with typecheck, lint, unit tests, build, and scanner output generation.

Example Validation Commands

node tools/agent-mcp-audit.mjs /path/to/sentinel --json
node tools/agent-mcp-audit.mjs /path/to/sentinel --sarif > agent-mcp-audit.sarif
bun run typecheck
bun run lint
bun test
bun run build

What the Paid Sprint Adds

The paid sprint would go deeper than this public dogfood sample: implementation-ready patches for auth and rate limiting, sanitizer tests, deployment-mode threat table, CI workflow, agent tool policy, and a concise launch handoff for the repo owner.

Open AI agent intake View fixed quote Review terms