ShipFleet spawns parallel Claude Code sessions against your repo. Each agent works in its own git worktree. You see every keystroke live. A different AI model reviews the code. You approve the PR. That's it.
Every file read, code write, test run, and git commit streams live to your terminal or dashboard. You can kill, redirect, or send a message to any agent at any time. Nothing is hidden.
Human enters the loop only once: to approve the PR. Everything else — from discovering the bug to verifying the fix worked — is automated.
Polls Sentry errors, GitHub Issues (by label), inbox file, CI failures. Scores each by frequency, user impact, severity. Noise filtered out.
Haiku reads each item, deduplicates semantically, decides: build it, skip it, or ask a human. Outputs an actionable task with refined description.
Workers atomically claim tasks (SQLite row lock). Each spawns claude in an isolated git worktree. Runs tests, types, commits, pushes, opens PR.
Gemini reviews security (OWASP, injection, secrets). GPT-5 reviews bugs (logic, races, nulls). Different model families = can't rubber-stamp its own work.
24h after merge, re-checks the source. Sentry error still firing? GitHub issue still open? Regressions auto-re-queue at high priority.
The only manual step. Review the diff, read the cross-model review comment, merge when ready. Everything else is automated.
Everything is a subcommand of shipfleet.
Every autonomous action has a guard. The system is built to fail safely, not to fail silently.
shipfleet init
reads your project and configures itself. Framework, test runner, linter, type checker, git remote, CLAUDE.md — all automatic.