A real autonomous loop
Milestone ledgers with machine-checkable acceptance, forced verification turns, deterministic green/fix classification, and re-planning when a fix keeps failing. Walk away; it builds, tests, fixes, and verifies until done.
The autonomous agentic IDE · Windows
UltraCodey doesn't autocomplete your code — it staffs the project. An engineer who plans, a designer who builds, a principal who reviews and stamps every change, and employees who work shifts while you sleep. One super app, on your machine, with your accounts, under your rules. And yes — there's a pet.
Window tabs switch the whole surface — each one is a complete product, not a sidebar panel. Code, a daily assistant, a model lab, an analyst desk and an AI staffing agency. Every screenshot below is the real app.
Code — the engineering surface: the agent outlines your repo, edits across files, runs the tests, and reports — every step visible as it happens.
Most agents are one model talking to itself — and shipping its own unreviewed code. UltraCodey's Code surface is organized like an engineering firm: three roles, real separation of duties, and nothing ships unstamped.
Owns the task. Reads the repo, writes the plan, breaks the work into assignments and delegates — it never touches the code itself.
Implements. Edits the files, runs the commands, and iterates until the build is green and the acceptance criteria actually pass.
Reviews the diff with fresh eyes and a deterministic test signal, sends weak work back, and stamps what's done. No stamp, no done.
Away from your desk? The hands-off reviewer takes over: a deterministic risk table, first-match-wins rules, and constrained reviewer sub-agents for high-risk calls — with a flight-recorder audit log of every decision made while you were gone. You come back and read the tape.

Code
In the session above, the agent was asked for a dark-mode toggle. It outlined the settings module, made the edits across three files, ran the suite — 214 passing — and flagged a contrast issue it noticed along the way. That's the whole product in one screenshot: it finishes, it verifies, and it thinks ahead.
/revert undoes a whole run
Employee
Every other tool waits for you to ask. UltraCodey lets you build a roster: name them, pick a face from the cast, hand each one a job description. They work their shifts on schedule — in their own chats, with the same engine and permissions as everything else.

Benchmark
Everyone argues about models; UltraCodey measures them. Pick two or more of your connected models and run a real gauntlet — coding, reasoning, instruction-following, writing — judged with rubrics.

Research
Research loops watch the web for you on a schedule — markets, your stack's ecosystem, any topic, person or product — and synthesize trend-aware reports into a feed you skim over coffee.

Chat
Not everything is code. Chat is a clean, fast general assistant with web access and your memory — and a hard wall between it and your engineering sessions, so a casual question never lands in a repo.
70 verified MCP integrations with one-click install and browser sign-in, plus 124 built-in skill playbooks the agent reaches for on its own.
The vibe layer
Serious firm, playful soul. Codey is your desk pet — a tiny always-on-top companion who watches runs with you, celebrates green tests, dozes off when things are quiet, and answers when you talk to it. Boop it. It likes that.

Make it yours
Built for vibe coders: a generative theme platform, not a light/dark switch. Tiered collections from Arctic to Autumn, live-motion themes with animated backdrops, and Ultra Mode — the aurora flagship. Higher tiers unlock as you level.

Leveling
Every message, finished run and daily streak earns XP. Levels unlock themes and characters; badge tiers stack on your profile; over a thousand achievements track the journey from First Steps to Century Club — across conversation, shipping, goals and devotion.
Everything below ships in the app today. No waitlists inside the product, no cloud lock-in, no telemetry.
Milestone ledgers with machine-checkable acceptance, forced verification turns, deterministic green/fix classification, and re-planning when a fix keeps failing. Walk away; it builds, tests, fixes, and verifies until done.
Anthropic, OpenAI, Gemini, OpenRouter, xAI, DeepSeek, Mistral, Groq, Ollama and a dozen more — or any OpenAI-compatible endpoint. Sign in with Claude Pro/Max or ChatGPT the same way the official tools do, or paste a key. Keys live in the Windows keychain, never in plaintext.
Fan work out to parallel sub-agents, round-robin across every connected provider, and race N attempts in isolated git worktrees — a judge merges only the attempt that builds green.
Reinforced SQLite memory scored by recency, importance and results; reflection after every run; skills it authors for itself; project profiles and file-location memory. Every new chat starts already knowing your codebase and your preferences.
Use multiple subscriptions at once: a manager model orchestrates while worker models execute — or set both to Auto and let the best connected model take the lead per task.
CodeMirror editor, real PTY terminal that survives tab switches, red/green diffs with a code reviewer, project-wide search, checkpoints with one-command revert, @-file mentions, live todo tracking, and a built-in browser for your dev server.
Pixel-accurate multi-monitor mouse and keyboard control, screenshots, window management and clipboard — benchmarked to ≤1px cursor accuracy across the full virtual desktop, with a visual-verify gate.
Full Access when you want it; an auto-reviewer when you're away — a deterministic risk table, first-match rules, a constrained reviewer agent for high-risk calls, and a flight-recorder audit log of every decision.
Schedule any prompt on any interval, or set a /goal and the
agent keeps working until the goal is verifiably complete — with cost
dashboards, budgets, and run replay to audit what happened.
Control Flow Guard, CET shadow stacks, DPAPI-encrypted secrets, SSRF and DNS-rebinding protection, debugger tamper guard, process containment via Windows Job Objects. Local-first: your code never transits anything but the model API you chose.
Run fully local with Ollama — curated model presets by VRAM tier, pull progress in-app, and the same agent loop pointed at your own hardware.
One click pulls your memory, skills and MCP servers from Claude Code, Codex, Gemini CLI, Cursor and Windsurf. It even speaks ACP both directions — drive it from Zed, or host external agents inside it.
After every run UltraCodey reflects: what worked, what failed, what the reviewer rejected, what you reverted. Useful lessons are reinforced; bad ones decay; recurring playbooks become skills it writes for itself. A self-train loop consolidates everything in the background — locally, in SQLite you can open and read.
An agent with shell access deserves bank-grade paranoia. UltraCodey is hardened at every layer — binary, secrets, network, process — and it keeps receipts.
We did. Then we built the app we actually wanted. Fair comparison, current as of June 2026.
| Capability | UltraCodey | Claude Code | Cursor | Codex |
|---|---|---|---|---|
| Native desktop app, local-first | Yes | Terminal | Editor fork | Terminal / cloud |
| Use ANY provider — or all at once | 20+, mixed per role | Anthropic | Several, theirs | OpenAI |
| Sign in with existing paid plans | Claude + ChatGPT | Claude | No | ChatGPT |
| Persistent self-learning memory | Yes, reinforced | Files | Limited | Limited |
| Hire persistent AI employees | Yes | No | No | No |
| Built-in model benchmark lab | Yes | No | No | No |
| Standing web research loops | Yes | No | No | No |
| Computer use (real desktop control) | Yes, ≤1px | No | No | No |
| Walk-away autonomy with audit log | Yes | Partial | Partial | Partial |
| A pet that celebrates your green tests | Obviously | No | No | No |
| Price during beta | Free | Plan | $20+/mo | Plan |
Honest caveat: those are excellent tools — UltraCodey can even import their configs and host them over ACP. The difference is scope: they assist a programmer; UltraCodey staffs a project.
UltraCodey is in private beta. Seats open in waves — ask for one and we'll send the signed installer and a quick-start.
Windows 10/11 · 64-bit · bring your own model accounts · no telemetry