How does NSPEC verify bugs?

An independent bug-verifier agent re-runs the repro in a fresh browser context, up to three times. Only bugs that reproduce, with a manual-grade confidence score, make it into the report.

Do you need access to my source code?

No. NSPEC tests the running UI. You give it a URL and optional login. It never reads your repo unless you opt in to git-diff based risk prioritization.

Which viewports are covered?

Six viewports at launch: desktop 1440, laptop 1280, tablet portrait and landscape, mobile portrait and landscape.

Yes, on Enterprise. Docker and Helm, with BYO LLM (OpenAI, Anthropic, or a local model). Artifacts never leave your network.

Changelog · NSPEC

site · 2026.04.20·April 20, 2026site

nspec.dev launches in public preview

The marketing site, waitlist API, and legal/company pages went live. Dogfooded a full NSPEC standard-tier run against the site itself; four verified bugs were caught on the first pass (mobile menu close button, a dead pricing CTA, an enterprise-tier link, and placeholder footer stubs), all fixed within the day. MX, SPF, DKIM, and DMARC records configured end-to-end so the waitlist confirmation email lands in the inbox, not spam.

core · v5.9.4·April 10, 2026verifierruntime

Server-side verifier dispatcher and hang watchdog

Promoted the bug-verifier from subagent-only to a first-class server-side dispatcher, so every verdict is recorded with its reproduction attempts, confidence, and source artefacts in the manifest regardless of how an agent was invoked. Shipped a watchdog that kills hung agent processes instead of stalling the run. Fixed the pages/ walker regression that was inflating evidence counts from real steps to thousands of stale shots.

core · v5.9.3·April 10, 2026runtimeinfra

Pre-warm MCP handshake, kill the startup race

The run orchestrator was occasionally losing the MCP handshake on cold starts. Moved handshake verification into a pre-warm subprocess and made the runner refuse to schedule work until init returns clean. Pure stdio, process isolation preserved.

core · v5.9.2·April 10, 2026runtime

Agent loader + --agents CLI flag

Project-scoped agents under .claude/agents/ weren't being discovered by the runner in headless mode. Built an explicit agent_loader plus a --agents flag that passes a JSON manifest, removing the auto-discovery assumption. Verified green on a full standard-tier run against a reference app.

core · v5.9·March 28, 2026verifierruntime

Streaming parallel verification

Bug verification now streams in parallel instead of sequencing through a single verifier. Standard-tier wall time for comprehensive runs dropped under four minutes on typical apps. Per-tier timeout bumps prevent false P1s on slow environments.

core · v5.8·March 10, 2026verifierdashboardinfra

Project memory, false-positive prevention, Docker infra

Introduced cross-run project memory: flaky-selector detection, known-false-positive suppression, and a cross-run diff so every pass focuses on what changed. Containerised the runner end-to-end for repeatable deployment. Dashboard gained a runs-over-time view, faster filtering, and N+1 query fixes.

dashboard · phase 4.2·February 22, 2026dashboard

Report server for historical runs

Ship a lightweight report server so teams can browse any prior run without needing the CLI. URLs are stable and link directly into a specific bug bundle.

dashboard · phase 4.1·February 8, 2026dashboarddocs

Self-contained HTML reports

HTML reports now embed screenshots as base64 so the artefact is a single drop-in file · easy to email, easy to attach, no broken image links.

core · v5.1–5.6·January 18, 2026dashboardinfra

Auth, RBAC, admin portal, prompt editor

Multi-user foundations: authentication, role-based access control, an admin portal for managing projects and members, and an inline prompt editor for tuning each specialist agent.

core · v2·November 2025core

Initial NSPEC v2 runtime

Ground-up rewrite of the QA orchestrator, eight specialist agents (orchestrator, ui-explorer, component-auditor, responsive-tester, performance-profiler, bug-verifier, test-case-designer, accessibility-reviewer), six-viewport coverage, and the evidence-bundle output format that's still in use.

Get new entries in your inbox · join the waitlist on the home page.