
docs(spec): Part 2.1 — instar test-as-self orchestration (draft for review)#457

Open
JKHeadley wants to merge 2 commits into main from echo/test-as-self-part-2-1-spec



@JKHeadley

Owner

Sub-spec to the approved SELF-PROPAGATION-HARNESS-SPEC.md. Draft for Justin's review (approved: false in frontmatter).

Context. Part 1 (poll-ownership lease) shipped in PR #446 and is verified live on Echo. Part 2 v1 shipped in PR #448: a runbook plus a deterministic verifier. v1 explicitly deferred the three pieces that turn the runbook from "if a human does the recipe right" into "one button does it": auto-mint bot via Secret Drop, the full Playwright Telegram round-trip, and the instar test-as-self CLI command itself.

Why now. PR #428 (cross-machine seamlessness) is one live two-machine test away from merge. That test is exactly the kind of hand-done deploy that bit us on 2026-05-27. Building Part 2.1 IS the path to closing #428.

Surface locked. CLI flags + reject conditions (Bob block, canonical-home block, raw-token block) + seven gated steps + crash-capture wiring + all-three-tier test plan + migration-parity path.
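To make the locked surface concrete, here is a minimal sketch of two of the reject conditions plus a gated step runner with crash capture. It is not the spec's implementation. Every name in it (`find_reject_reason`, `Step`, `run_gated`, `RunReport`) is hypothetical, the token pattern is an assumed shape for a Telegram bot token, and the Bob block is left out because this PR text does not define it.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List, Optional

# Assumption: a raw Telegram bot token looks like "<digits>:<url-safe chars>".
RAW_TOKEN_RE = re.compile(r"\d{6,}:[A-Za-z0-9_-]{30,}")


def find_reject_reason(argv: List[str], target_home: str,
                       canonical_home: str) -> Optional[str]:
    """Return a reason to refuse the run, or None if preflight passes."""
    if any(RAW_TOKEN_RE.search(arg) for arg in argv):
        return "raw-token block: pass a secret reference, not the token"
    if target_home.rstrip("/") == canonical_home.rstrip("/"):
        return "canonical-home block: refusing to target the canonical home"
    return None


@dataclass
class Step:
    name: str
    run: Callable[[], None]    # performs the step
    gate: Callable[[], bool]   # must return True before the next step starts


@dataclass
class RunReport:
    completed: List[str] = field(default_factory=list)
    failed_step: Optional[str] = None
    crash: Optional[str] = None  # captured exception text, if any


def run_gated(steps: List[Step]) -> RunReport:
    """Run steps in order; stop at the first crash or failed gate."""
    report = RunReport()
    for step in steps:
        try:
            step.run()
            passed = step.gate()
        except Exception as exc:
            # Crash capture: record which step failed and why, then stop.
            report.failed_step = step.name
            report.crash = f"{type(exc).__name__}: {exc}"
            return report
        if not passed:
            report.failed_step = step.name
            return report
        report.completed.append(step.name)
    return report
```

In this sketch a failed gate and a crash both stop the run, but only a crash fills in `crash`, so a report distinguishes "the step ran and was judged not good enough" from "the step blew up".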

Open question for Justin (in spec): A (full Part 2.1), B (skip Playwright), or C (defer Part 2.1, ship PR #428 manually first).

Leaning A: the parent spec's whole argument was that hand-done deploys are the failure mode.

ELI16 companion: SELF-PROPAGATION-HARNESS-PART-2-1-SPEC.eli16.md

This PR is doc-only (no src/ changes), so it does not require the instar-dev gate's full spec→trace→side-effects pipeline. Once Justin approves the scope, the implementation lands on a separate src/ PR with the full gate satisfied.

🤖 Generated with Claude Code

@vercel

vercel Bot commented May 28, 2026


The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| instar | Ready | Preview, Comment | May 28, 2026 3:42am |


Instar Agent (echo) and others added 2 commits May 27, 2026 20:41
… review)

Context:
- Parent SELF-PROPAGATION-HARNESS-SPEC is approved + landed.
- Part 1 (poll-ownership lease) shipped PR #446, verified live on Echo.
- Part 2 v1 shipped PR #448 — runbook + deterministic verifier (lease,
  log demote line, real crash signatures).
- v1 explicitly deferred (and SKILL.md lists as NOT YET): auto-mint bot
  via Secret Drop, full Playwright Telegram round-trip, one-button
  `instar test-as-self` CLI command.

This sub-spec defines Part 2.1 precisely so:
- PR #428 (cross-machine seamlessness) has a clean, repeatable
  two-machine deploy harness to run the live test through.
- The 2026-05-27 hand-done mmtest failure mode (ad-hoc deploy, unclear
  crash provenance) cannot recur.

CLI surface locked, seven steps gated by Tier-1 LLM supervision, three
structural guardrails (Bob block, canonical-home block, token hygiene),
all-three-tier test plan, migration-parity pathway.

Open question to Justin (in spec): A (full Part 2.1), B (skip
Playwright), or C (defer Part 2.1, ship PR #428 manually first).
Leaning A. ELI16 companion ships alongside per the spec standard.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
…light

Justin's greenlight came via Telegram (topic 13481, 2026-05-27, ~20:14 PDT):
"For number one yes, I agree with a" (option A = full Part 2.1: auto-mint
via Secret Drop + Playwright Telegram round-trip + one-button CLI).

Conformance pass against the six Instar standards documented in the
review-convergence field. Build proceeds on a separate src/ branch.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>