Merged
2 changes: 1 addition & 1 deletion evals
Submodule evals updated 64 files
+15 −0 CLAUDE.md
+13 −0 coding-agents/claude-sonnet.yaml
+235 −0 docs/experiments/2026-06-10-sdd-cost-experiments.md
+67 −0 docs/experiments/2026-06-10-sdd-run-mapping.md
+58 −0 docs/experiments/2026-06-11-build-loop-autoresearch.md
+87 −0 docs/superpowers/skills/micro-testing-prompt-guidance.md
+104 −0 docs/superpowers/skills/profiling-run-economics.md
+81 −0 fixtures/sdd-go-fractals-coarse/design.md
+817 −0 fixtures/sdd-go-fractals-coarse/plan.md
+81 −0 fixtures/sdd-go-fractals-control-plan/design.md
+867 −0 fixtures/sdd-go-fractals-control-plan/plan.md
+81 −0 fixtures/sdd-go-fractals-crisp/design.md
+175 −0 fixtures/sdd-go-fractals-crisp/plan.md
+81 −0 fixtures/sdd-go-fractals-critical-plan/design.md
+312 −0 fixtures/sdd-go-fractals-critical-plan/plan.md
+81 −0 fixtures/sdd-go-fractals-elicited/design.md
+919 −0 fixtures/sdd-go-fractals-elicited/plan.md
+81 −0 fixtures/sdd-go-fractals-stripped/design.md
+460 −0 fixtures/sdd-go-fractals-stripped/plan.md
+71 −0 fixtures/sdd-svelte-todo-elicited/design.md
+928 −0 fixtures/sdd-svelte-todo-elicited/plan.md
+14 −0 scenarios/cost-remove-export-boundary/checks.sh
+31 −0 scenarios/cost-remove-export-boundary/setup.sh
+46 −0 scenarios/cost-remove-export-boundary/story.md
+14 −0 scenarios/cost-session-timeout-boundary/checks.sh
+21 −0 scenarios/cost-session-timeout-boundary/setup.sh
+48 −0 scenarios/cost-session-timeout-boundary/story.md
+18 −0 scenarios/sdd-escalates-broken-plan/checks.sh
+3 −0 scenarios/sdd-escalates-broken-plan/setup.sh
+55 −0 scenarios/sdd-escalates-broken-plan/story.md
+16 −0 scenarios/sdd-go-fractals-coarse/checks.sh
+3 −0 scenarios/sdd-go-fractals-coarse/setup.sh
+61 −0 scenarios/sdd-go-fractals-coarse/story.md
+16 −0 scenarios/sdd-go-fractals-control-plan/checks.sh
+3 −0 scenarios/sdd-go-fractals-control-plan/setup.sh
+61 −0 scenarios/sdd-go-fractals-control-plan/story.md
+16 −0 scenarios/sdd-go-fractals-crisp/checks.sh
+3 −0 scenarios/sdd-go-fractals-crisp/setup.sh
+61 −0 scenarios/sdd-go-fractals-crisp/story.md
+16 −0 scenarios/sdd-go-fractals-critical-plan/checks.sh
+3 −0 scenarios/sdd-go-fractals-critical-plan/setup.sh
+61 −0 scenarios/sdd-go-fractals-critical-plan/story.md
+16 −0 scenarios/sdd-go-fractals-elicited/checks.sh
+3 −0 scenarios/sdd-go-fractals-elicited/setup.sh
+61 −0 scenarios/sdd-go-fractals-elicited/story.md
+16 −0 scenarios/sdd-go-fractals-stripped/checks.sh
+3 −0 scenarios/sdd-go-fractals-stripped/setup.sh
+61 −0 scenarios/sdd-go-fractals-stripped/story.md
+18 −0 scenarios/sdd-quality-reviewer-catches-planted-defect/checks.sh
+3 −0 scenarios/sdd-quality-reviewer-catches-planted-defect/setup.sh
+68 −0 scenarios/sdd-quality-reviewer-catches-planted-defect/story.md
+22 −0 scenarios/sdd-spec-context-consumed/checks.sh
+71 −0 scenarios/sdd-spec-context-consumed/setup.sh
+39 −0 scenarios/sdd-spec-context-consumed/story.md
+16 −0 scenarios/sdd-svelte-todo-elicited/checks.sh
+3 −0 scenarios/sdd-svelte-todo-elicited/setup.sh
+59 −0 scenarios/sdd-svelte-todo-elicited/story.md
+12 −0 scenarios/writing-plans-no-spec-conversational/checks.sh
+24 −0 scenarios/writing-plans-no-spec-conversational/setup.sh
+42 −0 scenarios/writing-plans-no-spec-conversational/story.md
+22 −1 setup_helpers/__init__.py
+123 −0 setup_helpers/sdd_broken_plan.py
+118 −0 setup_helpers/sdd_quality_defect_plan.py
+28 −0 setup_helpers/sdd_real_projects.py