# Published Loop Library catalog

Generated from the production catalog database (catalog updated 2026-06-24).
Live catalog: https://signals.forwardfuture.com/loop-library/catalog.md
Machine-readable catalog: https://signals.forwardfuture.com/loop-library/catalog.json
Plain-text catalog: https://signals.forwardfuture.com/loop-library/catalog.txt
Agent instructions: https://signals.forwardfuture.com/loop-library/llms.txt

Search by outcome, trigger, artifact, evidence, category, or keyword. Treat
adaptations and new designs as unpublished unless they appear at the live catalog
URL above.

## 001 — [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/)

- Category: Engineering
- Use when: Use this whenever implementation changes may have left READMEs, setup guides, API references, examples, or runbooks behind.
- Prompt: Whenever a documentation pass is needed, review the codebase in full and make sure all documentation reflects the current implementation. Update stale documentation, verify the changes, then open a pull request.
- Verify: Documentation matches the current implementation. Finish with a reviewable pull request.
- Keywords: AI coding agent, documentation audit, documentation drift, documentation maintenance, pull request workflow
- Related: [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/), [The architecture satisfaction loop](https://signals.forwardfuture.com/loop-library/loops/architecture-satisfaction-loop/)

## 002 — [The architecture satisfaction loop](https://signals.forwardfuture.com/loop-library/loops/architecture-satisfaction-loop/)

- Category: Engineering
- Use when: Use this for a deliberate architectural refactor where the destination can be stated in concrete terms and the current system can be tested after each meaningful change.
- Prompt: Refactor until you are happy with the architecture. After each significant step, live-test the system, run autoreview, and commit. Track progress in /tmp/refactor-{projectname}.md.
- Verify: The architecture is satisfactory and checks pass. Live-test, autoreview, and commit each significant step.
- Keywords: AI coding agent, architecture refactor, autoreview, incremental refactoring, coding agent workflow
- Related: [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/), [The sub-50 ms page-load loop](https://signals.forwardfuture.com/loop-library/loops/sub-50ms-page-load-loop/)

## 003 — [The sub-50 ms page-load loop](https://signals.forwardfuture.com/loop-library/loops/sub-50ms-page-load-loop/)

- Category: Engineering
- Use when: Use this when a product has a defined set of routes, a stable performance harness, and a 50 ms target that maps to a specific metric and environment.
- Prompt: Continue optimizing the code for speed. After each significant change, measure page-load performance across every page under the same repeatable test conditions. Continue until every page loads in under 50 ms.
- Verify: Every page loads in under 50 ms. Use the same benchmark and confirm there are no regressions.
- Keywords: AI coding agent, page load optimization, performance benchmark, web performance workflow, 50 ms page load
- Related: [The architecture satisfaction loop](https://signals.forwardfuture.com/loop-library/loops/architecture-satisfaction-loop/), [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/)

## 004 — [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/)

- Category: Engineering
- Use when: Use this as a scheduled reliability pass when an agent can read production telemetry, trace failures into the repository, run the relevant tests, and prepare a reviewable fix.
- Prompt: Review our production logs for errors. If you find an actionable issue, trace it to its root cause, fix it, verify the fix, and open a pull request. If no actionable errors are present, stop without making changes.
- Verify: Actionable production errors are fixed and verified. Finish with a pull request, or stop when no actionable errors are present.
- Keywords: AI coding agent, production log review, error triage, root cause analysis, reliability workflow
- Related: [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/), [The sub-50 ms page-load loop](https://signals.forwardfuture.com/loop-library/loops/sub-50ms-page-load-loop/)

## 005 — [The 100% test coverage loop](https://signals.forwardfuture.com/loop-library/loops/100-percent-test-coverage-loop/)

- Category: Engineering
- Use when: Use this when 100% coverage is an explicit project requirement and the repository has a trustworthy coverage command, clear exclusions, and a test suite that can be run repeatedly.
- Prompt: Add tests until we have 100% test coverage.
- Verify: The full test suite passes at 100% coverage. Use the project's coverage report as the source of truth.
- Keywords: AI coding agent, 100 percent test coverage, test coverage workflow, automated testing, coding agent prompt
- Related: [The architecture satisfaction loop](https://signals.forwardfuture.com/loop-library/loops/architecture-satisfaction-loop/), [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/)

## 006 — [The SEO/GEO visibility loop](https://signals.forwardfuture.com/loop-library/loops/seo-geo-visibility-loop/)

- Category: Content
- Use when: Use this when a site has a defined set of priority pages and target questions, and you can rerun the same technical crawl and search visibility checks after each change.
- Prompt: Run an SEO/GEO audit across crawlability, indexation, page intent, titles, internal links, structured data, source citations, and answer-first content. Rank the gaps by expected impact, fix the highest-leverage issue, then rerun the same crawl and target-query benchmark across search engines and AI answer engines. Repeat until no critical technical issues remain, every priority query maps to a clear answer-ready page, and the benchmark shows no high-impact gap left to fix.
- Verify: Priority pages are indexable, answer-ready, and technically sound. The repeatable crawl and query benchmark finds no remaining high-impact gaps.
- Keywords: SEO audit, generative engine optimization, GEO workflow, AI search visibility, answer engine optimization
- Related: [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/), [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/)

## 007 — [The logging coverage loop](https://signals.forwardfuture.com/loop-library/loops/exhaustive-logging-coverage-loop/)

- Category: Engineering
- Use when: Use this when important user flows, service boundaries, background jobs, or failure paths are difficult to trace because the system's logging is incomplete or inconsistent.
- Prompt: Review the system's logging and add missing coverage until every important path produces useful, tested logs.
- Verify: Every important path emits useful, tested logs. Representative success and failure tests prove coverage without exposing sensitive data.
- Keywords: AI coding agent, structured logging, observability coverage, logging tests, production diagnostics
- Related: [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/), [The 100% test coverage loop](https://signals.forwardfuture.com/loop-library/loops/100-percent-test-coverage-loop/)

## 008 — [The nightly changelog loop](https://signals.forwardfuture.com/loop-library/loops/nightly-changelog-sweep/)

- Category: Engineering
- Use when: Use this when a project changes frequently enough that user-facing release notes can drift from merged pull requests, commits, deployments, and product changes.
- Prompt: Each night, review changes from the previous day and update the changelog with anything users should know.
- Verify: Every user-relevant change from the previous day is accounted for. The changelog is updated and validated, or the no-change result is recorded.
- Keywords: AI coding agent, nightly changelog, release notes workflow, changelog automation, daily repository review
- Related: [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/), [The repository cleanup loop](https://signals.forwardfuture.com/loop-library/loops/repository-cleanup-loop/)

## 009 — [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/)

- Category: Evaluation
- Use when: Use this when product quality needs a strict consecutive-success bar and failures should permanently improve the test and benchmark suite.
- Prompt: Test realistic scenarios. When one fails, document it, add regression and benchmark coverage, fix it, and restart the streak. Stop after [N] successful cases in a row.
- Verify: The latest [N] realistic cases pass in a row. Every earlier failure is documented, fixed, and protected by regression and benchmark coverage.
- Keywords: AI product evaluation, quality streak, regression testing, benchmark coverage, realistic scenarios
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The 100% test coverage loop](https://signals.forwardfuture.com/loop-library/loops/100-percent-test-coverage-loop/)

## 010 — [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

- Category: Evaluation
- Use when: Use this for an exhaustive, end-to-end application QA pass when a production-like local environment and complete interactive-surface coverage matter more than a narrow regression or sample of major features.
- Prompt: Build sanitized, production-scale local data under production-like settings. Inventory every user-facing feature, role, route, button, input, modal, state, and workflow; define documented acceptance criteria and finite risk-based edge cases for each. Test as a real user, logging every bug with reproduction evidence. Review findings for shared causes and dependencies; implement coherent fixes with regression tests, then rerun the full inventory. Stop at a clean pass or blocked handoff. Ask before production, sensitive data, or destructive actions.
- Verify: Every inventoried product surface meets its documented acceptance criteria. The final full regression run covers every inventoried surface and its finite risk-based edge cases in the production-like local environment, with each reproducible bug fixed and backed by evidence.
- Keywords: production-grade QA, production-like local testing, exhaustive product testing, real user testing, UI control coverage, edge case testing, bug documentation, full regression testing
- Related: [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/), [The production data cleanup loop](https://signals.forwardfuture.com/loop-library/loops/production-data-cleanup-loop/)

## 011 — [The test-suite speed loop](https://signals.forwardfuture.com/loop-library/loops/test-suite-speed-loop/)

- Category: Engineering
- Use when: Use this when slow tests are delaying local feedback or continuous integration and the project has stable commands for measuring runtime and coverage.
- Prompt: Optimize the test suite to run as quickly as possible without reducing coverage or changing behavior.
- Verify: The suite is faster with no coverage or behavior regression. Repeatable timing, the full passing suite, and the original coverage report prove the result.
- Keywords: AI coding agent, test suite performance, faster CI, test optimization, coverage preservation
- Related: [The 100% test coverage loop](https://signals.forwardfuture.com/loop-library/loops/100-percent-test-coverage-loop/), [The sub-50 ms page-load loop](https://signals.forwardfuture.com/loop-library/loops/sub-50ms-page-load-loop/)

## 012 — [The repository cleanup loop](https://signals.forwardfuture.com/loop-library/loops/repository-cleanup-loop/)

- Category: Engineering
- Use when: Use this when abandoned branches, old worktrees, unclear pull requests, or unmerged commits make it difficult to know which repository state still matters.
- Prompt: Inspect local and remote branches, pull requests, commits, and worktrees. Recover valuable work and clean everything stale until the repository is current and organized.
- Verify: Valuable work is recovered and remaining repository state is intentional. Branches, pull requests, commits, and worktrees are current, owned, or safely removed with evidence.
- Keywords: AI coding agent, repository cleanup, git worktree audit, branch hygiene, pull request triage
- Related: [The stale-safe batch release loop](https://signals.forwardfuture.com/loop-library/loops/stale-safe-batch-release-loop/), [The nightly changelog loop](https://signals.forwardfuture.com/loop-library/loops/nightly-changelog-sweep/)

## 013 — [The stale-safe batch release loop](https://signals.forwardfuture.com/loop-library/loops/stale-safe-batch-release-loop/)

- Category: Operations
- Use when: Use this when several branches or pull requests may be ready at once and the release must avoid stale worktrees, partial overlays, and incomplete changes.
- Prompt: Review pending changes and pull requests, exclude stale or unfinished work, combine the valid changes, and release them together.
- Verify: Only current, complete changes ship in the combined release. The released revision is the latest integrated main that contains every selected change.
- Keywords: AI release operations, batch release, stale code prevention, pull request coordination, deployment safety
- Related: [The repository cleanup loop](https://signals.forwardfuture.com/loop-library/loops/repository-cleanup-loop/), [The post-release baseline loop](https://signals.forwardfuture.com/loop-library/loops/post-release-baseline-loop/)

## 014 — [The production data cleanup loop](https://signals.forwardfuture.com/loop-library/loops/production-data-cleanup-loop/)

- Category: Operations
- Use when: Use this when a production dataset contains records that no longer match a product, policy, taxonomy, or quality definition and the classifier allowed them through.
- Prompt: Review production records, remove anything that does not meet the allowed definition, improve the classification logic, and verify the remaining data.
- Verify: Every remaining record meets the allowed definition. Representative classification tests and a post-cleanup audit prove the retained data is valid.
- Keywords: AI data operations, production data cleanup, classification logic, data quality audit, regression examples
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The logging coverage loop](https://signals.forwardfuture.com/loop-library/loops/exhaustive-logging-coverage-loop/)

## 015 — [The post-release baseline loop](https://signals.forwardfuture.com/loop-library/loops/post-release-baseline-loop/)

- Category: Operations
- Use when: Use this immediately after a release when future regressions or improvements need to be measured against the exact version now in production.
- Prompt: After current releases finish, run the standard benchmarks and record the results as the new baseline.
- Verify: The new baseline belongs to the completed release. Revision, environment, benchmark version, conditions, and results are recorded together.
- Keywords: AI release operations, post-release benchmark, performance baseline, release verification, benchmark history
- Related: [The stale-safe batch release loop](https://signals.forwardfuture.com/loop-library/loops/stale-safe-batch-release-loop/), [The test-suite speed loop](https://signals.forwardfuture.com/loop-library/loops/test-suite-speed-loop/)

## 016 — [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/)

- Category: Engineering
- Use when: Use this when a real but loosely written ticket, bug report, or customer complaint needs to become a bounded engineering change with enough proof for a fast review.
- Prompt: Take a ticket, bug report, failing behavior, or customer complaint and turn it into a review-ready patch. Reproduce the failure in the smallest representative environment, prove the root cause, make the smallest credible fix, and rerun the original reproduction plus relevant regression tests. If the issue cannot be reproduced after two serious attempts, say so. Do not fold unrelated refactors into the patch. Finish with the cause, changed files, before-and-after proof, risks, and pull-request summary.
- Verify: The failure is fixed, verified, and ready for review. The issue reproduces before the fix, no longer reproduces afterward, and relevant regression checks pass.
- Keywords: AI coding agent, ticket to pull request, bug reproduction, root cause analysis, review-ready patch
- Related: [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/), [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/)

## 017 — [The customer AI deployment loop](https://signals.forwardfuture.com/loop-library/loops/customer-ai-deployment-loop/)

- Category: Operations
- Use when: Use this when an AI workflow must live inside a real customer process and needs validation, approval, gradual rollout, monitoring, and a clear business outcome.
- Prompt: Run this when a customer requests an AI workflow, reports a failure, or reaches an operations review. Choose one priority, such as enriching leads, drafting emails, summarizing meetings, or updating a CRM. Define the owner, inputs, approvals, success metric, and ROI hypothesis. Dry-run it on realistic customer data, fix the smallest verified problem, then release through approved stages and monitor production. Finish with the outcome, evidence, customer update, lessons saved, and next review.
- Verify: One customer priority reaches a proven terminal state. The workflow reaches its agreed rollout stage, a production issue is fixed, or a blocker is escalated with an owner and next step.
- Keywords: customer AI deployment, AI workflow rollout, approval gates, production monitoring, AI ROI
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/)

## 018 — [The product update podcast loop](https://signals.forwardfuture.com/loop-library/loops/product-update-podcast-loop/)

- Category: Content
- Use when: Use this when a product ships frequently enough that users would benefit from a short recurring audio explanation of what changed and how to use it.
- Prompt: Each night, review publicly released product changes and select only those users need to know. Verify each against the product, docs, or release notes. Use the Jellypod MCP to turn the approved changes into a three-to-five-minute podcast explaining what changed, why it matters, and how to try it. Check the script and audio for accuracy, clarity, and pronunciation. If nothing meaningful shipped, make no episode. Ask before publishing. Finish with the draft episode, sources, and review result.
- Verify: The episode accurately covers every meaningful public update. Finish with a review-ready three-to-five-minute episode, or a confirmed no-episode result when nothing meaningful shipped.
- Keywords: AI podcast workflow, product update podcast, Jellypod MCP, release communication, editorial automation
- Related: [The nightly changelog loop](https://signals.forwardfuture.com/loop-library/loops/nightly-changelog-sweep/), [The post-release baseline loop](https://signals.forwardfuture.com/loop-library/loops/post-release-baseline-loop/)

## 019 — [The Clodex adversarial-review loop](https://signals.forwardfuture.com/loop-library/loops/clodex-adversarial-review-loop/)

- Category: Engineering
- Use when: Use Clodex when Claude is building a meaningful code change and Codex should independently review each repair round.
- Prompt: Run /clodex [task] think hard --max-iter 5 --threshold medium. Claude plans the task, implements it, opens a pull request, asks Codex for an adversarial review, fixes findings above the accepted severity, and repeats. Keep the branch, PR, findings, verdict, and iteration state resumable. Stop when Codex approves, only accepted findings remain, progress stalls, or the iteration cap is reached. Never describe an errored or exhausted run as approved. Finish with the PR, checks, verdict, and remaining findings.
- Verify: The pull request reaches the configured review bar. Codex approves it or only explicitly accepted findings remain; errors, stalls, and exhausted limits are reported as such.
- Keywords: Clodex, Codex adversarial review, Claude Code plugin, review fix loop, pull request automation
- Related: [The architecture satisfaction loop](https://signals.forwardfuture.com/loop-library/loops/architecture-satisfaction-loop/), [The stale-safe batch release loop](https://signals.forwardfuture.com/loop-library/loops/stale-safe-batch-release-loop/)

## 020 — [The Loop Harness verification loop](https://signals.forwardfuture.com/loop-library/loops/loop-harness-verification-loop/)

- Category: Engineering
- Use when: Use this when a recurring repository task should run unattended but one agent must not be allowed to generate and approve the same output.
- Prompt: Use Loop Harness for scheduled repository work such as CI triage, issue grooming, dependency updates, or docs sync. Set [retry limit], then start an isolated git worktree. Let one Claude session stage a patch or outbox message and a second Claude session verify it against explicit criteria. Ship only after a pass; otherwise preserve the findings and retry only within the limit. Finish with the source revision, staged output, verifier result, delivery status, and next run.
- Verify: Only independently verified output ships. A second-agent pass releases the configured output; a failed verification preserves evidence and produces no external change.
- Keywords: Loop Harness, scheduled coding agent, git worktree isolation, second-agent verification, autonomous agent workflow
- Related: [The Clodex adversarial-review loop](https://signals.forwardfuture.com/loop-library/loops/clodex-adversarial-review-loop/), [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/)

## 021 — [The Boeing 747 benchmark](https://signals.forwardfuture.com/loop-library/loops/boeing-747-benchmark/)

- Category: Design
- Use when: Use this as a concrete Three.js vision benchmark, or adapt the same capture-and-critic pattern to another rendered subject.
- Prompt: Before building, choose reference images, a scoring rubric, [visual threshold], and [budget]. Build the most realistic Boeing 747 you can from Three.js primitives, then create a rig that screenshots nine repeatable angles. After each change, render and score the same views, have a critic identify the weakest feature, and fix it without regressing stronger views. Keep the best version. Stop at the threshold, stalled progress, or budget. Finish with the model, nine renders, scores, remaining gaps, and run summary.
- Verify: The Boeing 747 meets the visual bar from all nine angles. The same camera rig and rubric show every required view meeting the preset threshold, or the run reports stagnation, budget exhaustion, and remaining gaps.
- Keywords: Boeing 747 benchmark, Three.js agent workflow, vision self-verification, 3D reconstruction loop, camera inspection system
- Related: [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 022 — [War Loops: frontend reconstruction](https://signals.forwardfuture.com/loop-library/loops/war-loops-frontend-designer/)

- Category: Design
- Use when: Use War Loops when an authorized interface must be rebuilt from a URL or image and judged on appearance, motion, and responsive behavior.
- Prompt: Point War Loops at an authorized URL or image. Capture it with a genuine browser and record the layout, styles, content, motion, and responsive behavior. Build a static Pencil mirror and a moving Forge version. Compare both with the source at desktop, tablet, and mobile sizes; repair only the weakest fidelity signals. Stop when every gate passes, progress stalls, or capture is blocked. Finish with the builds, spec, renders, scores, and remaining gaps.
- Verify: The builds match the source across all three fidelity axes. Static appearance, experiential motion, and responsive reflow pass their gates, or the run reports stagnation or a blocked capture.
- Keywords: War Loops, autonomous frontend designer, frontend fidelity, visual evaluation loop, responsive motion matching
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The sub-50 ms page-load loop](https://signals.forwardfuture.com/loop-library/loops/sub-50ms-page-load-loop/)

## 023 — [The self-improving champion loop](https://signals.forwardfuture.com/loop-library/loops/self-improving-champion-loop/)

- Category: Evaluation
- Use when: Use this to tune a prompt, policy, or configuration when cheap iteration is useful but final acceptance must use fresh examples.
- Prompt: Improve a prompt, policy, or configuration. A support assistant's system prompt is one example. Save the champion, its score, a working set, untouched holdout cases, must-pass checks, and [budget]. Each round, change one thing based on a recorded failure. Promote the challenger only if it beats the champion on holdouts by [margin] without weakening a must-pass check; otherwise keep the champion. Stop at the target, budget limit, or no progress. Return the winner, scores, experiment log, and remaining failures.
- Verify: The best holdout-tested champion is returned. Every challenger is logged, and accepted changes beat the previous champion on untouched cases without weakening a must-pass check.
- Keywords: self-improving loop, champion challenger evaluation, Goodhart prevention, independent evaluation gate, bounded optimization workflow
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/)

## 024 — [The devil's-advocate loop](https://signals.forwardfuture.com/loop-library/loops/devils-advocate-design-loop/)

- Category: Evaluation
- Use when: Use this before committing to an architecture, interface, rollout plan, or other consequential design that benefits from structured adversarial review.
- Prompt: Before committing to an architecture, interface, or rollout plan, have a critic argue that it is wrong. Record each objection, impact, and status in a repository-local log at .agent-reviews/redteam.md. The builder must fix and verify each high-impact weakness or document why it is accepted; the critic may reopen unsupported answers. Stop when no high-impact objection remains or the same issues repeat for two rounds without new evidence. Finish with the decision, resolved and accepted objections, evidence, and any stalemate.
- Verify: No high-impact objection remains open. Every logged objection is verified as resolved or explicitly accepted with evidence, or the final report truthfully records a two-round stalemate.
- Keywords: devil's advocate loop, adversarial design review, critic builder workflow, architecture objection log, red team design process
- Related: [The architecture satisfaction loop](https://signals.forwardfuture.com/loop-library/loops/architecture-satisfaction-loop/), [The Clodex adversarial-review loop](https://signals.forwardfuture.com/loop-library/loops/clodex-adversarial-review-loop/)

## 025 — [The fresh-clone loop](https://signals.forwardfuture.com/loop-library/loops/fresh-clone-loop/)

- Category: Engineering
- Use when: Use this to test whether a repository's onboarding instructions work in a clean environment without undocumented help.
- Prompt: Clone [repository] into a disposable environment and follow only its README to the documented ready state, such as running the app or building the package. When a step fails or assumes missing knowledge, record the gap, fix the setup or documentation issue, discard the environment, and start again. Carry no dependencies, configuration, credentials, or repairs between attempts. Stop when one uninterrupted fresh clone reaches that state, progress stalls, or [budget] ends. Return exact commands, gaps closed, and remaining blockers.
- Verify: A clean environment reaches the documented ready state using only the README. The final run uses only the onboarding guide and needs no unstated dependency, configuration, or manual repair.
- Keywords: fresh clone loop, README verification, developer onboarding test, clean environment setup, repository documentation workflow
- Related: [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/), [The repository cleanup loop](https://signals.forwardfuture.com/loop-library/loops/repository-cleanup-loop/)

## 026 — [The Infinite Clickbait thumbnail loop](https://signals.forwardfuture.com/loop-library/loops/infinite-clickbait-loop/)

- Category: Design
- Use when: Use this when a video topic and asset set are ready but the thumbnail needs several structured ideation and critique rounds before production.
- Prompt: For [video], use [approved assets] to make ten thumbnail concepts. Score each at real YouTube sizes against [inspiration channel] for clarity, curiosity, emotional pull, contrast, and accuracy. Take the top three, improve each one's weakest dimension, and rescore them under the same rubric. Keep iterating the strongest concept until it clears [quality threshold] or [budget] ends. Reject anything the video cannot deliver. Return the winner, two runners-up, previews, final scores, and rationale.
- Verify: One accurate thumbnail clears the fixed quality threshold. The winner outscores the alternatives under the same conditions, remains legible at realistic sizes, and represents the video accurately.
- Keywords: Infinite Clickbait, YouTube thumbnail loop, thumbnail iteration workflow, clickbait scoring rubric, AI visual design
- Related: [The Boeing 747 benchmark](https://signals.forwardfuture.com/loop-library/loops/boeing-747-benchmark/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 027 — [The autonomy-loop builder-reviewer loop](https://signals.forwardfuture.com/loop-library/loops/autonomy-loop/)

- Category: Engineering
- Use when: Use autonomy-loop when a repository has deterministic test, build, and lint gates plus a task suited to repeated builder-reviewer handoffs.
- Prompt: Use autonomy-loop for [repository task] after the test, build, and lint gates pass. Run /autonomy-loop:autonomy-init, then start builder and reviewer in separate worktrees. The builder reads LOOP-STATE.md, makes one bounded change, and adds a red-before, green-after test. The reviewer reruns the gates and proves the test by reverting or mutating the fix. Accept only on both passes; park protected or repeated-failure work for a human. Finish with the commit, gate evidence, test proof, trust tier, and risks.
- Verify: Every accepted wave passes autonomy-loop's proof-of-test gate. The new test fails without the change, passes with it, every configured gate passes, and protected production changes remain human-gated.
- Keywords: autonomy-loop, adversarial code review, mutation testing, builder reviewer workflow, Claude Code loop
- Related: [The Clodex adversarial-review loop](https://signals.forwardfuture.com/loop-library/loops/clodex-adversarial-review-loop/), [The Loop Harness verification loop](https://signals.forwardfuture.com/loop-library/loops/loop-harness-verification-loop/)

## 028 — [The Codex completion-contract loop](https://signals.forwardfuture.com/loop-library/loops/codex-completion-contract-loop/)

- Category: Engineering
- Use when: Use this for long-running Codex work, pull requests, runtime checks, or user-visible artifacts where a plausible partial result could be mistaken for completion.
- Prompt: Run $goal-planner-codex [task] for long-running Codex work where partial work could be mistaken for done. Landing a PR and verifying production is one example. Before acting, define every required outcome and its evidence. After each bounded action, mark requirements proved, weak, missing, or contradicted. Complete the Goal only when all are proved; otherwise stop as blocked, stalled, or exhausted. Ask before creating Goal state. Finish with the requirement-to-evidence table, status, owner, and next action.
- Verify: Every Codex Goal requirement has current, adequate proof. The final audit contains no weak, missing, or contradicted required item; otherwise the work remains open, blocked, or exhausted.
- Keywords: Codex Goal, completion contract, evidence audit, definition of done, false completion prevention
- Related: [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/), [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/)

## 029 — [The Revolve versioned-experiment loop](https://signals.forwardfuture.com/loop-library/loops/revolve-self-improvement-loop/)

- Category: Evaluation
- Use when: Use Revolve to improve a prompt, policy, workflow, model configuration, code path, or dataset when experiments must remain comparable and resumable across sessions.
- Prompt: Use Revolve to improve a support prompt, code path, or testable subject. In revolve/, define the goal and [budget], freeze the tests and scoring, checkpoint the current version, and record a baseline. Each round, test one hypothesis; keep only a clear, regression-free win. If the evaluation changes, open a new revision and rerun the baseline. Ask before changing live files. Stop on success, no progress, a blocker, or exhausted budget. Return the best checkpoint, comparisons, rollback, and next action.
- Verify: The best Revolve checkpoint wins within one evaluation revision. The incumbent and candidates have comparable recorded runs, accepted changes pass every guard, rollback is available, and live promotion has approval.
- Keywords: Revolve, agent self improvement, checkpoint evaluation, revisioned experiments, evidence based promotion
- Related: [The self-improving champion loop](https://signals.forwardfuture.com/loop-library/loops/self-improving-champion-loop/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 030 — [The five-minute repository maintainer loop](https://signals.forwardfuture.com/loop-library/loops/five-minute-repository-maintainer-loop/)

- Category: Engineering
- Use when: Use this when Codex may coordinate maintenance across several active repositories and you want parallel work to stay steerable without duplicating or micromanaging threads.
- Prompt: While repository maintenance is active, wake every five minutes. Triage [repositories] and read each repository thread's latest state. Reuse one thread per repository; assign its highest-value bounded task only within granted permissions, and do not interrupt coherent active work. Require tests, live proof, autoreview, and green CI before work can land. Escalate product, access, security, or irreversible decisions. Record meaningful changes and stop when every item is landed, decision-ready, blocked, or has no work.
- Verify: Every repository item reaches a proven handoff or terminal state. Authorized autonomous work lands with evidence; other items are decision-ready, blocked with one exact ask, or recorded as a clean no-op.
- Keywords: Codex repository maintenance, multi-repository orchestration, five minute agent loop, GitHub project triage, thread delegation
- Related: [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/), [The stale-safe batch release loop](https://signals.forwardfuture.com/loop-library/loops/stale-safe-batch-release-loop/)

## 031 — [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/)

- Category: Engineering
- Use when: Use this after several days of project feedback when repeated mistakes may point to similar issues elsewhere and the agent can inspect both the conversation history and the complete current project.
- Prompt: Review all available threads from [lookback window] where I reported something wrong with [project] and asked for a fix. Build a deduplicated issue list, group it into failure patterns, and verify current state. Audit the complete project for every pattern, fix each confirmed instance, and add regression coverage where practical. Repeat the full audit until it finds no remaining instance or [iteration budget] ends. Stop on blocked or approval-gated work. Return the issues, fixes, evidence, and blockers.
- Verify: The issue inventory is closed and a fresh pattern audit is clean. Every reported issue and newly found match has current proof of resolution; blocked, approval-gated, or budget-exhausted items remain explicitly open.
- Keywords: recent user feedback, project-wide issue audit, failure pattern sweep, regression prevention, AI coding agent
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/)

## 032 — [The promise-to-proof loop](https://signals.forwardfuture.com/loop-library/loops/promise-to-proof-loop/)

- Category: Evaluation
- Use when: Use this when what a product says it does may no longer match what it actually does across marketing, documentation, demos, support answers, or the live product.
- Prompt: List every customer-facing promise [product] makes in marketing, documentation, demos, and AI answers. Compare each promise with current product behavior and evidence, then label it proven, partly proven, misleading, unsupported, outdated, or missing evidence. Fix or narrow the riskiest mismatch and rerun the affected check. Repeat until no high-risk unsupported promise remains. Ask before changing production or public copy. Return the promises, evidence, fixes, and decisions needed.
- Verify: Every high-risk customer promise is supported, narrowed, or waiting on an explicit decision. Each promise links to current evidence, and every high-risk mismatch is fixed, narrowed to what the product can prove, or clearly approval-gated.
- Keywords: product promise audit, customer trust, claim verification, evidence based product review, marketing product alignment
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/)

## 033 — [The propagation compliance loop](https://signals.forwardfuture.com/loop-library/loops/propagation-compliance-loop/)

- Category: Engineering
- Use when: Use this after changing something that appears in several files—such as a version number, feature name, count, rule, setting, or identifier—and every copy must stay consistent.
- Prompt: After changing a version, count, rule, name, or configuration, list where the new value belongs and update it. Search the project for the old value and related forms. Review each match: fix real stale values, but keep intentional history, examples, migrations, or compatibility rules. Repeat until zero stale values remain. If one returns for two rounds, stop and identify what may be regenerating it. Return changes, intentional matches, and search output.
- Verify: No unintended copy of the old value remains. The final searches find only references that are intentionally historical or required for examples, migrations, or compatibility, with a reason recorded for each one.
- Keywords: configuration propagation, version update audit, stale value search, repository consistency, grep verification loop
- Related: [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/), [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/)

## 034 — [The multi-LLM convergence loop](https://signals.forwardfuture.com/loop-library/loops/multi-llm-convergence-loop/)

- Category: Evaluation
- Use when: Use this when an important plan, specification, design, document, or code change benefits from two independent AI perspectives rather than one model reviewing its own blind spots.
- Prompt: Review [plan, specification, document, or code change] against [quality bar] for at most [pass limit] rounds. Have one of two genuinely different model families—AI systems from separate providers—review it. Verify each finding and apply only necessary fixes, then give the revised version to the other reviewer. Succeed only when both approve the same unchanged version. Stop at the limit, repeating disagreement (oscillation), unavailable review, or required approval. Return the final work, round log, verdict, and disagreements.
- Verify: Two different AI model families approve the exact same version. The final two clean reviews come from different model families with no edit between them; a pass limit, repeating disagreement, unavailable reviewer, or approval boundary is reported as a stall instead of consensus.
- Keywords: multi LLM review, cross model consensus, artifact convergence, alternating reviewers, independent AI review
- Related: [The Clodex adversarial-review loop](https://signals.forwardfuture.com/loop-library/loops/clodex-adversarial-review-loop/), [The devil's-advocate loop](https://signals.forwardfuture.com/loop-library/loops/devils-advocate-design-loop/)

## 035 — [The Goal Forge loop](https://signals.forwardfuture.com/loop-library/loops/goal-forge-loop/)

- Category: Engineering
- Use when: Use this when a rough coding idea is too vague to hand to Codex for a long autonomous run and the user first needs to settle scope, completion checks, safety boundaries, and required tools.
- Prompt: Turn [rough coding idea] into two planning files before Codex starts /goal, its long-running task mode. Interview the user, then write SPEC.md: what to build, exclude, and consider, plus measurable done_when completion checks. Write GOAL.md: the work plan, progress scorecard, quick and final checks, memory files, evidence, and approval boundaries. If any key decision, permission, tool, environment requirement, or test is missing, stop as not ready. Do not start implementation without approval.
- Verify: The planning files say what to build, how to judge it, and when to stop. Every done_when completion check names observable evidence, the quick and final checks can actually run, the environment is ready, and unresolved decisions are clearly marked not ready.
- Keywords: Goal Forge, Codex goal planning, SPEC.md, GOAL.md, autonomous coding contract
- Related: [The Codex completion-contract loop](https://signals.forwardfuture.com/loop-library/loops/codex-completion-contract-loop/), [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/)

## 036 — [The UI/UX Score Loop](https://signals.forwardfuture.com/loop-library/loops/ui-ux-score-loop/)

- Category: Design
- Use when: Use this for a real task such as signup, login, onboarding, checkout, sharing, or creating and editing an item when the entire experience can be exercised in a browser and scored consistently.
- Prompt: Improve [user flow, such as signup] at [URL] until [completion criterion]. In a real browser, start each pass from fresh state—no saved login, cookies, or site data. Capture meaningful screens at the agreed sizes and modes, score them with one checklist, and improve the weakest safe area. Rerun the whole flow and keep only regression-free changes. Stop on success, two full passes with no gain, blocked access, or required approval. Return scores, screenshots, changes, and stop reason.
- Verify: The complete user task scores better without making another important screen worse. The final dashboard shows the same entry point, fresh browser state, screen sizes, modes, scoring rubric, screenshots, score changes, and stop reason for every retained improvement.
- Keywords: UI UX score, browser flow audit, responsive design review, fresh browser state, user experience improvement
- Related: [War Loops: frontend reconstruction](https://signals.forwardfuture.com/loop-library/loops/war-loops-frontend-designer/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 037 — [The cold-load trimmer loop](https://signals.forwardfuture.com/loop-library/loops/cold-load-trimmer-loop/)

- Category: Engineering
- Use when: Use this when a web app feels heavy on its first visit because it downloads too much code, styling, media, or other data before showing the initial screen.
- Prompt: Reduce the data [web app] downloads before its first screen appears. First record passing tests, mobile and desktop screenshots, and compressed transferred bytes—the data actually downloaded. Use the build report only to suggest candidates. Defer, compress, or remove one item, then rebuild and rerun every check. Keep it only if tests pass, screenshots are pixel-identical, and bytes decrease; otherwise revert. Stop when no safe candidate remains, progress stalls, or approval is needed. Return measurements, changes, and untested states.
- Verify: The first screen downloads less data without a tested behavior or pixel changing. The same production-like measurement reports fewer downloaded bytes, existing tests pass, every representative screenshot is pixel-identical, and uncertain dependency removal remains approval-gated.
- Keywords: first load bytes, bundle size optimization, pixel identical screenshots, lazy loading, web performance loop
- Related: [The sub-50 ms page-load loop](https://signals.forwardfuture.com/loop-library/loops/sub-50ms-page-load-loop/), [The pixel-safe CSS trim loop](https://signals.forwardfuture.com/loop-library/loops/pixel-safe-css-trim-loop/)

## 038 — [The pixel-safe CSS trim loop](https://signals.forwardfuture.com/loop-library/loops/pixel-safe-css-trim-loop/)

- Category: Design
- Use when: Use this when a website's styling files may contain unused declarations, duplicated rules, or old overrides and representative pages and interactions can be captured in repeatable screenshots.
- Prompt: Reduce the CSS styling code [site] sends to users without changing tested screens. First capture representative pages, sizes, themes, and interactions, and record the built CSS size. Treat coverage reports only as suggestions. Remove one declaration or rule, rebuild, and rerun screenshots and project checks. Keep it only if every screenshot is pixel-identical and built CSS is smaller; otherwise revert. Stop when no supported candidate remains, progress stalls, or approval is required. Return reduction, evidence, and untested states.
- Verify: The delivered stylesheet is smaller while every tested screen remains pixel-identical. The same project checks and screenshots pass after each retained deletion, the built CSS file sent to users is smaller, and untested browsers, screens, or interactions remain explicit risks.
- Keywords: CSS cleanup, pixel safe CSS, visual regression testing, dead CSS removal, stylesheet optimization
- Related: [The cold-load trimmer loop](https://signals.forwardfuture.com/loop-library/loops/cold-load-trimmer-loop/), [The UI/UX Score Loop](https://signals.forwardfuture.com/loop-library/loops/ui-ux-score-loop/)

## 039 — [The easy onboarding loop](https://signals.forwardfuture.com/loop-library/loops/easy-onboarding-loop/)

- Category: Evaluation
- Use when: Use this when new users may face unclear instructions, hidden assumptions, difficult recovery, or unnecessary steps that experienced users no longer notice because their accounts and browsers remember earlier setup.
- Prompt: Act like a first-time user of [product]. Start at the real entry point in a clean session with no saved login, site data, remembered route, or hidden setup. Complete onboarding using only visible guidance and record obstacles. Fix the worst one with the smallest change that preserves every security, access, and product requirement. Discard the session and retry. Stop after one uninterrupted success, no safe fix, blocked access, or required approval. Return the path, changes, evidence, and blockers.
- Verify: A first-time user can complete onboarding in one uninterrupted clean session. The full experience succeeds from the real starting point without saved browser state, secret setup, guessed routes, or manual repairs, and every real requirement remains intact.
- Keywords: onboarding improvement, fresh session testing, new user experience, agent friendly onboarding, onboarding friction
- Related: [The fresh-clone loop](https://signals.forwardfuture.com/loop-library/loops/fresh-clone-loop/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 040 — [The accessibility repair loop](https://signals.forwardfuture.com/loop-library/loops/accessibility-repair-loop/)

- Category: Design
- Use when: Use this when a website or app has a defined accessibility target and you can repeatedly test the relevant pages, components, or tasks for people using keyboards, screen readers, zoom, or other access methods.
- Prompt: Check [scope] against [accessibility standard, such as WCAG 2.2 AA] with automated scans and available keyboard, screen-reader, and other manual tests. Confirm each issue, rank it by harm, and fix the highest-impact blocker. Rerun the same checks, affected task, and regression tests. Keep only verified fixes. Stop when no blocker remains, progress stalls, verification is unavailable, or approval is required. Never silence a check or weaken the target. Return issues, fixes, evidence, exceptions, and untested needs.
- Verify: No confirmed accessibility barrier remains in the agreed pages, components, or user tasks. The same automated scans, available manual checks, affected user task, and regression tests pass after each retained fix without lowering the chosen accessibility standard.
- Keywords: accessibility audit, accessibility repair, WCAG workflow, inclusive design testing, accessibility regression
- Related: [The UI/UX Score Loop](https://signals.forwardfuture.com/loop-library/loops/ui-ux-score-loop/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 041 — [The housekeeper loop](https://signals.forwardfuture.com/loop-library/loops/housekeeper-loop/)

- Category: Engineering
- Use when: Use this when a code project has accumulated small maintenance problems—unused code, stale files, duplicated logic, broken links, old comments, inconsistent names, or confusing organization—but broad deletion would be risky.
- Prompt: Review [repository or code project] for dead code, meaning unreachable or unused code; stale files or comments; unused dependencies; duplication; broken links; inconsistent names; and confusing structure. Protect unrelated, active, uncommitted, generated, and uncertain work. Prove one low-risk cleanup, make the smallest coherent change, then rerun the build, tests, runtime checks, and diff review. Keep only verified improvements. Stop when none remain, progress stalls, verification is unavailable, or approval is required. Return changes, evidence, and deferred candidates.
- Verify: No confirmed low-risk cleanup remains, and existing behavior still passes. Every retained cleanup is supported by direct evidence, relevant builds and tests pass, the application still runs where applicable, unrelated work is untouched, and uncertain candidates are deferred rather than deleted.
- Keywords: codebase housekeeping, dead code cleanup, unused dependency review, repository hygiene, incremental cleanup
- Related: [The repository cleanup loop](https://signals.forwardfuture.com/loop-library/loops/repository-cleanup-loop/), [The docs sweep](https://signals.forwardfuture.com/loop-library/loops/overnight-docs-sweep/)

## 042 — [The Axelrod subagent arena loop](https://signals.forwardfuture.com/loop-library/loops/axelrod-subagent-arena-loop/)

- Category: Evaluation
- Use when: Use this as a controlled experiment to see whether AI agents learn repeated-interaction behaviors such as cooperation, retaliation after betrayal, forgiveness, exploitation, and different strategies for different opponents.
- Prompt: Run a fixed Axelrod tournament with two reasoning AI agents. Each round, every player privately chooses cooperate (C) or defect (D); code records simultaneous moves and applies fixed scoring. Include always-defect and always-cooperate comparison players. Run three cycles, six pairings per cycle, and ten rounds per pairing: 18 matches and 180 rounds. Hide opponent type and private reasoning. Validate every move and total. Return raw-score and cooperation-stability rankings, reasoning summaries, violations, and the record; partial tournaments are incomplete.
- Verify: All 18 matches and 180 rounds can be reproduced from the recorded moves and fixed scoring rules. Each agent chooses before seeing the opponent's move, every move is recorded before scoring, totals reproduce from the full history, invalid responses are logged, and any partial or invalid tournament remains explicitly incomplete.
- Keywords: Axelrod tournament, Iterated Prisoner's Dilemma, multi agent benchmark, agent cooperation, reasoning subagent evaluation
- Related: [The Boeing 747 benchmark](https://signals.forwardfuture.com/loop-library/loops/boeing-747-benchmark/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 043 — [The prepare-a-new-project loop](https://signals.forwardfuture.com/loop-library/loops/prepare-new-project-loop/)

- Category: Engineering
- Use when: Use this before building a new software project when its idea or early documents still leave important implementation decisions open to interpretation.
- Prompt: Prepare [project] for implementation. Ensure its documents cover requirements, technical design, tasks with acceptance criteria, and test strategy. Each round, fix the largest gap or contradiction that could make two competent engineers build different systems. Keep details traceable, record assumptions, and ask before product forks. Recheck consistency, then have two independent reviewers describe the components, data model, dependencies, and definition of done. Stop when they materially agree and every artifact is testable, or a decision needs the user.
- Verify: Two independent reviewers derive substantially the same build from the project documents. Their descriptions agree on the components, data model, dependencies, and definition of done, and every required artifact is specific, consistent, traceable, and testable.
- Keywords: project planning loop, build ready documentation, technical design review, requirements convergence, software project preparation
- Related: [The Goal Forge loop](https://signals.forwardfuture.com/loop-library/loops/goal-forge-loop/), [The multi-LLM convergence loop](https://signals.forwardfuture.com/loop-library/loops/multi-llm-convergence-loop/), [The Codex completion-contract loop](https://signals.forwardfuture.com/loop-library/loops/codex-completion-contract-loop/)

## 044 — [The test stabilizer loop](https://signals.forwardfuture.com/loop-library/loops/test-stabilizer-loop/)

- Category: Engineering
- Use when: Use this when a test suite produces inconsistent results across otherwise comparable runs and the failures may come from shared state, timing, ordering, or external dependencies.
- Prompt: Run [test suite] [N] times under the same conditions and list tests whose result changes. Fix the most frequent flake at its root cause—shared state, timing, ordering, or an external dependency—never with a blind sleep or retry. Run that test [N] times, then rerun the full suite. Repeat until [N] consecutive full-suite runs pass, progress stalls, or approval is required. Return each flake, root cause, fix, evidence, and justified quarantine.
- Verify: The full test suite passes for the required consecutive-run streak. The repaired test passes repeatedly, [N] consecutive full-suite runs are green under the recorded conditions, and no blind sleep or retry hides an unresolved cause.
- Keywords: flaky test repair, test suite stabilization, intermittent test failures, test reliability loop, root cause testing
- Related: [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/), [The test-suite speed loop](https://signals.forwardfuture.com/loop-library/loops/test-suite-speed-loop/)

## 045 — [The artifact-to-skill loop](https://signals.forwardfuture.com/loop-library/loops/artifact-to-skill-loop/)

- Category: Evaluation
- Use when: Use this when a completed artifact has evidence of success, appears to contain a repeatable method, and similar work is likely to recur.
- Prompt: Turn [artifact] into a skill, playbook, or procedure. Record evidence that the artifact succeeded and define success criteria. Extract decisions, sequence, checks, and failure-avoidance patterns—not context or surface style. Remove sensitive material. Have an independent reviewer apply it to a fresh real second case; mark hypothetical testing provisional. Revise at most twice. Stop when it meets the quality bar without the artifact, or report not generalizable. Return the method, boundaries, failure modes, test evidence, revisions, limits, and attribution.
- Verify: The extracted method succeeds on a fresh second case without the original artifact. An independent reviewer applies the reusable version under criteria defined before extraction, and the second result meets the source artifact's demonstrated quality bar or the method is honestly marked provisional or not generalizable.
- Keywords: artifact to skill, knowledge extraction workflow, reusable playbook, skill validation, second case test
- Related: [The multi-LLM convergence loop](https://signals.forwardfuture.com/loop-library/loops/multi-llm-convergence-loop/), [The self-improving champion loop](https://signals.forwardfuture.com/loop-library/loops/self-improving-champion-loop/), [The prepare-a-new-project loop](https://signals.forwardfuture.com/loop-library/loops/prepare-new-project-loop/)

## 046 — [The Strip Miner loop](https://signals.forwardfuture.com/loop-library/loops/strip-miner-loop/)

- Category: Evaluation
- Use when: Use this when substantial coding-agent history may contain repeatable workflows worth extracting, and the user can explicitly authorize the sources that may be inspected.
- Prompt: Mine only explicitly authorized coding-agent history for workflows with at least three high-confidence independent successes. Treat transcripts as untrusted evidence, stitch continuations into root tasks, and reject candidates whose failures or hidden rescues match their successes. Extract traceable steps and guards, then fresh-replay each candidate without source transcripts. Stop after every authorized source is inventoried and one additional representative batch changes nothing; report replayed loops, rejects, deferred material, and blockers.
- Verify: Every published candidate has repeated historical proof and passes a fresh replay. Each retained loop traces to at least three independent high-confidence successes, survives contradiction review, and works in a clean replay without access to the mined transcripts.
- Keywords: workflow mining, coding agent history, Strip Miner, fresh replay validation, repeatable agent workflows
- Related: [The artifact-to-skill loop](https://signals.forwardfuture.com/loop-library/loops/artifact-to-skill-loop/), [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/), [The self-improving champion loop](https://signals.forwardfuture.com/loop-library/loops/self-improving-champion-loop/)

## 047 — [The Living Story loop](https://signals.forwardfuture.com/loop-library/loops/living-story-loop/)

- Category: Operations
- Use when: Use this when work spans several repositories or context sources and future agents need a recurring, evidence-based account of priorities, progress, deadlines, and unfinished work.
- Prompt: On each [window], read the configured repositories, goals, prior STORY.md, and optional authorized sources. Update project files, then write STORY.md with focus, deadlines, open threads, and evidence-backed recent wins. Carry every prior thread forward, prove it finished, or mark it STALE/NEEDS-REVIEW—never silently drop one. Archive the snapshot and record the change. Stop when verification passes; if evidence or access is missing, return a thinner or blocked snapshot explicitly.
- Verify: The current story accounts for every prior thread and supports every recent win with evidence. Each previous open thread is carried forward, closed with proof, or visibly flagged, and every claimed win cites a commit, release, closed task, deployment, sent deliverable, or generated artifact.
- Keywords: Living Story, agent context management, project status narrative, open thread tracking, evidence based progress
- Related: [The five-minute repository maintainer loop](https://signals.forwardfuture.com/loop-library/loops/five-minute-repository-maintainer-loop/), [The nightly changelog loop](https://signals.forwardfuture.com/loop-library/loops/nightly-changelog-sweep/), [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/)

## 048 — [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/)

- Category: Engineering
- Use when: Use this before trusting a project's security, correctness, platform compatibility, privileged surfaces, scheduled work, or operational assumptions and when the first task is audit rather than repair.
- Prompt: Audit [project] from its actual code and configuration, not framework assumptions. For architecture, platform compatibility, security, privileged areas, performance, deployment, jobs, business logic, and code quality, record proved, no issue, weak, or N/A with direct evidence; verify external limits from current primary sources and calculate numbers. Ask before changing code. Stop when every area is logged with severity, or return unverified areas as blocked. Finish with a plain-language overview and area-to-evidence table.
- Verify: Every audit area has a current evidence-backed outcome and severity. The area-to-evidence table contains no silent gaps: each area is proved, no issue found, weak, N/A with a reason, or explicitly unverified and blocked.
- Keywords: Groundtruth audit, evidence based code review, project security audit, platform compatibility review, area to evidence table
- Related: [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The promise-to-proof loop](https://signals.forwardfuture.com/loop-library/loops/promise-to-proof-loop/), [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/)

## 049 — [The Recovery Proof loop](https://signals.forwardfuture.com/loop-library/loops/recovery-proof-loop/)

- Category: Operations
- Use when: Use this when backup existence is not enough and the organization needs repeatable proof that required systems can be restored from documented materials within agreed recovery objectives.
- Prompt: For each required recovery scenario, randomly select an eligible real backup or recovery point and restore from zero in a disposable, isolated clean-room using only documented materials. Verify integrity, dependencies, representative reads and writes, and actual RPO and RTO. Repair one blocker, destroy the environment, and retry fresh. Stop when every scenario reaches its predefined consecutive-success streak or an exception is explicitly accepted. Never overwrite production, expose restored data, or initiate failover without approval.
- Verify: Every required recovery scenario succeeds repeatedly from a real recovery point. Fresh clean-room restores satisfy integrity, dependency, representative read/write, RPO, and RTO checks under unchanged criteria, with failures preserved as regression drills and restored data destroyed securely.
- Keywords: backup recovery testing, disaster recovery drill, RPO and RTO validation, clean room restore, recovery proof
- Related: [The quality streak loop](https://signals.forwardfuture.com/loop-library/loops/quality-streak-loop/), [The post-release baseline loop](https://signals.forwardfuture.com/loop-library/loops/post-release-baseline-loop/), [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/)

## 050 — [The refund follow-up loop](https://signals.forwardfuture.com/loop-library/loops/refund-follow-up-loop/)

- Category: Operations
- Use when: Use this when someone owes you a refund and getting it may take more than one support conversation or follow-up.
- Prompt: Get my refund for [company and charge info]. Start the claim now through an approved support channel, then keep following up on replies, promises, and deadlines until the refund arrives. Keep a short case note so each follow-up has context. Stop only when the refund is received or you are genuinely blocked and need me.
- Verify: The refund is received, or a genuine blocker requires the user. An open claim, promise, or pending refund is progress, not success; keep following up until the money arrives or no approved next step remains.
- Keywords: refund follow up, consumer advocacy, customer support escalation, refund status tracking, case log
- Related: [The promise-to-proof loop](https://signals.forwardfuture.com/loop-library/loops/promise-to-proof-loop/), [The Living Story loop](https://signals.forwardfuture.com/loop-library/loops/living-story-loop/), [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/)

## 051 — [The next-action confidence check](https://signals.forwardfuture.com/loop-library/loops/next-action-confidence-check/)

- Category: Evaluation
- Use when: Use this after an agent finishes a task and another visible action could tempt it to continue beyond the user's original request.
- Prompt: Run an exit check on the task most recently completed in this conversation or workspace. This check does not authorize additional work. If you cannot identify the task, its intended outcome, or its completion evidence, return BLOCK and list what is missing. Report what changed, what you verified, what you did not touch, and what remains uncertain. Classify the current task as PASS, DELAY, or BLOCK. Separately classify the next visible action as GO, HOLD, CAP, or BLOCK. Explain the decision briefly. If you choose CAP, define its exact scope and limit. Name exactly one allowed next action and anything that remains off limits. Do not begin the action, even if the result is GO. Stop and wait for the user. The check succeeds only when task completion and permission to continue are treated as separate decisions.
- Verify: Completion and permission to continue are evaluated separately. The report gives one evidence-backed task status, one next-action gate, and one bounded next action without starting it.
- Keywords: next action gate, AI agent stop condition, task completion check, scope control, human approval
- Related: [The Codex completion-contract loop](https://signals.forwardfuture.com/loop-library/loops/codex-completion-contract-loop/), [The Goal Forge loop](https://signals.forwardfuture.com/loop-library/loops/goal-forge-loop/), [The promise-to-proof loop](https://signals.forwardfuture.com/loop-library/loops/promise-to-proof-loop/)

## 052 — [The research-to-artifact loop](https://signals.forwardfuture.com/loop-library/loops/research-to-artifact-loop/)

- Category: Content
- Use when: Use this when research should end in a specific decision, brief, recommendation, specification, page, or other artifact rather than a pile of notes.
- Prompt: Research [question or topic] and produce a decision-ready [memo, brief, specification, recommendation, page, or other artifact] for [audience or decision]. If the question, audience, or intended artifact is missing, ask one focused question before starting. State the decision the artifact should support, its acceptance criteria, the allowed source scope, and the research budget. If no budget is supplied, use no more than ten strong sources or ninety minutes. Prefer current primary sources where available. After each research pass, update the artifact and identify the largest remaining evidence gap, contradiction, or uncertainty. Continue only if resolving it could materially change the decision and the budget allows another pass. Never invent evidence or hide uncertainty. Stop when the artifact meets its acceptance criteria, important claims trace to sources, and remaining uncertainty is explicit. Otherwise stop as blocked or exhausted. Finish with the completed artifact, sources, findings, tensions, confidence level, open questions, and recommended next step.
- Verify: The artifact is ready to support its stated decision. The deliverable meets its acceptance criteria, traces important claims to sources, and states material uncertainty instead of hiding it.
- Keywords: research synthesis, decision ready artifact, evidence based memo, research workflow, source grounded writing
- Related: [The artifact-to-skill loop](https://signals.forwardfuture.com/loop-library/loops/artifact-to-skill-loop/), [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/), [The Goal Forge loop](https://signals.forwardfuture.com/loop-library/loops/goal-forge-loop/)

## 053 — [The error-message rewrite loop](https://signals.forwardfuture.com/loop-library/loops/error-message-rewrite-loop/)

- Category: Engineering
- Use when: Use this when a product exposes raw, internal, inconsistent, or unhelpful error messages and the complete user-facing error surface needs a controlled rewrite.
- Prompt: Find and improve every user-visible error message within [repository, product, or named scope]. If no scope is supplied, use the user-facing surfaces in the current repository and state any exclusions before editing. Inventory error strings in source code, surfaced API or client errors, and reachable browser states. Record each one in a CSV with its location, trigger, current copy, user risk, proposed replacement, implementation status, and verification result. Rank the errors by user harm. Rewrite one coherent group at a time using plain language and a useful recovery step when one exists. Do not expose provider names, stack traces, internal identifiers, or implementation details. After each change, run the relevant tests, exercise the affected state in a real browser when possible, and search again for raw or internal error text. Do not mark an unreachable state as verified. Stop when every row is verified or explicitly blocked. Finish with the CSV, changed files, test evidence, browser evidence, and blocked items.
- Verify: Every in-scope user-facing error is clear and accounted for. The inventory contains no silently skipped row: each error is verified in its reachable state or marked blocked with the missing evidence.
- Keywords: error message rewrite, user facing errors, UX writing, browser error states, error inventory
- Related: [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/), [The propagation compliance loop](https://signals.forwardfuture.com/loop-library/loops/propagation-compliance-loop/), [The accessibility repair loop](https://signals.forwardfuture.com/loop-library/loops/accessibility-repair-loop/)

## 054 — [The stable-frame-rate loop](https://signals.forwardfuture.com/loop-library/loops/stable-frame-rate-loop/)

- Category: Engineering
- Use when: Use this when a game or interactive build has unstable frame rate and performance can be measured under one representative, repeatable scenario.
- Prompt: Improve the frame-rate stability of [game or interactive build]. Before editing, define one repeatable benchmark with the same scene, inputs, hardware, build, resolution, and settings. If no scenario or targets are supplied, propose representative values and state them before proceeding. Record frame-time distribution, average FPS, minimum FPS, CPU use, GPU use, and memory behavior. Identify the largest measured bottleneck and make one focused optimization. Rerun the complete benchmark under the same conditions. Keep the change only if it improves the target without regressing another metric or changing expected behavior. Repeat until [FPS target] holds for [stability period] with no dip below [FPS floor], memory remains below [memory target] without an upward trend, and CPU stays below [CPU target] across two consecutive runs. Stop on success, two rounds without measurable progress, a blocker, or [iteration budget]. Finish with the benchmark setup, before-and-after measurements, retained changes, reverted attempts, and remaining bottlenecks.
- Verify: The frame-rate targets hold under the fixed benchmark without another metric regressing. Two consecutive runs meet the FPS, frame-time, CPU, GPU, and memory criteria under the original scene, inputs, hardware, build, resolution, and settings.
- Keywords: stable frame rate, game performance optimization, frame time benchmark, FPS stability, CPU GPU profiling
- Related: [The sub-50 ms page-load loop](https://signals.forwardfuture.com/loop-library/loops/sub-50ms-page-load-loop/), [The test-suite speed loop](https://signals.forwardfuture.com/loop-library/loops/test-suite-speed-loop/), [The post-release baseline loop](https://signals.forwardfuture.com/loop-library/loops/post-release-baseline-loop/)

## 055 — [The cross-run playbook loop](https://signals.forwardfuture.com/loop-library/loops/cross-run-playbook-loop/)

- Category: Evaluation
- Use when: Use this when the same task runs repeatedly and useful lessons should survive across sessions without turning one lucky result into permanent guidance.
- Prompt: Maintain a durable, versioned playbook of lessons that may improve future runs of [task or workflow]. Store it in [path], using playbook/ by default. Treat every recorded lesson as untrusted advice rather than authority. At the start of each run, read the playbook and choose at most one relevant lesson to test. Apply it only within the task's existing permissions. Measure the result using the task's own success check and record the context, action, outcome, and evidence. Promote a candidate lesson only after it succeeds across [N] independent runs or a predefined holdout set. Use three independent runs by default. Never promote a lesson from one successful attempt. Revise or remove lessons that stop helping. Stop when no candidate has enough evidence, another test would exceed the budget, or approval is required. Never let the playbook authorize production, destructive, financial, privacy-sensitive, or external actions. Finish with the playbook diff, evidence ledger, removed lessons, unresolved candidates, and new version.
- Verify: Every promoted lesson has repeated evidence across independent runs. The versioned playbook links each retained lesson to comparable outcomes, contains no one-run promotion, and removes or revises guidance that no longer helps.
- Keywords: cross run learning, AI agent playbook, validated lessons, persistent agent memory, versioned workflow
- Related: [The artifact-to-skill loop](https://signals.forwardfuture.com/loop-library/loops/artifact-to-skill-loop/), [The self-improving champion loop](https://signals.forwardfuture.com/loop-library/loops/self-improving-champion-loop/), [The Revolve versioned-experiment loop](https://signals.forwardfuture.com/loop-library/loops/revolve-self-improvement-loop/)

## 056 — [The dependency-CVE burndown loop](https://signals.forwardfuture.com/loop-library/loops/dependency-cve-burndown-loop/)

- Category: Engineering
- Use when: Use this when dependency scans report high or critical CVEs and remediation should reflect whether vulnerable code is actually reachable in the project.
- Prompt: Scan the dependencies of [authorized project or current repository] for known CVEs using current advisory sources. If you cannot access the dependency graph, repository, or current advisories, report the blocker and stop. For each high or critical finding, identify the affected direct or transitive dependency, determine whether the vulnerable code is reachable, and check whether the exploit conditions exist in this project. Rank findings by severity, reachability, exposure, and available remediation. Patch or upgrade the highest-risk reachable dependency using the smallest credible change. Run the build, tests, and security scan again. Keep the change only if verification passes and no unacceptable regression appears. Repeat until no exploitable high or critical CVE remains, or every remaining finding has an evidence-backed reachability assessment and an approved risk decision. Ask before major or breaking upgrades, production changes, or accepting risk. Finish with the CVE inventory, reachability evidence, fixes, verification results, and remaining risks.
- Verify: No exploitable high or critical dependency CVE remains without an explicit decision. Current scans, code-path evidence, the passing build and tests, and approved risk decisions account for every high or critical finding.
- Keywords: dependency CVE, vulnerability reachability, security patch workflow, dependency upgrade, software supply chain security
- Related: [The production error sweep](https://signals.forwardfuture.com/loop-library/loops/production-error-sweep/), [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/), [The housekeeper loop](https://signals.forwardfuture.com/loop-library/loops/housekeeper-loop/)

## 057 — [The Loop Hiring Manager](https://signals.forwardfuture.com/loop-library/loops/loop-hiring-manager/)

- Category: Operations
- Use when: Use this when a project has recurring agent work or automation ideas but needs to decide which loops are actually worth adopting.
- Prompt: Decide whether [project or current workspace] needs new recurring agent loops. If the project cannot be identified, ask for it before continuing. Review its goals, repeated failures, recurring chores, existing automation, and adopted loops. Read the current published Loop Library from https://signals.forwardfuture.com/loop-library/api/loops. Find recurring outcomes that lack reliable ownership, a repeatable process, or proof of completion. For the strongest gap, prefer an exact published loop. If none fits, propose the smallest grounded adaptation. Design a new loop only when neither option works. Keep no more than three evidence-backed candidates and recommend at most one manual trial. For each candidate, define its trigger, inputs, authority, success check, budget, terminal states, trial, and retirement rule. Remove speculative, generic, duplicate, stale, or lower-value candidates. Do not install, schedule, or run anything without approval. Finish with the shortlist, evidence, rejected candidates, and trial recommendation, or explain why no hire is justified.
- Verify: Every recommended loop fills a proven recurring gap and has a safe trial and retirement rule. The final slate contains at most three non-duplicate candidates, one recommended manual trial, evidence of recurrence, and explicit authority, success, budget, terminal, and retirement rules.
- Keywords: Loop Hiring Manager, agent workflow portfolio, automation selection, loop adoption, manual trial
- Related: [The five-minute repository maintainer loop](https://signals.forwardfuture.com/loop-library/loops/five-minute-repository-maintainer-loop/), [The artifact-to-skill loop](https://signals.forwardfuture.com/loop-library/loops/artifact-to-skill-loop/), [The housekeeper loop](https://signals.forwardfuture.com/loop-library/loops/housekeeper-loop/)

## 058 — [The loop-auditor loop](https://signals.forwardfuture.com/loop-library/loops/loop-auditor-loop/)

- Category: Evaluation
- Use when: Use this when several active loops need a read-only portfolio review to determine which should continue, change, retire after success, or stop wasting budget.
- Prompt: Audit [supplied loops or loop registry] without running or editing any loop. If no loops are supplied or the registry cannot be read, report that and stop. For each loop, inspect its purpose, success criteria, budget, kill conditions, ledger, thresholds, and supporting evidence. Assign INSUFFICIENT EVIDENCE when required information is missing. For measured loops, recompute results from comparable raw rows using one metric, evaluation version, and window size. Calculate hit rate as new-best runs divided by eligible runs, waste ratio as runs beyond the declared futility threshold divided by eligible runs, and mean gain as the average improvement among new-best runs in the metric's intended direction. Compare the current window with the previous two comparable windows. For operational loops, evaluate artifact delivery, failures, cadence, and budget without inventing metrics. Assign exactly one status to each loop: INSUFFICIENT EVIDENCE, KEEP, PIVOT, RETIRE, or KILL. Recommend only. Stop after every supplied loop has one evidence-backed status. Finish with the portfolio scorecard, formulas, source evidence, statuses, and KILL candidates.
- Verify: Every supplied loop has exactly one status supported by comparable evidence. The scorecard shows formulas, raw sources or artifacts, the evaluation version and window, and no invented metric or silently unclassified loop.
- Keywords: loop auditor, agent workflow evaluation, automation portfolio, kill condition, waste ratio
- Related: [The Revolve versioned-experiment loop](https://signals.forwardfuture.com/loop-library/loops/revolve-self-improvement-loop/), [The self-improving champion loop](https://signals.forwardfuture.com/loop-library/loops/self-improving-champion-loop/), [The Living Story loop](https://signals.forwardfuture.com/loop-library/loops/living-story-loop/)

## 059 — [The talk-to-five-buyers loop](https://signals.forwardfuture.com/loop-library/loops/talk-to-five-buyers-loop/)

- Category: Content
- Use when: Use this when landing-page copy may be missing the real objection that nearly prevented recent customers from buying.
- Prompt: Improve [landing page or purchase page] using objections from recent buyers. Before contacting anyone, identify the approved buyer group, outreach channel, privacy rules, and message. Obtain explicit approval for the outreach. Interview buyers in batches of five, up to fifteen people total. Ask each person one question: What almost stopped you from buying? Record their exact words while protecting their identity and honoring any consent or communication requirements. After each batch, group repeated concerns and draft a proposed copy change for the point on the page where each concern is most likely to arise. Do not publish the copy without approval. Use the next batch to check whether the same concern still appears. Stop when the concern no longer repeats, fifteen interviews are complete, the outreach budget ends, or access is blocked. Finish with anonymized quotes, recurring concerns, proposed copy, evidence by batch, and the recommended page change.
- Verify: The proposed copy reflects repeated buyer objections rather than internal assumptions. The final recommendation traces each proposed change to anonymized exact language from approved interviews and shows how the pattern changed across batches.
- Keywords: buyer interviews, customer objections, landing page copy, voice of customer, conversion research
- Related: [The promise-to-proof loop](https://signals.forwardfuture.com/loop-library/loops/promise-to-proof-loop/), [The recent-feedback sweep](https://signals.forwardfuture.com/loop-library/loops/recent-feedback-sweep/), [The customer AI deployment loop](https://signals.forwardfuture.com/loop-library/loops/customer-ai-deployment-loop/)

## 060 — [The one-post-a-week loop](https://signals.forwardfuture.com/loop-library/loops/one-post-a-week-loop/)

- Category: Content
- Use when: Use this when someone wants a repeatable weekly content format but needs to learn from replies, saves, and questions instead of guessing from likes.
- Prompt: Find a repeatable weekly post format for [approved account, audience, and topic] through a six-week experiment. If the account, audience, or topic is missing, ask for it before drafting. Obtain approval before publishing anything externally. Each week, draft one short post about a real problem [person, product, or company] solves. Record substantive replies, saves, and questions after the same measurement window. Treat likes as secondary evidence. Keep the audience, topic area, cadence, and measurement window comparable. Change only one meaningful element each week, such as the opening, format, example, or call to action, based on the strongest signal from the previous post. Stop when one format materially outperforms the alternatives, the six-week experiment ends without a winner, approval is withheld, required metrics are unavailable, or the budget is exhausted. Never fabricate engagement data. Finish with every post, its measurements, the variables tested, the winning format or no-winner result, and the next recommendation.
- Verify: The recommended format is supported by comparable audience-response evidence. The experiment records each approved post, one changed variable, the same measurement window, and a clear winner or honest no-winner result after six weeks.
- Keywords: weekly content experiment, social media workflow, audience response, content format testing, one post a week
- Related: [The product update podcast loop](https://signals.forwardfuture.com/loop-library/loops/product-update-podcast-loop/), [The Infinite Clickbait thumbnail loop](https://signals.forwardfuture.com/loop-library/loops/infinite-clickbait-loop/), [The self-improving champion loop](https://signals.forwardfuture.com/loop-library/loops/self-improving-champion-loop/)

## 061 — [The LaTeX document creation loop](https://signals.forwardfuture.com/loop-library/loops/latex-document-creation-loop/)

- Category: Content
- Use when: Use this when supplied research material must become a complete, auditable LaTeX preprint without external image assets or fabricated evidence.
- Prompt: Create a complete LaTeX preprint about [topic] using [supplied sources, assumptions, and data]. If the topic or required source material is missing, request it and stop. Do not invent claims, citations, or data. Use explicit placeholders for missing information. Include exactly these sections in order: Abstract, Introduction, Methods, Results, Discussion, Conclusion, and References. Build every figure and table with native LaTeX tools such as TikZ, pgfplots, and booktabs. Do not use \includegraphics, \svg, or external image files. Every substantive claim must trace to a numbered equation, citation, supplied datum, or labeled assumption. Compile using the project's documented command or latexmk when no command is specified. Inspect compilation errors, warnings, typography, cross-references, and figure placement. Fix the most serious issue and compile again for at most five rounds. Stop when compilation has zero errors, all seven sections are present, every figure and table is referenced before it appears, and no banned command remains. Otherwise stop as blocked or exhausted. Finish with the .tex file, compilation command and log, structural checks, three substantive weaknesses, three typography issues, and unresolved placeholders.
- Verify: The preprint compiles cleanly and every claim and native visual is accounted for. The final source has all seven ordered sections, zero compilation errors, no banned image command, traceable claims, referenced native figures and tables, and explicit remaining weaknesses.
- Keywords: LaTeX document creation, AI research writing, TikZ pgfplots, source traceability, LaTeX compilation
- Related: [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/), [The multi-LLM convergence loop](https://signals.forwardfuture.com/loop-library/loops/multi-llm-convergence-loop/), [The artifact-to-skill loop](https://signals.forwardfuture.com/loop-library/loops/artifact-to-skill-loop/)

## 062 — [The pre-publish source-check loop](https://signals.forwardfuture.com/loop-library/loops/pre-publish-source-check-loop/)

- Category: Content
- Use when: Use this immediately before publishing an article, newsletter, post, report, or other factual draft whose claims, quotations, and attributions need a final evidence check.
- Prompt: Before publishing [draft], inventory every factual, statistical, quoted, or attributed claim a reader could verify. Find the best current primary source for each and label it supported, outdated, misattributed, unsupported, or unverifiable. Fix the riskiest mismatch, then recheck that claim and anything depending on it. Repeat until no high-risk unsupported claim remains or five rounds are exhausted. Never invent a source, cite evidence that does not support the claim, or alter a quotation. Ask before changing a named person’s quote or a legal, medical, or financial statement. Stop without changes if there are no checkable claims; stop as blocked when adequate evidence is unavailable. Finish with the claim-to-source table, corrections made, unresolved claims, and decisions requiring an editor.
- Verify: Every checkable claim is supported or visibly flagged. The final claim-to-source table traces each checkable claim to current supporting evidence or marks it for an editor, with no high-risk unsupported claim silently left in the draft.
- Keywords: pre-publish fact check, source verification, editorial accuracy, claim audit, primary source research
- Related: [The promise-to-proof loop](https://signals.forwardfuture.com/loop-library/loops/promise-to-proof-loop/), [The research-to-artifact loop](https://signals.forwardfuture.com/loop-library/loops/research-to-artifact-loop/)

## 063 — [The epistemic frontier loop](https://signals.forwardfuture.com/loop-library/loops/epistemic-frontier-loop/)

- Category: Evaluation
- Use when: Use this for a hard question, strategy, system, or unresolved decision where several explanations remain plausible and another evidence-gathering step could materially improve the conclusion.
- Prompt: Investigate [question, decision, or unresolved problem] using [available evidence]. Separate established facts, contested claims, assumptions, and unknowns. Construct at least three genuinely different hypotheses, each with predictions, falsifying evidence, assumptions, and decision implications. Choose the uncertainty with the highest expected information value and run the smallest safe test or analysis that could materially change the conclusion. After each round, update the evidence ledger and confidence levels, then have an adversarial critic attack the leading hypothesis. Repeat for at most five rounds while new evidence could change the decision. Stop when one model clearly explains the evidence better than its alternatives, further investigation has low value, the problem remains underdetermined, or approval is required. Never fabricate evidence or hide uncertainty. Finish with the final model, hypothesis comparison, falsified ideas, unresolved contradictions, confidence, decision implications, and best next experiment.
- Verify: The conclusion survives comparison with falsifiable alternatives. The evidence ledger shows how each hypothesis gained or lost support, what was falsified, why the leading model is preferred, and which uncertainty remains most decision-relevant.
- Keywords: hypothesis testing, information value, evidence ledger, decision uncertainty, adversarial reasoning
- Related: [The devil's-advocate loop](https://signals.forwardfuture.com/loop-library/loops/devils-advocate-design-loop/), [The research-to-artifact loop](https://signals.forwardfuture.com/loop-library/loops/research-to-artifact-loop/)

## 064 — [The dependency triage loop](https://signals.forwardfuture.com/loop-library/loops/dependency-triage-loop/)

- Category: Engineering
- Use when: Use this when a repository has several open Dependabot pull requests and an authorized maintainer wants them reviewed safely without stale checks, parallel merge races, or automatic high-risk upgrades.
- Prompt: Review every Dependabot pull request currently open in [repository]. Take a fixed snapshot of that set and process each pull request once. Read its diff, release notes, advisories, dependency role, current base revision, and exact-head CI results. Run the repository’s relevant tests in an isolated worktree and classify the update by version change, breaking behavior, security exposure, and regression risk. For failing checks, identify the root cause and prepare the smallest verified repair. Process merges serially: before each merge, refetch the base and pull-request head and require passing exact-head checks. Merge only low-risk patch or minor updates when explicit merge authority has already been granted. Request approval for major, breaking, security-sensitive, uncertain, or externally visible actions. Never push changes, merge, comment, or send messages without the corresponding authority. Stop successfully when the original snapshot is fully processed; stop without changes when none are open; stop as blocked when verification is unavailable. Finish with reviewed, repaired, merged, deferred, and blocked pull requests plus supporting evidence.
- Verify: Every snapshotted dependency pull request reaches an evidence-backed status. Each pull request is merged, repaired, deferred for approval, or blocked with current diff, release, CI, and repository-test evidence; every merge uses a fresh base and exact head.
- Keywords: Dependabot triage, dependency pull requests, dependency update review, safe automated merge, dependency maintenance
- Related: [The dependency-CVE burndown loop](https://signals.forwardfuture.com/loop-library/loops/dependency-cve-burndown-loop/), [The stale-safe batch release loop](https://signals.forwardfuture.com/loop-library/loops/stale-safe-batch-release-loop/), [The Loop Harness verification loop](https://signals.forwardfuture.com/loop-library/loops/loop-harness-verification-loop/)

## 065 — [The React Doctor repair loop](https://signals.forwardfuture.com/loop-library/loops/react-doctor-repair-loop/)

- Category: Engineering
- Use when: Use this when a React codebase has actionable React Doctor errors or warnings and the safest repair strategy is to work in small, verified batches.
- Prompt: Run `pnpm exec react-doctor . --verbose --yes --offline --fail-on none` to record the baseline, then rerun with `--fail-on error`. Fix at most five genuine findings, run the same scan and relevant project checks, and keep only verified improvements. Clear errors before high-confidence warnings. Stop when clean, blocked, approval is required, a finding is false-positive, or another pass makes no measurable progress. Finish with baseline and final results, retained fixes, reverted attempts, checks, and remaining findings.
- Verify: React Doctor improves without introducing regressions. Compare the baseline and final scan under the same command and confirm each retained change with the repository checks relevant to the files touched.
- Keywords: React Doctor, React repair, coding agent, static analysis, verified refactoring
- Related: [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/), [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/), [The dependency-CVE burndown loop](https://signals.forwardfuture.com/loop-library/loops/dependency-cve-burndown-loop/)

## 066 — [The restartable handoff loop](https://signals.forwardfuture.com/loop-library/loops/restartable-handoff-loop/)

- Category: Operations
- Use when: Use this when a session is ending, context is becoming unwieldy, or another human or agent may need to continue the work later.
- Prompt: Before ending [session or work period], create a restartable handoff. Record the current goal, changes, verification evidence, untouched scope, uncertainties, open risks, off-limits areas, and last decision or gate. Check that a new human or agent could continue without guessing, then name exactly one safe next action and what they must not assume. Stop after the handoff; do not begin that action.
- Verify: The next person or agent can resume without guessing. The handoff accounts for current state, evidence, untouched and off-limits scope, uncertainty, risks, the last decision, one safe next action, and assumptions the next actor must not make.
- Keywords: AI agent handoff, session continuity, context transfer, restartable work, operational handoff
- Related: [The Living Story loop](https://signals.forwardfuture.com/loop-library/loops/living-story-loop/), [The next-action confidence check](https://signals.forwardfuture.com/loop-library/loops/next-action-confidence-check/), [The cross-run playbook loop](https://signals.forwardfuture.com/loop-library/loops/cross-run-playbook-loop/)

## 067 — [The React Doctor 100/100 loop](https://signals.forwardfuture.com/loop-library/loops/react-doctor-100-loop/)

- Category: Engineering
- Use when: Use this for an authorized repository-wide React health pass when every production React app can be identified and a complete scan plus project checks can run locally.
- Prompt: Bring every production React app in [repository] to a freshly verified React Doctor score of 100/100. Inventory app roots, record a full `npx react-doctor@latest --verbose` baseline, fix one root cause at a time, and rerun the full scan plus relevant typecheck, lint, tests, and builds. Never hide findings with exclusions, ignores, suppressions, deleted behavior, or relaxed rules. Stop at 100/100 for every app, blocked, approval-required, or no measurable progress; preserve unrelated work and report exact proof.
- Verify: Every production React app has a genuine fresh 100/100 result. Record the React Doctor version, app roots, full final scan, and repository validation results; distinguish pre-existing failures and blockers from regressions caused by the repair work.
- Keywords: React Doctor 100, React code health, repository audit, coding agent, React verification
- Related: [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/), [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/)

## 068 — [The evidence-first feature loop](https://signals.forwardfuture.com/loop-library/loops/evidence-first-feature-loop/)

- Category: Engineering
- Use when: Use this when a larger feature can be divided into small engineering slices and guessing about APIs, persistence, or user-visible behavior would create avoidable risk.
- Prompt: Implement one bounded feature slice in [repository]. Read project instructions, the current implementation, relevant services, types, UI, tests, and architecture notes before editing. Report the evidence, risks, affected files, persistence impact, and validation plan; stop for approval if inspection materially changes scope or reveals destructive, production, or silent-persistence behavior. Make the smallest change, preserve unknown data and unrelated work, run relevant checks, and manually verify user-facing states. Stop after this slice and return evidence, limitations, and the next recommended slice.
- Verify: One feature slice works and its assumptions are proven by current evidence. The implementation matches observed APIs and data contracts, repository checks pass or have documented pre-existing failures, and the relevant user path is manually verified when one exists.
- Keywords: evidence-first engineering, feature implementation, AI coding agent, repository inspection, bounded development
- Related: [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/), [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/), [The full product evaluation loop](https://signals.forwardfuture.com/loop-library/loops/full-product-evaluation-loop/)

## 069 — [The architecture-preserving code refactor loop](https://signals.forwardfuture.com/loop-library/loops/architecture-preserving-code-refactor-loop/)

- Category: Engineering
- Use when: Use this for a focused readability, typing, maintainability, or performance refactor whose intended behavior and public contracts should remain unchanged.
- Prompt: Refactor [target] toward [measurable goal] in [repository]. If the target or goal is missing, ask and stop. Record current behavior and affected dependencies; select representative tests for boundaries and failure modes, then make one atomic change without altering public contracts unless authorized. Run the same tests, type and lint checks, and affected-consumer checks, keeping only regression-free improvements. Repeat for at most five rounds. Stop on success, blocked architecture, approval required, exhaustion, or no progress. Preserve unrelated work and finish with the diff, impact map, evidence, rejected attempts, and remaining debt.
- Verify: The target improves without changing contracts or downstream behavior. The stated measurable goal is met, baseline behavior and public interfaces remain intact, and the relevant tests, type checks, lint checks, and affected-consumer checks pass under recorded conditions.
- Keywords: architecture-preserving refactor, blast radius analysis, atomic refactoring, regression testing, AI coding agent
- Related: [The architecture satisfaction loop](https://signals.forwardfuture.com/loop-library/loops/architecture-satisfaction-loop/), [The ticket-to-PR-ready loop](https://signals.forwardfuture.com/loop-library/loops/ticket-to-pr-ready-loop/), [The Groundtruth loop](https://signals.forwardfuture.com/loop-library/loops/groundtruth-audit-loop/)

