planning

2026-03-21 10:28:52 +01:00 · 2026-03-21 10:28:52 +01:00 · acba1a2da0
parent 37a61f97a9
commit acba1a2da0
3 changed files with 800 additions and 0 deletions
--- a/docs/superpowers/plans/2026-03-21-batch-orchestrator.md
+++ b/docs/superpowers/plans/2026-03-21-batch-orchestrator.md
@ -0,0 +1,452 @@
+# Batch Orchestrator Implementation Plan
+
+> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
+
+**Goal:** Implement the batch orchestrator for re:struct — a `/project:run-batch` command that semi-autonomously processes planned tasks from task.md using subagents.
+
+**Architecture:** A command file (`.claude/commands/run-batch.md`) encodes the orchestrator flow as a Claude prompt instruction. A rules file (`rules/batch-orchestration.md`) codifies the constraints. Both are markdown — no application code. The orchestrator leverages Claude Code's native Agent tool for subagent dispatch.
+
+**Tech Stack:** Claude Code commands (markdown prompts), Claude Code Agent tool, project rules system
+
+**Spec:** `docs/superpowers/specs/2026-03-21-batch-orchestrator-design.md`
+
+---
+
+## File Structure
+
+| File | Action | Responsibility |
+|------|--------|----------------|
+| `rules/batch-orchestration.md` | Create | Orchestrator/subagent contract rules, loaded via CLAUDE.md |
+| `.claude/commands/run-batch.md` | Create | The orchestrator command — full flow from parsing to consolidation |
+| `.claude/settings.json` | Create | Project-level permissions for batch execution |
+| `CLAUDE.md` | Modify | Reference new rule, add run-batch to Workflow-Commands |
+
+---
+
+### Task 1: Create `rules/batch-orchestration.md`
+
+**Files:**
+- Create: `rules/batch-orchestration.md`
+
+This file defines the hard constraints that apply during any batch orchestration run.
+It is referenced from CLAUDE.md and automatically loaded into context.
+
+- [ ] **Step 1: Write the rules file**
+
+```markdown
+# Batch Orchestration Rules
+
+## Orchestrator Constraints
+- The orchestrator only reads task.md and subagent summaries — never full output files
+- The orchestrator only processes tasks from the active phase (defined in CLAUDE.md "Aktiver Scope")
+- Shared state files (status.md, decisions-log.md, task.md) are only written by the orchestrator, never by subagents
+- Phase-specific shared artifacts (e.g., capabilities-inventory.md) are updated only by the orchestrator during consolidation
+- The orchestrator never triggers /project:phase-transition or starts tasks from the next phase
+- When all DoD items of the active phase are complete: hard stop, inform user
+- Missing batch parameters (items, checkpoint mode) are always queried interactively before starting
+
+## Subagent Constraints
+- Each subagent writes only its own assigned output file
+- Subagents never write to task.md, status.md, decisions-log.md, or CLAUDE.md
+- Each subagent returns a 2-3 sentence summary as its return value
+
+## Task Parsing
+- The orchestrator parses the first section in task.md that contains ### sub-headings with - [ ] checkboxes (the work section)
+- Flat checkbox lists without sub-headings are treated as Definition of Done and ignored for dispatching
+- Tasks under the same ### heading form a parallelization group (can run concurrently)
+- Tasks under different ### headings are sequential (Group N+1 starts after Group N completes and checkpoint passes)
+
+## Context Loading
+- Base context (always): project rules of active phase + status.md
+- Phase-specific context: follow phase rules if they define context requirements (e.g., discovery-waves.md)
+- Task-specific context: if task declares (context: file1, file2), load those; otherwise load output files from prior groups
+- Prefer condensed documents over raw sources
+
+## Checkpoint Modes
+- after-each (default): pause after each group, show summaries, wait for user input
+- after-batch:N: run N groups, then pause
+- at-end: run all groups, consolidated review at end
+- Phase boundary: always a hard stop regardless of selected mode
+
+## Error Handling
+- On subagent failure: mark task as failed in checkpoint summary, do not block other parallel tasks in the same group
+- Failed tasks stay - [ ] in task.md (not marked as completed)
+- Failures are reported at the next checkpoint with the error details
+```
+
+- [ ] **Step 2: Verify the rules file covers all spec constraints**
+
+Cross-check against spec sections:
+- Orchestrator Flow (steps 1-5) ✓
+- Subagent Contract (boundaries) ✓
+- Task Parsing (conventions) ✓
+- Context Loading Strategy ✓
+- Checkpoint Modes ✓
+- Phase Protection ✓
+- Error Handling ✓
+
+- [ ] **Step 3: Commit**
+
+```bash
+git add rules/batch-orchestration.md
+git commit -m "feat: add batch orchestration rules"
+```
+
+---
+
+### Task 2: Create `.claude/commands/run-batch.md`
+
+**Files:**
+- Create: `.claude/commands/run-batch.md`
+
+This is the main orchestrator command. It's a Claude prompt instruction that encodes
+the complete flow. When invoked, Claude follows these instructions step by step.
+
+**Important:** This file must be self-contained enough that Claude can follow it without
+having seen the spec. The rules file provides the constraints; this file provides the procedure.
+
+- [ ] **Step 1: Write the command file — Preamble and Phase Detection**
+
+The command file starts with reading CLAUDE.md and determining the active phase:
+
+```markdown
+Du bist der Batch-Orchestrator. Führe die folgenden Schritte der Reihe nach aus.
+
+Beachte dabei strikt die Regeln aus @rules/batch-orchestration.md
+
+## Schritt 1 — Phase und Tasks erkennen
+
+1. Lies CLAUDE.md und bestimme die aktive Phase und das Arbeitsverzeichnis
+2. Lies die task.md der aktiven Phase
+3. Finde den Arbeitsabschnitt: den ersten Abschnitt mit ### Unterüberschriften und - [ ] Checkboxen
+4. Ignoriere flache Checkbox-Listen ohne Unterüberschriften (das ist die Definition of Done)
+5. Erstelle eine nummerierte Liste aller offenen Tasks (- [ ]) mit:
+   - Nummer
+   - Task-Beschreibung
+   - Gruppenname (### Überschrift)
+   - Output-Pfad (falls im Text erkennbar)
+```
+
+- [ ] **Step 2: Write the command file — Permission Check**
+
+```markdown
+## Schritt 1.5 — Berechtigungen prüfen
+
+Prüfe ob die folgenden Tools ohne Nachfrage verfügbar sind: Read, Write, Edit, Glob, Grep, Agent.
+
+Falls Berechtigungen fehlen, zeige dem User:
+> ⚠ Für den Batch-Modus werden folgende Berechtigungen benötigt: [Liste]
+> Bitte konfiguriere sie in `.claude/settings.json` oder `.claude/settings.local.json`:
+> ```json
+> { "permissions": { "allow": ["Read", "Write", "Edit", "Glob", "Grep", "Agent"] } }
+> ```
+
+Brich dann ab. Starte den Batch NICHT ohne bestätigte Berechtigungen.
+```
+
+- [ ] **Step 3: Write the command file — Parameter Query**
+
+```markdown
+## Schritt 2 — Parameter abfragen
+
+Falls der User beim Aufruf keine Parameter angegeben hat, oder Parameter fehlen, frage interaktiv:
+
+Zeige die Task-Liste aus Schritt 1 und frage:
+
+> Ich sehe N offene Tasks in Phase X:
+> [nummerierte Liste mit Gruppenname]
+>
+> **Welche Tasks?** (alle / Nummern z.B. 1-3 / Gruppenname z.B. wave-3)
+> **Checkpoint-Modus?** (after-each / after-batch:N / at-end)
+
+Warte auf die Antworten des Users.
+
+Zeige dann eine Zusammenfassung des geplanten Batch-Laufs:
+> **Batch-Plan:**
+> - Tasks: [ausgewählte Tasks]
+> - Gruppen: [Gruppenreihenfolge, parallele Tasks markiert]
+> - Checkpoint: [gewählter Modus]
+> - Phase: [aktive Phase]
+>
+> Soll ich starten?
+
+Warte auf Bestätigung.
+```
+
+- [ ] **Step 4: Write the command file — Group Processing Loop**
+
+```markdown
+## Schritt 3 — Gruppen abarbeiten
+
+Arbeite die Gruppen in der Reihenfolge ihrer ### Überschriften ab.
+
+### Für jede Gruppe:
+
+**Context bestimmen:**
+- Basis: rules der aktiven Phase + status.md
+- Phasen-spezifisch: falls Phase-Rules Context definieren (z.B. discovery-waves.md), folge diesen
+- Task-spezifisch: falls der Task (context: datei1, datei2) deklariert, lade diese
+- Fallback: alle existierenden Output-Dateien vorheriger Gruppen im aktuellen Batch
+
+**Tasks dispatchen:**
+- Falls die Gruppe nur einen Task hat: dispatche einen Subagent
+- Falls die Gruppe mehrere Tasks hat: dispatche alle als parallele Subagents
+
+Für jeden Subagent verwende das Agent-Tool mit folgendem Prompt-Muster:
+
+> Du bist ein Subagent im re:struct Framework.
+>
+> **Dein Auftrag:** [Task-Beschreibung aus task.md]
+> **Output-Datei:** [Pfad]
+> **Context-Dateien zum Lesen:** [Liste der Context-Dateien]
+> **Regeln:** Beachte @rules/framework-constraints.md und @rules/language-conventions.md
+>
+> Schreibe dein Ergebnis in die Output-Datei.
+> Gib am Ende eine Zusammenfassung in 2-3 Sätzen zurück (NUR die Zusammenfassung als Return-Value, nicht in die Datei).
+
+**Auf Abschluss warten:**
+- Warte bis alle Subagents der Gruppe fertig sind
+- Sammle die Summaries und eventuelle Fehler
+
+**Fehlerbehandlung:**
+- Falls ein Subagent fehlschlägt: notiere den Fehler, blockiere aber nicht andere parallele Tasks
+- Fehlgeschlagene Tasks werden beim Checkpoint gemeldet
+```
+
+- [ ] **Step 5: Write the command file — Checkpoint Handling**
+
+```markdown
+### Checkpoint-Logik
+
+Nach jeder abgeschlossenen Gruppe: prüfe ob ein Checkpoint fällig ist.
+
+**after-each:** Immer Checkpoint nach jeder Gruppe.
+**after-batch:N:** Checkpoint nach jeder N-ten Gruppe. Zähler beginnt bei 1.
+**at-end:** Kein Checkpoint bis alle Gruppen fertig sind.
+
+**Bei einem Checkpoint:**
+
+Zeige dem User:
+> **Checkpoint — Gruppe "[Gruppenname]" abgeschlossen**
+>
+> | Task | Output | Status | Summary |
+> |------|--------|--------|---------|
+> | [Name] | [Dateipfad] | ✅ / ❌ | [2-3 Sätze] |
+>
+> **Befehle:** `weiter` | `stopp` | `zeig mir [datei]`
+
+- Bei `weiter`: nächste Gruppe starten
+- Bei `stopp`: gehe zu Schritt 4 (Konsolidierung)
+- Bei `zeig mir [datei]`: Datei lesen und anzeigen, dann erneut fragen
+
+**Phasen-Checkpoint (immer aktiv):**
+Prüfe nach jeder Gruppe ob alle offenen Tasks der Phase erledigt sind (prüfe die DoD-Sektion).
+Falls ja: STOPP, unabhängig vom Checkpoint-Modus:
+> **Alle Tasks dieser Phase sind abgeschlossen.**
+> Ein Phasenwechsel ist möglich. Nutze `/project:phase-transition` wenn du bereit bist.
+
+Gehe dann zu Schritt 4.
+```
+
+- [ ] **Step 6: Write the command file — Consolidation**
+
+```markdown
+## Schritt 4 — Konsolidierung
+
+Führe die folgenden Updates durch (nur der Orchestrator, nie Subagents):
+
+1. **task.md aktualisieren:**
+   - Markiere erfolgreich abgeschlossene Tasks mit [x]
+   - Fehlgeschlagene Tasks bleiben [ ]
+
+2. **status.md aktualisieren:**
+   - Aktuellen Stand in 2-3 Sätzen
+   - Liste der abgeschlossenen Tasks
+   - Falls Fehler auftraten: dokumentiere sie
+   - Nächste Schritte (verbleibende offene Tasks)
+
+3. **decisions-log.md:**
+   - Falls während des Batch Entscheidungen relevant wurden: Einträge erstellen
+   - Format: siehe docs/templates/decisions-log-entry.md
+
+4. **Phasen-spezifische Shared Artifacts:**
+   - Falls die Phase geteilte Dokumente hat (z.B. capabilities-inventory.md in Discovery):
+     aktualisiere sie mit den konsolidierten Ergebnissen aus den Subagent-Summaries
+
+## Schritt 5 — Commit-Vorschlag
+
+Zeige dem User:
+> **Batch abgeschlossen.** Zusammenfassung:
+> - [N] Tasks erfolgreich, [M] fehlgeschlagen
+> - Aktualisiert: task.md, status.md [, decisions-log.md] [, weitere]
+> - Output-Dateien: [Liste der neuen Dateien]
+>
+> Soll ich einen Commit erstellen?
+
+Falls ja: erstelle einen Commit mit einer Nachricht die den Batch beschreibt,
+z.B. `feat: complete Wave 2 entry-points analysis`
+```
+
+- [ ] **Step 7: Assemble the complete command file**
+
+Combine Steps 1-6 into the final `.claude/commands/run-batch.md` file.
+Ensure the flow is continuous and there are no gaps between sections.
+
+- [ ] **Step 8: Verify the command file against the spec**
+
+Cross-check every section of the spec against the command file:
+- [ ] Task parsing (work section vs DoD) — Schritt 1
+- [ ] Permission check — Schritt 1.5
+- [ ] Interactive parameter query — Schritt 2
+- [ ] Group processing with parallelization — Schritt 3
+- [ ] Subagent contract (input/output/boundaries) — Subagent prompt template
+- [ ] Context loading strategy (base/phase/task) — Context bestimmen
+- [ ] Checkpoint modes (after-each, after-batch:N, at-end) — Checkpoint-Logik
+- [ ] Phase protection (hard stop) — Phasen-Checkpoint
+- [ ] Error handling (failed tasks) — Fehlerbehandlung
+- [ ] Consolidation (task.md, status.md, decisions-log, shared artifacts) — Schritt 4
+- [ ] Commit proposal — Schritt 5
+- [ ] session-start/end replacement — implicitly handled (no reference to session-start needed)
+
+- [ ] **Step 9: Commit**
+
+```bash
+git add .claude/commands/run-batch.md
+git commit -m "feat: add run-batch orchestrator command"
+```
+
+---
+
+### Task 3: Create `.claude/settings.json`
+
+**Files:**
+- Create: `.claude/settings.json`
+
+Project-level permissions that ensure the batch orchestrator and its subagents
+can operate without permission prompts. This file is separate from the existing
+`.claude/settings.local.json` (which contains user-specific permissions).
+
+- [ ] **Step 1: Write settings.json**
+
+```json
+{
+  "permissions": {
+    "allow": [
+      "Read",
+      "Glob",
+      "Grep",
+      "Edit",
+      "Write",
+      "Agent"
+    ]
+  }
+}
+```
+
+- [ ] **Step 2: Verify no conflict with settings.local.json**
+
+Read `.claude/settings.local.json` and confirm the two files don't conflict.
+`settings.local.json` has specific Bash and Skill permissions; `settings.json`
+adds the core tool permissions. These are additive, no conflict expected.
+
+- [ ] **Step 3: Commit**
+
+```bash
+git add .claude/settings.json
+git commit -m "feat: add project-level permissions for batch orchestration"
+```
+
+---
+
+### Task 4: Update CLAUDE.md
+
+**Files:**
+- Modify: `CLAUDE.md` (two sections)
+
+- [ ] **Step 1: Add batch-orchestration rule reference**
+
+In the `## Regeln` section, add:
+```
+@rules/batch-orchestration.md
+```
+
+- [ ] **Step 2: Add run-batch to Workflow-Commands**
+
+In the `## Workflow-Commands` section, add:
+```
+- `/project:run-batch` — Batch von geplanten Tasks semi-autonom abarbeiten (Checkpoints, Parallelisierung)
+```
+
+- [ ] **Step 3: Verify CLAUDE.md is consistent**
+
+Read the updated CLAUDE.md and verify:
+- The rule reference is in the Regeln section alongside existing rules
+- The command is listed alongside existing commands
+- No other sections need updating
+
+- [ ] **Step 4: Commit**
+
+```bash
+git add CLAUDE.md
+git commit -m "docs: reference batch orchestration rule and command in CLAUDE.md"
+```
+
+---
+
+### Task 5: Verification Dry-Run
+
+**Files:**
+- Read-only: all created/modified files
+
+- [ ] **Step 1: Verify file structure**
+
+Confirm all expected files exist:
+```
+rules/batch-orchestration.md        ✓
+.claude/commands/run-batch.md       ✓
+.claude/settings.json               ✓
+CLAUDE.md (updated)                 ✓
+```
+
+- [ ] **Step 2: Verify rules are loadable**
+
+Read CLAUDE.md and confirm `@rules/batch-orchestration.md` resolves correctly
+(the file exists at the referenced path).
+
+- [ ] **Step 3: Verify command is invocable**
+
+Confirm `.claude/commands/run-batch.md` exists in the commands directory
+alongside the existing commands (session-start, session-end, decision, etc.).
+
+- [ ] **Step 4: Read through the complete command file end-to-end**
+
+Read `run-batch.md` as if you were Claude encountering it for the first time.
+Check for:
+- Is the flow unambiguous? Can every step be followed without guessing?
+- Are there references to files or concepts that aren't explained?
+- Is the subagent prompt template complete?
+- Are all checkpoint modes handled?
+
+- [ ] **Step 5: Trace through an example scenario**
+
+Mentally trace through this scenario:
+- Phase 1 Discovery is active
+- task.md has Wave 1 (1 task), Wave 2 (1 task), Wave 3 (3 parallel tasks)
+- User says: `/project:run-batch items:all checkpoint:after-each`
+
+Verify the command file produces this behavior:
+1. Reads task.md, finds 5 tasks in 3 groups
+2. Checks permissions
+3. Shows task list, asks for confirmation
+4. Dispatches Wave 1 subagent → checkpoint → user says "weiter"
+5. Dispatches Wave 2 subagent → checkpoint → user says "weiter"
+6. Dispatches 3 Wave 3 subagents in parallel → checkpoint
+7. All tasks done → phase checkpoint → hard stop
+8. Consolidation → commit proposal
+
+- [ ] **Step 6: Final commit (if any fixes were needed)**
+
+```bash
+git add -A
+git commit -m "fix: adjustments from dry-run verification"
+```
--- a/docs/superpowers/specs/2026-03-21-batch-orchestrator-design.md
+++ b/docs/superpowers/specs/2026-03-21-batch-orchestrator-design.md
@ -0,0 +1,309 @@
+# Batch Orchestrator for re:struct
+
+## Problem Statement
+
+When tasks are pre-planned (e.g., discovery waves, user stories, architecture decisions),
+the human operator is reduced to a manual scheduler:
+
+```
+session-start → "do task X" → session-end → clear context → commit → repeat
+```
+
+This is repetitive and adds no value when the next steps are already defined.
+
+## Solution Overview
+
+A lightweight in-session orchestrator that reads planned tasks from `task.md`,
+dispatches subagents for execution (fresh context per task), and pauses at
+configurable checkpoints for human review.
+
+**Approach:** Slim orchestrator + subagent isolation (Approach 3 from evaluation).
+
+- Subagents do the heavy work with their own context
+- Orchestrator reads only task.md + subagent summaries (never full output files)
+- Parallelization within groups, sequential between groups
+- Phase boundary is a hard stop — never crossed automatically
+
+## Task Parsing from task.md
+
+### Which section does the orchestrator parse?
+
+A `task.md` may contain multiple sections with checkboxes — e.g., a "Waves" section with
+sub-headings AND a flat "Definition of Done" section. These serve different purposes:
+
+- **Work section** (with `###` sub-headings): actionable tasks with grouping — this is what
+  the orchestrator parses. Identified by having sub-headings (`###`) under it.
+- **Definition of Done section** (flat list): acceptance criteria for phase completion —
+  used only for phase-boundary checks, not for dispatching work.
+
+**Convention:** The orchestrator parses the **first section in task.md that contains
+`###` sub-headings with `- [ ]` checkboxes**. This is the work plan. Flat checkbox lists
+without sub-headings are treated as DoD and ignored for dispatching.
+
+Example `task.md` with both sections:
+
+```markdown
+## Analyse-Wellen                        ← WORK SECTION (orchestrator parses this)
+### Wave 1 — Structure
+- [ ] structure-map.md
+
+### Wave 2 — Entry Points
+- [ ] entry-points.md
+
+### Wave 3 — Module Deep Dives
+- [ ] Module: auth
+- [ ] Module: invoicing
+- [ ] Module: reporting
+
+## Ergebnisse (Definition of Done)       ← DoD SECTION (phase check only)
+- [ ] structure-map.md
+- [ ] domain-model.md
+- [ ] capabilities-inventory.md
+```
+
+### Conventions
+
+- `- [ ]` = open task, eligible for orchestrator
+- `- [x]` = completed, skipped
+- Tasks under the same `###` heading = **parallelization group** (can run concurrently)
+- Tasks under different `###` headings = **sequential** (Group 2 starts after Group 1)
+- Each task requires an **output path** — either explicit in the text or derivable from task name
+- Each task must contain enough context for the subagent: what to do, which sources to read
+
+### What the orchestrator does NOT read from task.md
+
+- Checkpoint mode and batch size — provided by the user at invocation time
+- The Definition of Done section — only relevant for phase-boundary checks
+
+## Checkpoint Modes
+
+Three modes, selected by the user at batch start:
+
+| Mode | Behavior | Typical Use |
+|------|----------|-------------|
+| `after-each` (default) | Pause after each group, show summaries, wait for "continue" | New/uncertain tasks, want to review every output |
+| `after-batch:N` | Run N groups, then pause | "Do the next 3, then I'll check" |
+| `at-end` | Run all groups, consolidated review at the end | Well-understood tasks, high confidence |
+
+### Checkpoint Behavior
+
+At each checkpoint the orchestrator:
+
+1. Shows per completed task: task name, output file path, 2-3 sentence summary (provided by subagent)
+2. User can respond:
+   - `weiter` (continue) — next task / next batch
+   - `stopp` (stop) — orchestrator consolidates current results and ends
+   - `zeig mir [file]` (show me) — review an output file in detail before deciding
+3. After final checkpoint: consolidation (status.md, decisions-log, task.md checkboxes, commit proposal)
+
+### Checkpoint with Parallel Tasks
+
+| Mode | Behavior with group of 3 parallel tasks |
+|------|----------------------------------------|
+| `after-each` | All 3 run in parallel, checkpoint when **all 3 complete** — show all summaries |
+| `after-batch:N` | Batch counter counts groups, not individual tasks |
+| `at-end` | Run through, checkpoint only after all groups |
+
+Parallelization does not change checkpoint rhythm — checkpoints refer to groups, not individual subagents.
+
+## Phase Protection
+
+- The orchestrator only processes tasks from the **active phase** (defined in CLAUDE.md "Aktiver Scope")
+- When the last open task of a phase completes: orchestrator **stops unconditionally** and informs the user that all DoD items are done and a phase transition is possible
+- The orchestrator **never** triggers `/project:phase-transition` itself or starts tasks from the next phase
+- Even in `at-end` mode: phase boundary = hard stop, no override possible
+
+This is an implicit fourth checkpoint type: the **phase checkpoint**, which always applies regardless of selected mode.
+
+## Relationship to session-start / session-end
+
+The batch orchestrator **replaces** the manual session-start/end cycle for batch work:
+
+- **session-start is not required** before run-batch. The orchestrator reads task.md and
+  CLAUDE.md directly — it does not need the user to restore context manually.
+- **session-end is incorporated** into the orchestrator's consolidation step (Step 4).
+  The consolidation performs the same updates as session-end: status.md, decisions-log.md,
+  task.md checkboxes. A separate session-end after run-batch is not needed.
+- **The user may still use session-start** before run-batch if they want to review context
+  first — but it is optional, not required.
+
+In short: `/project:run-batch` is a self-contained replacement for the
+`session-start → work → session-end → clear → commit` cycle.
+
+## Orchestrator Flow
+
+```
+User: "Work through the open discovery tasks" (or /project:run-batch)
+  │
+  ├─ 1. Read task.md of active phase, parse open tasks
+  │
+  ├─ 1.5. Permission check:
+  │     • Verify required tools are allowed (Read, Write, Edit, Glob, Grep, Agent)
+  │     • If permissions missing → inform user, suggest settings.json config, abort
+  │
+  ├─ 2. Check batch parameters — anything missing? → Ask interactively:
+  │     • "Which tasks? (all open / specific items / range)"
+  │     • "Checkpoint mode? (after-each / after-batch:N / at-end)"
+  │     • Show summary + wait for confirmation
+  │
+  ├─ 3. Process groups sequentially:
+  │     │
+  │     ├─ Group 1:
+  │     │   ├─ Dispatch tasks in parallel (subagents)
+  │     │   ├─ Wait for completion
+  │     │   ├─ On subagent failure: mark task as failed, include error
+  │     │   │   in checkpoint summary, do not block other parallel tasks
+  │     │   ├─ Checkpoint per selected mode
+  │     │   └─ On "stop" → consolidation → end
+  │     │
+  │     ├─ Group 2:
+  │     │   └─ ... (only after Group 1 completes + checkpoint passes)
+  │     │
+  │     └─ Phase boundary reached?
+  │           → HARD STOP, inform user
+  │
+  ├─ 4. Consolidation (orchestrator only, never subagents):
+  │     • task.md: mark completed items [x], failed items stay [ ]
+  │     • status.md: update current state (include failures if any)
+  │     • decisions-log.md: add entries if applicable
+  │     • Phase-specific shared artifacts (e.g., capabilities-inventory.md
+  │       in Discovery): update with consolidated findings from subagents
+  │
+  └─ 5. Commit proposal to user
+```
+
+## Subagent Contract
+
+### Context Loading Strategy
+
+Each subagent needs context to do its work. The orchestrator determines context through
+a layered approach:
+
+**Base context (always loaded):**
+- Project rules of the active phase (framework-constraints, language-conventions, parallel-agents)
+- status.md of the active phase (current state summary)
+
+**Phase-specific context (loaded by convention):**
+- If phase-specific rules define context requirements (e.g., `discovery-waves.md` specifies
+  "Wave 2 loads structure-map.md"), the orchestrator follows those rules.
+
+**Task-specific context (optional, from task.md):**
+- Tasks in task.md can declare explicit context dependencies:
+  ```markdown
+  - [ ] domain-model.md (context: structure-map.md, entry-points.md)
+  ```
+- If no explicit context is declared: the orchestrator loads all existing output files
+  from prior groups in the current batch as context.
+
+**Context budget principle:** The orchestrator always prefers condensed documents
+(structure-map.md, status.md) over raw sources. Raw code is only passed when
+the task explicitly requires it (e.g., a module deep-dive task references a source path).
+
+### Input (provided by orchestrator)
+
+- Exactly one task with clear assignment
+- Context files per the loading strategy above
+- Output file path
+- Project rules (framework-constraints, language-conventions, parallel-agents)
+
+### Output (returned to orchestrator)
+
+- Output file written to disk at the specified path
+- Summary (2-3 sentences) as return value to orchestrator
+
+### Boundaries
+
+- Subagent writes **only** its own output file — never shared state
+- Subagent does **not** update task.md, status.md, or decisions-log.md
+- Orchestrator reads **only** subagent summaries — never full output files
+
+## Skill Interface
+
+### Command
+
+```
+/project:run-batch                              → asks everything interactively
+/project:run-batch items:1-3 checkpoint:after-each
+/project:run-batch items:all checkpoint:at-end
+/project:run-batch items:wave-3 checkpoint:after-batch:2
+```
+
+### Natural Language Triggers (examples)
+
+```
+"Arbeite die nächsten 3 offenen Tasks ab, zeig mir nach jedem das Ergebnis"
+"Führe alle offenen Wave-3-Module parallel durch"
+"Mach die Discovery-Tasks durch, Checkpoint am Ende"
+```
+
+### Interactive Parameter Query
+
+When parameters are missing or incomplete, the orchestrator asks before starting:
+
+```
+Orchestrator: "Ich sehe 5 offene Tasks in Phase 1 — Discovery:
+  1. [ ] structure-map.md (Wave 1)
+  2. [ ] entry-points.md (Wave 2)
+  3. [ ] Module: auth (Wave 3)
+  4. [ ] Module: invoicing (Wave 3)
+  5. [ ] Module: reporting (Wave 3)
+
+Welche Tasks soll ich abarbeiten? (alle / Nummern / Bereich)
+Checkpoint-Modus? (after-each / after-batch:N / at-end)"
+```
+
+## Parallelization Rules
+
+- Tasks under the same heading = one parallelization group
+- Within a group: all tasks dispatched simultaneously as subagents
+- Between groups: strictly sequential — Group 2 starts only after Group 1 fully completes AND checkpoint passes
+- Checkpoint mode applies **per group**, not per individual task within parallel groups
+
+## Permissions
+
+Required permissions in `.claude/settings.json`:
+
+```json
+{
+  "permissions": {
+    "allow": [
+      "Read",
+      "Glob",
+      "Grep",
+      "Edit",
+      "Write",
+      "Agent"
+    ]
+  }
+}
+```
+
+- Configured once in settings.json, no runtime flags needed
+- Subagents inherit the parent session's permission context
+- Orchestrator checks permissions at startup (Step 1.5) and aborts with guidance if missing
+
+## Artifacts
+
+| Artifact | Purpose |
+|----------|---------|
+| `rules/batch-orchestration.md` | Rules for orchestrator/subagent contract |
+| `.claude/commands/run-batch.md` | The skill/command definition |
+| `.claude/settings.json` | Permission configuration |
+| `docs/superpowers/specs/2026-03-21-batch-orchestrator-design.md` | This spec |
+| `docs/superpowers/specs/2026-03-21-batch-orchestrator-future.md` | Future improvements |
+
+## Decisions Made
+
+| Decision | Rationale |
+|----------|-----------|
+| In-session orchestrator with subagents (not external script) | Keeps checkpoints interactive, stays in Claude ecosystem |
+| Slim orchestrator (summaries only, no full output reads) | Prevents context overflow with many tasks |
+| task.md as single source of truth | No redundant runbook file, existing structure is machine-readable |
+| Heading-based parallelization groups | Already matches existing task.md structure |
+| Phase boundary = hard stop | Phase transitions require human intent per framework rules |
+| Missing parameters always queried interactively | Prevents accidental batch runs with wrong settings |
+| Permission check at startup | Prevents hanging on permission prompts mid-batch |
+| Default checkpoint mode: after-each | Safest option, user opts into more autonomy explicitly |
+| Orchestrator parses work section (with ###), not DoD section | Avoids ambiguity between actionable tasks and acceptance criteria |
+| run-batch replaces session-start/end cycle | Eliminates redundant manual steps; consolidation is built into the flow |
+| Layered context loading for subagents | Base context always available; phase-specific and task-specific context keeps the approach generic across phases |
--- a/docs/superpowers/specs/2026-03-21-batch-orchestrator-future.md
+++ b/docs/superpowers/specs/2026-03-21-batch-orchestrator-future.md
@ -0,0 +1,39 @@
+# Batch Orchestrator — Future Improvements
+
+Improvements identified during design but explicitly out of scope for the initial implementation.
+
+## Live Subagent Streaming
+
+**What:** Real-time visibility into subagent work as it happens, not just after completion.
+
+**Why deferred:** Claude Code subagents currently return results only on completion. Live streaming would require changes to the Agent tool or a workaround (e.g., file-watching with periodic reads).
+
+**Possible approaches:**
+- Subagent writes incremental progress to a `.progress` file; orchestrator polls periodically
+- Future Claude Code feature: streaming subagent output to parent
+
+## Cross-Wave Parallelization
+
+**What:** When Wave 3 Module A completes, start Wave 4 work for Module A while Wave 3 Module B is still running.
+
+**Why deferred:** Creates complex dependency graphs and merge conflicts on shared output files. The sequential group model is simpler and sufficient for current needs.
+
+**Prerequisite:** Clear per-module output isolation across phases, not just within a phase.
+
+## Automatic Retry on Subagent Failure
+
+**What:** If a subagent fails (context overflow, tool error), automatically retry with adjusted parameters.
+
+**Why deferred:** Failure modes need to be observed in practice first before designing retry logic. Current approach: orchestrator reports failure, user decides.
+
+## Batch Resume After Interruption
+
+**What:** If a batch is interrupted (crash, user closes VS Code), resume from where it left off.
+
+**Why deferred:** task.md checkboxes already track completion. A partial batch leaves some items `[x]` and others `[ ]`. Restarting `/project:run-batch` naturally picks up remaining items. Explicit resume logic adds complexity with little benefit over this natural behavior.
+
+## Dynamic Parallelization Hints
+
+**What:** Allow task.md to contain explicit parallelization overrides beyond the heading-based grouping (e.g., `<!-- parallel: false -->` to force sequential within a group).
+
+**Why deferred:** Heading-based grouping covers known use cases. Override syntax adds complexity that may never be needed.