LATTICE // WORKFLOWS

Six phases. Enforced gates. Session recovery that actually works.

RESEARCH → PLAN → IMPLEMENT → VALIDATE → REVIEW → DELIVER. Every phase logs events, every gate blocks the next, and every checkpoint survives context compaction.

run · phase-gate timeline

RUNNING

RESEARCH

PLAN

IMPLEMENT

VALIDATE

REVIEW

DELIVER

RESEARCH

PLAN

IMPLEMENT

VALIDATE

REVIEW

DELIVER

LEDGER // PROBLEM

Every AI agent session is an invisible black box

Without structure, “I'll build this” is untraceable. When context compacts, state evaporates. When the session crashes, the agent restarts from zero, repeating solved problems and re-discovering known gotchas. Every other framework has this problem. TRW solved it.

WITHOUT TRWBlack box

On crash
Session restarts from zero
Prior work
Every decision in the compacted window is lost
Gotchas
Re-discovered the hard way each time

WITH TRWTraceable

On crash
pre_compact_state.json
Resume at the exact checkpoint
Prior work
append-only
events.jsonl is append-only — nothing silently dropped
Gotchas
trw_session_start
Top-N learnings injected before work begins

LATTICE // PHASES

The 6-phase lifecycle

Each phase has explicit exit criteria. A phase gate blocks the next phase until evidence passes. The orchestrator enforces this — no phase advances on intent alone.

RESEARCH

PLAN

IMPLEMENT

VALIDATE

REVIEW

DELIVER

Recall prior work, inspect codebase, register findings. Exit: plan.md draft with ≥3 evidence paths and formation selected.

Exit

plan.md draft, ≥3 evidence paths

Design approach, identify dependencies, choose formation. Exit: acceptance criteria written, shards planned, wave_manifest.yaml created.

Exit

Acceptance criteria, wave_manifest.yaml

Execute with periodic checkpoints, shard self-review. Exit: shards/waves complete or checkpointed, tests written.

Exit

Shards complete, tests written

Run build_check, verify coverage, integration check. Exit: coverage ≥ target, all gates pass, zero P0 issues.

Exit

Coverage ≥ target, gates pass

Audit diff for quality (DRY/KISS/SOLID), record learnings. Exit: critic reviewed, adversarial audit passed, no P0 findings.

Exit

Audit passed, no P0 findings

Sync artifacts, checkpoint, close run. Exit: PR created or archived, final.md written, CLAUDE.md synced.

Exit

PR created, final.md, CLAUDE.md synced

RESEARCH
01/06
Recall prior work, inspect codebase, register findings. Exit: plan.md draft with ≥3 evidence paths and formation selected.
Exit
plan.md draft, ≥3 evidence paths
PLAN
02/06
Design approach, identify dependencies, choose formation. Exit: acceptance criteria written, shards planned, wave_manifest.yaml created.
Exit
Acceptance criteria, wave_manifest.yaml
IMPLEMENT
03/06
Execute with periodic checkpoints, shard self-review. Exit: shards/waves complete or checkpointed, tests written.
Exit
Shards complete, tests written
VALIDATE
04/06
Run build_check, verify coverage, integration check. Exit: coverage ≥ target, all gates pass, zero P0 issues.
Exit
Coverage ≥ target, gates pass
REVIEW
05/06
Audit diff for quality (DRY/KISS/SOLID), record learnings. Exit: critic reviewed, adversarial audit passed, no P0 findings.
Exit
Audit passed, no P0 findings
DELIVER
06/06
Sync artifacts, checkpoint, close run. Exit: PR created or archived, final.md written, CLAUDE.md synced.
Exit
PR created, final.md, CLAUDE.md synced

TRACE // COMPACTION

The context-compaction problem

AI CLIs compact context when tokens fill. Pre-TRW: the agent loses state, restarts work, repeats solved problems. TRW: a pre-compact.sh hook fires BEFORE compaction, saving { run_path, phase, events_logged, last_checkpoint }. On restart, session-start.sh reads state and resumes from the exact resume point.

WITHOUT TRWState evicted

state→compact→evicted

Context fills → agent compacts → state evicted
On restart: no memory of the active phase
Agent re-discovers already-solved problems
Work done may be repeated or silently lost

WITH TRWState preserved

state→checkpoint→resumed

pre-compact hook fires before the window fills
Writes pre_compact_state.json atomically (temp → rename)
session-start reads state → resumes at the exact phase
Events log is append-only — no data lost (shared across subagent-start too)

LEDGER // CHECKPOINTS

Atomic checkpoints

trw_checkpoint() writes atomic snapshots (temp file → rename) at milestones. JSONL events append-only to events.jsonl. A crash at any point leaves a recoverable state. Delegated runs use agents like trw-researcher and trw-implementer that inherit the same checkpoint contract.

.trw/context/pre_compact_state.json

{
  "timestamp": "2026-04-16T03:14:09Z",
  "trigger": "mcp_tool",
  "run_path": ".trw/runs/my-feature/20260416T031400Z-abc1234",
  "phase": "IMPLEMENT",
  "events_logged": 47,
  "last_checkpoint": "pre-compaction safety checkpoint",
  "pending_ceremony": ["trw_review", "trw_deliver"],
  "last_5_events": [
    {"event": "tool_invocation", "data": {"tool_name": "trw_checkpoint"}},
    {"event": "tool_invocation", "data": {"tool_name": "trw_learn"}},
    ...
  ]
}

LEDGER // SCORING

Ceremony scoring

Each run scores 0–100. The avg_ceremony_score metric is surfaced on /metrics. Compliance is measured, not aspired to. trw_build_check and trw_instructions_sync feed into the final score.

SESSION_STARTCalled as first action

30 pts

REFLECTION_COMPLETELearnings recorded

30 pts

CHECKPOINT≥1 trw_checkpoint() call

20 pts

LEARNING≥1 trw_learn() call

10 pts

BUILD_CHECKtrw_build_check ran and passed

10 pts

PERFECT_RUNAll five criteria met

100 pts

LATTICE // RELATED

Pairs with

Verification

Phase 4 (VALIDATE) is the verification gate. The workflow does not advance until build evidence passes.

Memory

The DELIVER phase runs trw_instructions_sync, refreshing the client instruction file so every future session starts with the latest protocol.

Requirements

The PLAN phase produces the PRD. Every subsequent phase verifies implementation against it.

LEDGER // FAQ

Common questions

Can I skip phases?

For MINIMAL-tier runs only. Standard and full runs enforce the complete cycle. Most runs benefit from the full sequence because the exit criteria at each gate catch issues cheaply.

What exactly gets checkpointed?

Run state including the active phase, events logged so far, PRD scope, file ownership map, and the last five events. This is written atomically (temp file → rename) so a crash at any moment leaves a recoverable state.

Does this work offline?

Yes. Run directories are fully local — .trw/runs/{task}/{run_id}/. No cloud connectivity is required. The pre_compact_state.json hook writes to disk before compaction fires, regardless of network.

How long does a full phase cycle take?

Highly task-dependent. A small fix can cycle through in a single session; a full feature with multi-agent formations may span several sessions. The point is not speed — it is that each phase has evidence and the next session resumes exactly where the previous one stopped.

What happens if VALIDATE fails?

The phase gate blocks advancement. The orchestrator routes back to IMPLEMENT with the failing evidence recorded in events.jsonl. The run does not progress to REVIEW until build_check, coverage, and integration checks all pass.

How is DELIVER different from a checkpoint?

Checkpoints are internal save-points that let a run resume mid-phase. DELIVER is the terminal phase: it syncs artifacts (PR, final.md, CLAUDE.md), promotes high-impact learnings, and closes the run. A run can have many checkpoints but only one DELIVER.

TERMINAL // RUN_STRUCTURED

Turn every agent session into a traceable engineering run

Every checkpointed session resumes exactly where it stopped. Every validated learning surfaces in the next one.