Changelog
Release notes and version history for TRW. Track how the operating layer, memory engine, and delivery tooling evolved over time. This page covers shipped releases only, so it stays a reliable history instead of turning into a speculative roadmap.
Current Versions
0.41.1
trw-mcp
0.6.6
trw-memory
0.4.0
trw-eval
v24.5_TRW
Framework
How to read it
Released, not proposed
Entries below are release snapshots tied to shipped package versions. If work is still moving, it belongs in requirements, sprint docs, or implementation notes until it is actually released.
Releases
Instruction Accuracy & Tool Filtering
- Instruction files now automatically filter to show only available tools, preventing agents from calling unavailable tools
- New CLI command to validate instruction accuracy against the active tool set
- Delivery gate warns when project instructions reference tools not available in the current configuration
Sequential Evaluation & Failure Analysis
- Sequential evaluation chains measure how learning compounds across multiple tasks
- Planning scorer now rewards evidence-based exploration before planning, reducing false positives
- Transcript mining identifies recurring failure patterns across evaluation runs
- New causal analysis for understanding why specific evaluation runs fail
OpenCode Release Hardening
- OpenCode support release hardened with full-package verification
- Outcome-correlation writes now persist cleanly for SQLite-backed learnings
- Session-boundary and template-version regressions aligned to current runtime behavior
- Bundled OpenCode artifacts remain synchronized with source skills, hooks, and agents
Architecture & Optimization
- Configuration decomposition into focused domain modules
- Optimized tool responses — significant token reduction per call
- Agent consolidation to focused roles with clean upgrade path
- Instruction file optimization and reduced hook overhead
- Contextual learning injection on phase change
- Platform OAuth and authentication hardening with cold-start resilience
Codex Provider Support
- Full Codex bootstrap: one command generates all project configs, instructions, and skill definitions with merge-safe updates
- Codex client profile: light-ceremony mode optimized for Codex's context constraints and workflow
- Skill path normalization: existing configs auto-corrected on update for consistent directory references
- Bootstrap stability: fixed edge case where project initialization could fail in certain directory layouts
Architecture & Code Quality Sprint
- Resolved critical architectural layering issues to enforce clean dependency boundaries across the codebase
- Major module decomposition: large modules split into focused, maintainable units across both packages
- Exception handling audit: all error handlers reviewed and justified, plus security fixes in bundled scripts
- trw-memory 0.6.0: improved CLI error handling, cleaner module organization, and a new abstract storage interface
Developer Experience Sprint
- Embeddable server mode: MCP server can now be used as a library for testing and integration, with full type annotation support
- Config system refactored: large configuration module split into focused, maintainable components
- Import cycle cleanup: circular dependencies eliminated for faster startup and cleaner architecture
- New CLI commands: auto-generated config reference, project uninstall with dry-run mode, and --version flag
Session Resilience Hardening
- Activity heartbeat: tool calls now signal liveness so long-running sessions are never incorrectly marked stale
- Database maintenance: proactive storage cleanup prevents unbounded growth during long sessions
- Reliable session boundaries: session start events are always recorded, ensuring delivery gates work regardless of tool call order
- Fixed multiple stale-run detection bugs: active sessions are now correctly identified and prior session data is properly isolated
Executable Assertions & Self-Verifying Memory
- Machine-verifiable assertions attached to memory entries — learnings auto-verify against your codebase to detect staleness
- Assertion-aware recall: entries with failing assertions are automatically deprioritized in search results
- Security-first verification: path traversal protection, pattern limits, and binary file detection
- Comprehensive test coverage across both packages for all new verification capabilities
Device Auth & One-Line Install
- One-line install: a single shell command detects your environment, installs from PyPI, and configures your project
- Device authorization flow — authenticate your CLI via browser with QR code support
- OAuth social login with Google and GitHub, including Google One Tap for instant sign-in
- CLI auth commands: login, logout, and status with multi-organization support
Observation Masking & Open Source Prep
- Adaptive context budget: tool responses automatically compress as sessions grow longer, significantly reducing token costs
- Open-source publication prep: BSL 1.1 license, public README, secrets scrubbed
- Redundancy detection: repeated identical tool responses are automatically suppressed to save context
- Instruction cache invalidation on upgrade — version bumps force re-render across all projects
Framework v24.4 & Structured Logging
- Framework version bump to v24.4 — coordinated release across all packages
- Structured logging overhaul: consistent, queryable log events across all modules
- Expanded cross-package integration test suite covering end-to-end tool workflows
- Agent Teams reliability fix — parallel agent work is properly preserved during cleanup
Client Profiles & Adaptive Ceremony
- 8 built-in client profiles: claude-code, cursor-ide, cursor-cli, opencode, codex, copilot, gemini, and aider
- Profile-aware ceremony: process overhead adapts to each platform's context window and capabilities
- Response optimizer: tool responses automatically compacted for LLM context efficiency
- Multiple delivery reliability fixes and internal code quality improvements
Multi-Platform Support & Full Tool Suite
- Multi-platform ceremony: full, light, and off modes — spans Claude Code, Cursor IDE, Cursor CLI, OpenCode, Codex, Copilot, Gemini, and Aider
- Complete orchestration toolkit: tools, resources, skills, and bundled agents for the full development lifecycle
- Learning injection: high-impact learnings auto-promote into instruction files for future sessions
- Installer UX overhaul with health checks and cross-platform server management
Agent Teams Safety & Type System
- Pre-spawn safety checks: agent creation blocked when uncommitted changes could cause conflicts
- Comprehensive type system: strong typing at all major cross-module boundaries for reliability
- Scoring accuracy improvements: calibrated weights and removed placeholder dimensions
- File ownership enforcement: prevents multiple agents from editing the same files simultaneously
Documentation Overhaul
- Complete rewrite of all documentation pages with progressive disclosure and scannable layouts
- Expanded references covering the full tool, skill, and agent catalog
- SEO overhaul: structured data, custom social cards for key pages, and expanded sitemap
- Full API reference with endpoint docs, auth details, examples, and error formats
Interactive Memory Visualizations
- Interactive memory dashboard: growth timeline, tier migration, smart search, knowledge graph, and forgetting curve visualizations
- Cross-widget interactions: search highlights matching data across dashboard sections
- Homepage growth timeline showing how framework knowledge compounds across tasks
trw-memory: Standalone Memory Engine
- Standalone persistent memory engine — purpose-built for AI coding agents with fast local storage
- Hybrid search combining sparse keyword and dense vector retrieval for high-quality recall
- Knowledge graph, tiered storage lifecycle, semantic dedup, and LLM-powered consolidation
- Integrations with popular AI frameworks: LangChain, LlamaIndex, CrewAI, and OpenAI-compatible APIs
Next Steps
Use quickstart to confirm the current install and access flow, configuration to understand changed defaults, and the API reference when a release touches hosted platform behavior.