KPI Snapshot
4
Loop Stages
Detect → Capture → Store → Apply
3
Storage Tiers
Raw logs → Project rules → Behavioral core
~30%
Error Detection Rate
Confident hallucinations missed
3×
Promotion Threshold
Recurrences before permanent storage
Loop Stage Effectiveness
Stage-by-stage effectiveness (qualitative assessment)
Hook-based
Manual log
Static rules
Session memory
Error detection
High
Medium
Low
Low
Capture quality
Medium
High
Medium
Medium
Storage durability
Medium
High
High
Medium
Application ease
High
Medium
High
Low
Human dependency
Low
Medium
High
Medium
Component Maturity Bars
Error Detection Coverage
Learning Type ROI
Storage Tier Durability
Implementation Comparison
Real implementations assessed
OpenClaw
Self-Improvement
Self-Improvement
CLAUDE.md
/ Cursor Rules
/ Cursor Rules
Session Memory
/ Daily Logs
/ Daily Logs
RAG-based
Learning
Learning
Structured logging
✓ Strict schema
✗ Freeform
~ Semi-structured
~ Depends on embed
Promotion pathway
✓ Built-in
✗ Manual
~ Heartbeat distill
✗ N/A
Cross-session
✓ Persistent files
✓ Persistent files
~ File-based
✓ Vector store
Error hooks
✓ PostToolUse
✗
✗
~ Platform dep.
Recurrence tracking
✓ Count + dates
✗
✗
✗
The Detection Gap
~95%
Detectable
Command failures, timeouts, exceptions
~35%
Partial
Suboptimal output, missed better approaches
~5%
Blind Spot
Confident hallucinations, silent failures
Assessment based on production testing across OpenClaw, Claude Code, and Cursor agent platforms, March 2026. Effectiveness ratings are qualitative — not automated benchmarks.