HOT Tier
Always Read - Files loaded at the start of every AI session
Small, current, actionable information
The Tiered Memory System (TMS) is the core architectural pattern of Cortex TMS. It organizes project documentation by access frequency instead of content type, optimizing for AI agent context windows.
Traditional documentation structures organize files by type:
The issue: AI agents don’t know which files to prioritize. They either:
Instead of organizing by content type, TMS organizes by how often AI agents should read the file:
HOT Tier
Always Read - Files loaded at the start of every AI session
Small, current, actionable information
WARM Tier
Read on Demand - Files loaded when implementing specific features
Timeless reference information
COLD Tier
Ignore Unless Asked - Historical context AI agents should skip
Completed tasks, old changelogs
Purpose: Provide immediate context for the current work sprint.
Characteristics:
Files in HOT Tier:
What belongs in HOT:
What doesn’t belong in HOT:
FUTURE-ENHANCEMENTS.mdPurpose: Provide reference documentation AI agents read when working on specific features.
Characteristics:
Files in WARM Tier:
What belongs in WARM:
What doesn’t belong in WARM:
Purpose: Store historical context AI agents should skip to preserve context budget.
Characteristics:
Files in COLD Tier:
What belongs in COLD:
What doesn’t belong in COLD:
AI agents have limited context windows (typically 32k-200k tokens). Traditional repos waste this budget:
AI reads:- 50 pages of docs (all mixed together)- Can't distinguish current from historical- Context window = 90% noise, 10% signal
Result: AI hallucinates or asks repetitive questionsAI reads:- HOT: 300 lines of current context- WARM: 500 lines when implementing feature X- COLD: Skipped entirely- Context window = 90% signal, 10% overhead
Result: AI follows conventions and works autonomouslyTraditional repos bury critical information in thousands of lines of documentation. TMS forces signal over noise by separating active context from history.
Example:
NEXT-TASKS.md (120 lines), sees “follow docs/core/PATTERNS.md#auth”, jumps directly to the answerThe file location itself communicates priority:
docs/core/? → WARM, read on demanddocs/archive/? → COLD, ignore itNo ambiguity. No “which doc should I read?” questions.
Let’s see how the tiers work in practice.
AI reads HOT tier:
NEXT-TASKS.md says:
## Active Sprint: User Authentication
**Why this matters**: Mobile app needs secure API access
- [ ] JWT token generation- [ ] Token validation middleware- [ ] Refresh token rotationcopilot-instructions.md says:
- Never store tokens in localStorage- Use httpOnly cookies for refresh tokens- Follow patterns in docs/core/PATTERNS.mdAI now knows: Auth is the priority, security rules apply
AI reads WARM tier on demand:
User asks: “Implement JWT token generation”
AI reads docs/core/PATTERNS.md#authentication:
## Authentication Pattern
**Canonical Example**: `src/middleware/auth.ts`
- Use jsonwebtoken library- Sign with RS256 (not HS256)- Set expiry to 15 minutes- Include user_id and role in payloadAI implements correctly following the established pattern
Task moves to COLD tier:
Task marked ✅ Done → Archived to docs/archive/sprint-2026-01.md
# Sprint: User Authentication (2026-01-15 to 2026-01-28)
✅ JWT token generation - Implemented RS256 signing✅ Token validation middleware - Added to Express pipeline✅ Refresh token rotation - httpOnly cookies with 7-day expiryNEXT-TASKS.md is now under 100 lines again, ready for next sprint
What are you working on right now (next 1-2 weeks)?
→ Put this in NEXT-TASKS.md (HOT)
What rules must never be violated (security, money/math)?
→ Put this in .github/copilot-instructions.md (HOT)
For each existing doc file, ask:
Move completed tasks, old changelogs, and deprecated patterns to docs/archive/.
Rule of thumb: If you haven’t referenced it in 2 months, archive it.
Mistake: Adding 6 months of tasks to NEXT-TASKS.md “just in case”
Why it fails: Wastes context budget on tasks that won’t be worked on for months
Fix: Keep only 1-2 weeks in HOT. Move backlog to FUTURE-ENHANCEMENTS.md
Mistake: Keeping completed tasks in NEXT-TASKS.md as a “record”
Why it fails: Historical tasks are noise. AI doesn’t need them.
Fix: Archive completed sprints to docs/archive/sprint-YYYY-MM.md
Mistake: Copying the auth pattern from WARM into HOT “for convenience”
Why it fails: Duplication causes drift when one copy gets updated
Fix: Use canonical links. HOT says “follow WARM file X”, not “here’s a copy”
Mistake: Letting NEXT-TASKS.md grow to 500 lines
Why it fails: Wastes context budget. AI reads all 500 lines on every session.
Fix: Enforce the 200-line limit. Split large tasks into smaller increments.
Faster AI Performance
AI agents read only what they need, when they need it. No more scanning thousands of lines to find one pattern.
Consistent Behavior
AI follows established patterns because they’re easy to find in WARM tier. No more hallucinated conventions.
Self-Maintaining Docs
Regular archiving prevents documentation from growing unbounded. Docs stay lean automatically.
Onboarding Simplified
New team members (human or AI) know exactly where to look: HOT for current work, WARM for reference, COLD never.
Now that you understand the Tiered Memory System: