Memory Systems Skill

Overview

This skill addresses agent persistence across sessions through layered memory architectures that balance immediate context with long-term knowledge retention. Effective memory systems enable agents to learn, maintain consistency, and reason over accumulated knowledge.

Quick Start

  1. Identify needs - What must persist? (entities, decisions, patterns)
  2. Choose architecture - File-based, vector, graph, or hybrid
  3. Design retrieval - How will memory be accessed?
  4. Implement storage - With temporal validity
  5. Monitor growth - Prune and consolidate regularly
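As an illustration, the outcome of steps 1 and 2 can be captured in a small configuration object before any storage code is written; the field names below are hypothetical, not part of a fixed API:

from dataclasses import dataclass, field
from typing import List

@dataclass
class MemoryConfig:
    # What must persist (step 1) and which architecture serves it (step 2)
    persisted_items: List[str] = field(default_factory=lambda: ["entities", "decisions", "patterns"])
    architecture: str = "hybrid"               # "file", "vector", "graph", or "hybrid"
    retrieval_mode: str = "semantic+temporal"  # how memory will be accessed (step 3)
    consolidate_after_days: int = 30           # pruning/consolidation cadence (step 5)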

When to Use

  • Building cross-session agents
  • Maintaining entity consistency
  • Implementing reasoning over accumulated knowledge
  • Designing learning systems
  • Creating growing knowledge bases
  • Building temporally aware state tracking

Memory Spectrum

Memory ranges from volatile to permanent:

| Layer          | Persistence    | Latency | Capacity  |
|----------------|----------------|---------|-----------|
| Working memory | Context window | Zero    | Limited   |
| Short-term     | Session-scoped | Low     | Moderate  |
| Long-term      | Cross-session  | Medium  | Large     |
| Archival       | Permanent      | High    | Unlimited |

Effective systems layer multiple types:

  • Working memory - Current context window
  • Short-term - Session facts, active tasks
  • Long-term - Learned patterns, entity knowledge
  • Entity-specific - Per-entity history
  • Temporal graphs - Time-aware relationships
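A minimal sketch of how these layers might be composed, assuming in-process dicts for the volatile layers and a durable backend such as the FileMemory class shown below; class and method names are illustrative:

from typing import Any, Optional

class LayeredMemory:
    """Illustrative sketch: consult fast, volatile layers before slower, durable ones."""

    def __init__(self, long_term):
        self.working: dict = {}      # current context window facts
        self.short_term: dict = {}   # session-scoped facts
        self.long_term = long_term   # cross-session store, e.g. the FileMemory class below

    def recall(self, key: str) -> Optional[Any]:
        if key in self.working:
            return self.working[key]
        if key in self.short_term:
            return self.short_term[key]
        return self.long_term.retrieve(key)

    def remember(self, key: str, value: Any, durable: bool = False):
        self.working[key] = value
        self.short_term[key] = value
        if durable:
            self.long_term.store(key, {"value": value})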

Architecture Options

1. File-System-as-Memory

Structure:

memory/
├── entities/
│   └── {entity_id}.json
├── sessions/
│   └── {session_id}/
├── knowledge/
│   └── {topic}.md
└── index.json

Pros: Simple, debuggable, version-controlled
Cons: No semantic search, manual organization

Implementation:

import json
from datetime import datetime
from pathlib import Path
from typing import Optional

class FileMemory:
    def __init__(self, base_path: str):
        self.base = Path(base_path)

    def store(self, key: str, value: dict, category: str = "general"):
        # One JSON file per key under memory/<category>/
        path = self.base / category / f"{key}.json"
        path.parent.mkdir(parents=True, exist_ok=True)
        value["_stored_at"] = datetime.utcnow().isoformat()
        path.write_text(json.dumps(value, indent=2))

    def retrieve(self, key: str, category: str = "general") -> Optional[dict]:
        path = self.base / category / f"{key}.json"
        if path.exists():
            return json.loads(path.read_text())
        return None

2. Vector RAG with Metadata

Structure:

from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional

@dataclass
class MemoryEntry:
    id: str
    content: str
    embedding: List[float]
    metadata: dict  # entity_tags, temporal_validity, confidence
    created_at: datetime = field(default_factory=datetime.utcnow)
    valid_from: Optional[datetime] = None
    valid_until: Optional[datetime] = None

Pros: Semantic search, scalable
Cons: Loses relationship information, no temporal queries

Enhancement with metadata:

def search_with_temporal_filter(
    query: str,
    as_of: Optional[datetime] = None,
    entity_filter: Optional[List[str]] = None
) -> List[MemoryEntry]:
    # Semantic search first, then drop entries that are expired or off-entity
    results = vector_search(query)
    return [r for r in results
            if r.is_valid_at(as_of or datetime.utcnow())
            and (not entity_filter or r.has_entity(entity_filter))]

3. Knowledge Graph

Structure:

Entities: [Person, Project, Decision, Event]
Relations: [owns, participates_in, decided_by, happened_at]

Pros: Preserves relationships, relational queries
Cons: Complex setup, query-language learning curve

Key capability:

MATCH (p:Person)-[:PARTICIPATES_IN]->(proj:Project)
      -[:HAS_DECISION]->(d:Decision)
WHERE d.date > $since
RETURN p.name, d.description, d.date
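To show how this query might be issued from Python, here is a sketch assuming a Neo4j backend and the official neo4j driver; the connection URI, credentials, and cutoff date are placeholders:

from datetime import datetime
from neo4j import GraphDatabase

# Placeholder URI and credentials; adjust for your deployment.
driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

query = """
MATCH (p:Person)-[:PARTICIPATES_IN]->(proj:Project)-[:HAS_DECISION]->(d:Decision)
WHERE d.date > $since
RETURN p.name, d.description, d.date
"""

with driver.session() as session:
    for record in session.run(query, since=datetime(2025, 1, 1)):
        print(record["p.name"], record["d.description"], record["d.date"])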

4. Temporal Knowledge Graph

Structure:

from dataclasses import dataclass
from datetime import datetime
from typing import Optional

@dataclass
class TemporalFact:
    subject: str
    predicate: str
    object: str
    valid_from: datetime
    valid_until: Optional[datetime]  # None means the fact is still valid
    source: str
    confidence: float

Pros: Time-travel queries, fact evolution tracking
Cons: Most complex, highest overhead

Capability example:

# What was the project status on date X?
facts = temporal_graph.query_as_of(
    subject="project-alpha",
    predicate="has_status",
    as_of=datetime(2025, 6, 15)
)
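A minimal in-memory sketch of what such a query_as_of filter could look like over a list of TemporalFact records; it illustrates the temporal semantics rather than the API of any particular graph store:

from datetime import datetime
from typing import List

def query_as_of(
    facts: List[TemporalFact],
    subject: str,
    predicate: str,
    as_of: datetime
) -> List[TemporalFact]:
    """Return facts about (subject, predicate) that were valid at the given instant."""
    return [
        f for f in facts
        if f.subject == subject
        and f.predicate == predicate
        and f.valid_from <= as_of
        and (f.valid_until is None or as_of < f.valid_until)
    ]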

Performance Benchmarks

| Architecture | Accuracy | Retrieval Time | Best For              |
|--------------|----------|----------------|-----------------------|
| Temporal KG  | 94.8%    | 2.58s          | Complex relationships |
| GraphRAG     | 75-85%   | Variable       | Balanced              |
| Vector RAG   | 60-70%   | Fast           | Simple semantic       |
| File-based   | N/A      | Fast           | Simple persistence    |

Vector Store Limitations

Problems:

  • "Vector stores lose relationship information"
  • Cannot answer queries traversing relationships
  • Lack temporal mechanisms for current vs. outdated facts

Example failure:

Query: "Who approved the decision that affected Project X?"
Vector RAG: Returns documents mentioning approvals and Project X
            but cannot connect the relationship chain

Solution: Combine vector search with graph traversal:

def hybrid_query(query: str):
    # Semantic search for relevant entities
    entities = vector_search(query)

    # Graph traversal for relationships
    for entity in entities:
        related = graph.traverse(entity.id, max_depth=2)
        entity.relationships = related

    return entities

Memory Lifecycle

Writing

def store_memory(
    content: str,
    category: str,
    entities: List[str],
    valid_from: Optional[datetime] = None,
    valid_until: Optional[datetime] = None,
    confidence: float = 1.0
):
    entry = MemoryEntry(
        id=generate_id(),
        content=content,
        embedding=embed(content),
        metadata={
            "category": category,
            "entities": entities,
            "confidence": confidence
        },
        valid_from=valid_from or datetime.utcnow(),
        valid_until=valid_until
    )
    storage.save(entry)
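For example, recording a project status change might look like this (the content, date, and confidence are made up for illustration):

store_memory(
    content="Project Alpha entered the review phase",
    category="status",
    entities=["project-alpha"],
    valid_from=datetime(2025, 6, 1),
    confidence=0.9
)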

Reading

def recall_memory(
    query: str,
    context: dict,
    as_of: Optional[datetime] = None,
    limit: int = 10
) -> List[MemoryEntry]:
    # 1. Semantic search (over-fetch so filtering still leaves enough candidates)
    candidates = vector_search(query, limit=limit * 3)

    # 2. Temporal filtering
    as_of = as_of or datetime.utcnow()
    valid = [c for c in candidates if c.is_valid_at(as_of)]

    # 3. Context relevance scoring
    scored = [(c, relevance_score(c, context)) for c in valid]

    # 4. Return the top entries by relevance
    scored.sort(key=lambda x: x[1], reverse=True)
    return [entry for entry, _ in scored[:limit]]

Consolidation

def consolidate_memories(category: str, older_than_days: int = 30, threshold: int = 5):
    """Combine related old memories into summaries."""
    old_memories = get_memories(
        category=category,
        before=datetime.utcnow() - timedelta(days=older_than_days)
    )

    # Group by entity so each summary stays entity-scoped
    grouped = group_by_entity(old_memories)

    for entity, memories in grouped.items():
        if len(memories) > threshold:
            summary = generate_summary(memories)
            store_memory(summary, category="consolidated", entities=[entity])
            archive_memories(memories)

Best Practices

Do

  1. Match architecture to query requirements
  2. Implement progressive disclosure for memory access
  3. Use temporal validity to prevent outdated info conflicts
  4. Consolidate periodically to manage growth
  5. Design graceful retrieval failures
  6. Monitor storage size and query performance
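Point 5 (graceful retrieval failures) might look like the following sketch, where a failed lookup degrades to an empty result instead of interrupting the agent; the logging and fallback policy are illustrative choices, not a required API:

import logging
from typing import List

def safe_recall(query: str, context: dict, limit: int = 10) -> List[MemoryEntry]:
    """Fall back to an empty result set if memory retrieval fails."""
    try:
        return recall_memory(query, context, limit=limit) or []
    except Exception as exc:  # e.g. store unavailable, index corruption
        logging.warning("Memory retrieval failed, continuing without recall: %s", exc)
        return []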

Don't

  1. Store everything (be selective)
  2. Ignore temporal validity
  3. Mix fact types without categorization
  4. Skip consolidation indefinitely
  5. Trust old memories without verification
  6. Ignore retrieval latency in design

Error Handling

| Error               | Cause                   | Solution                    |
|---------------------|-------------------------|-----------------------------|
| Stale data returned | Missing temporal filter | Add validity checks         |
| Contradictory facts | Multiple sources        | Use confidence scoring      |
| Memory bloat        | No consolidation        | Implement periodic cleanup  |
| Slow retrieval      | Index issues            | Optimize embeddings/indexes |
| Lost relationships  | Vector-only storage     | Add graph layer             |
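For the contradictory-facts row, one way confidence scoring could work is to keep a single fact per (subject, predicate), preferring higher confidence and then recency; a sketch over the TemporalFact model above:

from typing import Dict, List, Tuple

def resolve_conflicts(facts: List[TemporalFact]) -> List[TemporalFact]:
    """Keep one fact per (subject, predicate): highest confidence, then most recent."""
    best: Dict[Tuple[str, str], TemporalFact] = {}
    for fact in facts:
        key = (fact.subject, fact.predicate)
        current = best.get(key)
        if current is None or (fact.confidence, fact.valid_from) > (current.confidence, current.valid_from):
            best[key] = fact
    return list(best.values())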

Metrics

| Metric             | Target       | Description                  |
|--------------------|--------------|------------------------------|
| Retrieval accuracy | >85%         | Relevant results returned    |
| Temporal accuracy  | >95%         | Correct time-based filtering |
| Storage efficiency | <100MB/month | Reasonable growth            |
| Query latency      | <500ms       | P95 retrieval time           |
| Consolidation rate | Monthly      | Old memories summarized      |
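Query latency against the P95 target can be tracked with a simple rolling sample; a sketch, where the window size and wrapper name are arbitrary:

import statistics
import time
from collections import deque

class LatencyTracker:
    """Rolling window of recent retrieval latencies for the P95 target above."""
    def __init__(self, window: int = 1000):
        self.samples = deque(maxlen=window)

    def timed_recall(self, query: str, context: dict):
        start = time.perf_counter()
        results = recall_memory(query, context)
        self.samples.append(time.perf_counter() - start)
        return results

    def p95_ms(self) -> float:
        if len(self.samples) < 2:
            return 0.0
        return statistics.quantiles(self.samples, n=20)[-1] * 1000  # 95th percentile in ms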

Related Skills


Version History

  • 1.0.0 (2026-01-19): Initial release adapted from Agent-Skills-for-Context-Engineering