Your agent processed 10,000 conversations yesterday. And learned nothing.
A lightweight Python SDK that wraps any LLM with persistent memory, autonomous reflection cycles, and an internal critic that evaluates every output against the agent's self-authored rules.
22% → 2% fabrication rate on the same model (Claude Haiku). n=45, single corpus, broader eval in progress.
Generic responses. No context. Starts from zero like every other agent.
Patterns start appearing. First rules proposed. Memory shapes search results.
Agent behaves differently. Self-authored rules active. Dream cycles surfacing insights.
You don’t want to lose it. Accumulated judgment that can’t be recreated from scratch.
Install the SDK. Wrap your agent. It remembers conversations, reflects on patterns, writes its own rules, and evaluates its own output — all persisted across sessions. Your agent on Day 30 is measurably different from your agent on Day 1.
Agent records real interactions into three-tier memory with semantic search.
Dream cycles extract patterns, synthesize journals, and audit existing rules.
Agent writes and applies its own behavioral rules. Critic evaluates every output.
```python
# pip install "wisdom-layer[ollama]"
# Works with local models (Ollama) or cloud providers

from wisdom_layer import WisdomAgent, AgentConfig
from wisdom_layer.llm.ollama import OllamaAdapter
from wisdom_layer.storage.sqlite import SQLiteBackend

llm = OllamaAdapter(model="llama3.2")

agent = WisdomAgent(
    agent_id="support-agent",
    config=AgentConfig(name="Support Agent"),
    llm=llm,
    backend=SQLiteBackend("./agent.db", embed_fn=llm.embed),
)
await agent.initialize()

# Agent remembers this conversation
await agent.memory.capture("conversation", {"user": msg})

# Agent reflects overnight — reconsolidates, audits, journals
report = await agent.dreams.trigger()

# Agent evaluates its own output against learned rules
review = await agent.critic.evaluate(response)
```
Core subsystems, built and tested. When an agent captures experience, reflects on it, writes rules from it, and evaluates against those rules — it develops persistent identity across sessions. No fine-tuning. No retraining. Just architecture.
Built for production: spend ceilings, append-only provenance, and longitudinal health monitoring on every paid tier.
Raw events → consolidated knowledge → reflective journals. Semantic search across all tiers. Automatic salience scoring and decay. Every insight traces back to source.
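To illustrate the decay idea, here is a minimal sketch in plain Python. The exponential formula and the `half_life_days` parameter are assumptions for illustration, not the SDK's actual scoring: the point is that a raw event's salience fades on a fixed half-life, so stale events fall down the search ranking while consolidated knowledge persists.

```python
import math

def decayed_salience(initial: float, age_days: float, half_life_days: float = 30.0) -> float:
    """Exponential decay: a memory's salience halves every `half_life_days`."""
    return initial * math.exp(-math.log(2) * age_days / half_life_days)

# A memory captured 30 days ago at salience 0.8 has decayed to 0.4
print(round(decayed_salience(0.8, 30.0), 2))  # → 0.4
```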
Autonomous reflection pipeline: reconsolidate memories, evolve directives, audit coherence, synthesize journals. Schedulable with cron-like intervals or trigger on-demand.
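The phases named above can be sketched as a sequential pipeline that threads accumulated state through each step. Phase names mirror the copy; the phase bodies are placeholder stand-ins, not SDK internals:

```python
from typing import Callable

# Hypothetical phase implementations; real phases would call the LLM.
DREAM_PHASES: list[tuple[str, Callable[[dict], dict]]] = [
    ("reconsolidate", lambda state: {**state, "memories_merged": 12}),
    ("evolve",        lambda state: {**state, "directives_proposed": 2}),
    ("audit",         lambda state: {**state, "coherence_ok": True}),
    ("journal",       lambda state: {**state, "journal_written": True}),
]

def run_dream_cycle(state: dict) -> dict:
    """Run each phase in order, threading the accumulated state through."""
    for name, phase in DREAM_PHASES:
        state = phase(state)
        state.setdefault("completed", []).append(name)
    return state

report = run_dream_cycle({})
print(report["completed"])  # → ['reconsolidate', 'evolve', 'audit', 'journal']
```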
Evaluates agent output against active directives in real time. Catches narrative inflation, confidence miscalibration, and source grounding failures before they reach your users.
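A toy version of the critic's shape, assuming simple keyword checks where the real system presumably uses LLM-based evaluation; the directive names and checks here are invented for illustration:

```python
from dataclasses import dataclass

@dataclass
class Finding:
    directive: str
    passed: bool

# Illustrative directives; not the SDK's rule format.
DIRECTIVES = {
    "no-absolute-claims": lambda text: "guaranteed" not in text.lower(),
    "cite-sources": lambda text: "according to" in text.lower() or "[source]" in text,
}

def critic_evaluate(text: str) -> list[Finding]:
    """Check a draft response against every active directive."""
    return [Finding(name, check(text)) for name, check in DIRECTIVES.items()]

findings = critic_evaluate("This fix is guaranteed to work.")
print([f.directive for f in findings if not f.passed])
# → ['no-absolute-claims', 'cite-sources']
```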
Agents propose their own behavioral rules from experience. Rules follow a lifecycle: provisional → active → permanent. Human-approved, automatically enforced.
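The provisional → active → permanent lifecycle with a human-approval gate can be modeled as a small state machine. The transition table and the `decayed` terminal state are assumptions inferred from the feature list, not the SDK's actual rules:

```python
ALLOWED = {
    "provisional": {"active", "decayed"},
    "active": {"permanent", "decayed"},
    "permanent": set(),
    "decayed": set(),
}

class Directive:
    def __init__(self, text: str):
        self.text = text
        self.state = "provisional"

    def promote(self, to: str, human_approved: bool = False) -> None:
        """Advance the lifecycle; promotions require explicit human approval."""
        if to not in ALLOWED[self.state]:
            raise ValueError(f"cannot move {self.state} -> {to}")
        if to in {"active", "permanent"} and not human_approved:
            raise PermissionError("promotion requires human approval")
        self.state = to

d = Directive("Always cite the memory a claim came from.")
d.promote("active", human_approved=True)
print(d.state)  # → active
```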
Composite wisdom score (0–1) snapshotted daily. Cognitive-state classifier (healthy / stagnant / drifting / overloaded). 30-day trajectory window on Pro, unlimited on Enterprise. Catches drift before it ships.
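A rough sketch of how a trajectory window might feed the cognitive-state classifier. The thresholds are invented for illustration and the `overloaded` state is omitted; the SDK's actual heuristics are not described here:

```python
def classify(scores: list[float]) -> str:
    """Classify cognitive state from a window of daily wisdom scores (0-1)."""
    if len(scores) < 2:
        return "healthy"
    delta = scores[-1] - scores[0]
    if delta < -0.1:
        return "drifting"   # sustained decline across the window
    if abs(delta) <= 0.02 and max(scores) - min(scores) <= 0.02:
        return "stagnant"   # flat trajectory, no learning signal
    return "healthy"

print(classify([0.62, 0.64, 0.70]))  # → healthy
```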
Every mutation logged: memory captures, directive promotions, dream phases, snapshots. `agent.provenance.trace()` for any entity, `.explain()` (Enterprise) for narrated chains, `.export()` (Enterprise) for compliance archival.
Hard-enforced spend ceilings on three windows: daily, monthly, and per-cycle. Calls fail at the cap, not warnings in a log. Pre-flight cost estimate before any dream cycle so you decide before you spend. Per-call metering, CSV export on Enterprise.
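The fail-at-the-cap behavior can be sketched against a single window. Class and method names here are illustrative, not the SDK's API:

```python
class BudgetExceeded(RuntimeError):
    pass

class BudgetGuard:
    def __init__(self, daily_cap_usd: float):
        self.daily_cap_usd = daily_cap_usd
        self.spent_today = 0.0

    def charge(self, cost_usd: float) -> None:
        """Reject the call at the cap instead of logging a warning."""
        if self.spent_today + cost_usd > self.daily_cap_usd:
            raise BudgetExceeded(
                f"daily cap ${self.daily_cap_usd:.2f} would be exceeded"
            )
        self.spent_today += cost_usd

guard = BudgetGuard(daily_cap_usd=1.00)
guard.charge(0.75)        # within the cap
try:
    guard.charge(0.50)    # would push the total to $1.25: rejected
except BudgetExceeded as e:
    print(e)
```

The same pre-check is the shape of the pre-flight estimate: compute the cycle's projected cost, then charge or refuse before any LLM call is made.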
Drop-in LangGraph nodes (recall, capture, dream, directives). MCP server for Claude Code and Cursor. LangChain BaseStore adapter. See docs →
Browser-based visualization of your agent's cognitive architecture. Health gauges, directive lifecycle, memory search, dream history, and full configuration panel. `pip install "wisdom-layer[dashboard]"`
Anthropic, OpenAI, Gemini, Ollama (local), LiteLLM (100+ providers), and CallableAdapter for custom inference. Model-agnostic by design. Zero vendor lock-in.
Compiled feature enforcement (Cython-built, monkeypatch-resistant). Ed25519-signed license claims (verified locally, no network round-trip). Zero telemetry — your agent counts, traffic patterns, and deployment topology stay on your infrastructure. Read more →
The Wisdom Layer SDK formalizes patterns extracted from work across three domains; it packages what already worked.
7 agents running continuously for 6+ months on a digital-brain architecture inspired by functional neuroscience. Source material for the SDK and the benchmarks published on this page.
A computational pharmacogenomics platform built on the pre-SDK architecture is producing cross-platform-validated biomarker candidates. Active collaboration discussions with academic cancer centers.
Loom-code, an internal AI-assisted coding tool built on the same architecture, runs across 20+ of my own development repositories. Each repo accumulates hundreds of memories that distill into 10–15 targeted directives, measurably reducing error rates in agent-driven code generation.
Patent pending. 1,498 tests passing. v1.0 live on PyPI — model-agnostic, zero infrastructure required (SQLite included). Integrates with LangGraph, MCP, and LangChain.
45 questions. Same model (Claude Haiku). Same prompts. The only variable: the Wisdom Layer architecture.
Single-corpus early eval — broader multi-corpus evaluation in progress.
Pro tier only. Real-time Critic not engaged. Same model both conditions. Full methodology on request.
We’d rather you know exactly where we are. Everything marked “Shipped” is tested, documented, and available now via `pip install wisdom-layer`. Everything else has a timeline.
| Capability | Status |
|---|---|
| Three-tier memory (capture, search, consolidate, decay) | Shipped |
| Dream cycles (reconsolidate, evolve, audit, journal, synthesize) | Shipped |
| Internal critic (evaluate, audit, veto) | Shipped |
| Directive evolution (propose, promote, decay, lifecycle) | Shipped |
| 6 LLM adapters (Anthropic, OpenAI, Gemini, Ollama, LiteLLM, Callable) | Shipped |
| SQLite & Postgres backends | Shipped |
| Dream scheduling (cron-like intervals, pause/resume) | Shipped |
| Full provenance tracking & LLM-narrated explain | Shipped |
| Health analytics (wisdom score, cognitive classifier, trajectory) | Shipped |
| Cost visibility & budget guards (daily/monthly caps) | Shipped |
| Export/import, cross-backend clone, re-embed | Shipped |
| Retry policy, graceful shutdown, dream checkpointing | Shipped |
| Dashboard (health, directives, memory, dreams, config) | Shipped |
| LangGraph nodes (recall, capture, dream, directives) | Shipped |
| MCP server (Claude Code, Cursor, Windsurf) | Shipped |
| LangChain adapter (WisdomStore + legacy BaseMemory) | Shipped |
| Feature flags & tier enforcement | Shipped |
| SyncWisdomAgent (blocking wrapper for scripts/Jupyter) | Shipped |
| PyPI distribution & Cython-compiled internals | Shipped |
| v1.0 GA (public launch) | Shipped |
| Multi-agent coordination | Planned — v1.1+ |
Start free. Upgrade when your agents need full behavioral evolution.
For your team or product. Multi-tenant deployments (one agent per customer) typically move to Enterprise.
Upgrade to Pro
Pricing reflects founding rates and is subject to change. Final terms are confirmed in your service agreement.
Technical deep dives into the architecture and the results.
Same model. Same questions. 97.8% accuracy vs 77.8%. 10× fewer fabrications. The only variable was the Wisdom Layer.
Why agents that process 10,000 conversations a day learn nothing from any of them.
A small model autonomously designed a framework to catch its own confabulation.
Memory scaling works. Retrieval isn’t judgment — and judgment is what breaks in production.
1,498 passing tests. Full integrations. Production dashboard. Install in 30 seconds.
Or reach out directly: jeff@rhatigan.ai
Built by Jeff Rhatigan over 9 months, drawing on a research platform of 7 persistent agents in continuous operation. The SDK is the formalization of what worked. The same architecture now powers a computational pharmacogenomics research platform and the loom-code AI coding utility across 20+ repos. If you’re building agents that need to get better over time, I’d like to hear what you’re working on.