v1.1

Shared memory for agents and the people they work with

Shared memory for agents and the people they work with

Memory API for agent builders — extracts facts and work artifacts from conversations, versions them as they change, and shares them across groups.

Your AI tool has access to everything, but it still doesn't understand your business. With XTrace, it does.

Memory Hub

···

XTrace Memory API

···

XTrace Vector Database

···

Memory Hub

···

XTrace Memory API

···

XTrace Vector Database

···

Memory Hub

XTrace Memory API

XTrace Memory API

XTrace Vector Database

XTrace Vector Database

Benchmarks

The numbers that matter in production.

92.3

%

LOCOMO

Long-form conversational memory. Highest reported score on the leaderboard.

92.3

%

LOCOMO

Long-form conversational memory. Highest reported score on the leaderboard.

95

%

fewer tokens

vs. dumping chat history. Because resolved beliefs beat raw transcripts.

95

%

fewer tokens

vs. dumping chat history. Because resolved beliefs beat raw transcripts.

<80

ms

p50 recall latency

p99 < 220ms. Fast enough for voice. Global edge on Enterprise.

<80

ms

p50 recall latency

p99 < 220ms. Fast enough for voice. Global edge on Enterprise.

LOCOMO — long-term conversational memory

Higher is better. Evaluated on the public LOCOMO suite (Snap Research). Competitor scores from published vendor papers & blogs.

XTrace Memory

92.3%

Mem0 (selective)

91.6%

Zep

75.1%

Letta (filesystem)

74.0%

OpenAI Memory

~52.9%

Notes: Mem0 score = their 2026 token-efficient pipeline (mem0.ai/research). Zep = Zep's own corrected number per their rebuttal (getzep.com). Letta = their published filesystem-only baseline. OpenAI

Memory is estimated from Mem0's comparative "26% relative lift over OpenAI" figure.

LOCOMO — long-term conversational memory

Higher is better. Evaluated on the public LOCOMO suite (Snap Research). Competitor scores from published vendor papers & blogs.

XTrace Memory

92.3%

Mem0 (selective)

91.6%

Zep

75.1%

Letta (filesystem)

74.0%

OpenAI Memory

~52.9%

Notes: Mem0 score = their 2026 token-efficient pipeline (mem0.ai/research). Zep = Zep's own corrected number per their rebuttal (getzep.com). Letta = their published filesystem-only baseline. OpenAI

Memory is estimated from Mem0's comparative "26% relative lift over OpenAI" figure.

Under the hood

Why not just a RAG or vector DB?

A vector DB answers "what words are most similar?" A memory engine answers "what's true right now?" Those are different queries, and they need different data structures.

Dimension

Naive RAG

"what words are most similar?"

XTrace Memory

"what's true right now?"

What it stores

What it stores

Embeddings of strings. No notion of "true right now."

Embeddings of strings. No notion of "true right now."

Structured facts with timestamps, sources, and

confidence.

Structured facts with timestamps, sources, and

confidence.

Conflicts

Conflicts

Contradictions coexist forever. Retrieval picks whichever scores higher.

Contradictions coexist forever. Retrieval picks whichever scores higher.

Newer facts revise older ones. Retractions are first- class.

Newer facts revise older ones. Retractions are first- class.

Lineage

Lineage

None. You can't answer why do you believe this?

None. You can't answer why do you believe this?

Every answer carries lineage. One call returns the fact + its provenance.

Every answer carries lineage. One call returns the fact + its provenance.

Time decay

Time decay

None. A fact from Jan 2024 ranks with a fact from yesterday.

None. A fact from Jan 2024 ranks with a fact from yesterday.

Decay curves per fact type. Preferences age differently than contracts.

Decay curves per fact type. Preferences age differently than contracts.

Integration

You'll end up writing a belief layer on top.

You'll end up writing a belief layer on top.

One SDK call. Real Intelligence right out of the box. We handle all the complexities for you.

One SDK call. Real Intelligence right out of the box. We handle all the complexities for you.

How it works

One write. One recall.
A belief engine in the middle.

Click a node on the timeline to see how memory builds up with XTrace

use cases

Our memory is the difference between an agent and a chatbot.

Support agents

Know the user's plan, their last three tickets, and the policy changes since they signed up. Stop asking "what's your order number?"

Coding agents

Remember which patterns this repo favors, what conventions were deprecated last month, and which files the user owns.

Voice agents

<80ms p50 is the latency budget you need. Facts-only recall means short prompts — fewer tokens, lower cost, faster TTFB.

Research agents

Track what's been read, what's been ruled out, and which claims are still load- bearing. Supersede chains = reproducible research.

Companions

Users want an AI that actually knows them. Not one that forgot their dog's name between sessions.

Workflow / ops agents

Long-running agents that span days and tools. Lineage matters here more than anywhere, XTrace Memory gives you audit + replay.

Same engine, two surfaces

Your agent has memory. Your team should too.

Your agent has memory.
Your team should too.

Most Memory API customers ship their agent, then roll memory out to the rest of the team. That's Memory Hub: the same belief engine, surfaced as a product humans can use.

you are here

Memory API

Memory for your agent

A drop-in SDK. Your code, your agent, your stack. One engine per application.

·

Typescript SDK and API

·

Belief-revision based long term memory system

·

Scoped per agent · per user · per group

·

Competitive pricing

Memory Hub

Memory for your company

A platform your whole team uses. Captures across every AI tool. Serves every teammate and every agent.

·

UI for humans · API for agents

·

Notion, Slack, Drive, ChatGPT, Claude...

·

Audience tiers · E2E Personal Space

·

Per-seat · starts at team-size of 5

Per-seat · starts at team-size of 3