Semantic Memory, Running Entirely in Your Browser

Preparing…

Add a memory

Semantic search

Search enables once the embedding model (~23 MB) finishes loading. It is fetched from this site only — no CDN, no third parties.

Stored memories

Verify it yourself: open your browser's network tab. After the initial same-origin asset load, this page makes no further requests — embedding, storage and search all happen locally. Reloading the page discards everything.

This is the real stack, shrunk to a page

The demo uses the same components as the self-hosted MCP Memory Service — just without persistence, consolidation, or multi-client access.

In this demo

sqlite-vec (WASM, in-memory)

→

In the service

sqlite-vec backend with persistent storage, WAL, hybrid cloud sync

In this demo

all-MiniLM-L6-v2 ONNX in the browser

→

In the service

the same ONNX embedding model, server-side — no GPU, no API keys

In this demo

one page, one session

→

In the service

MCP + REST + OAuth 2.1 for 20+ AI clients, knowledge graph, consolidation

pip install mcp-memory-service

Back to mcpmemory.services View the source