rmcp-memex by Loctree - MCP Server

mcp_memex

Lightweight Model Context Protocol (MCP) server written in Rust. It provides a local Retrieval-Augmented Generation (RAG) toolset backed by an embedded LanceDB vector store and local embeddings. If an MLX HTTP server is available, it is used for embeddings and reranking; otherwise the server falls back to on‑device embeddings via fastembed.

Tools exposed to MCP clients

rag_index(path, namespace?) — index a file (UTF‑8 text or PDF) into the local vector store
rag_index_text(text, id?, namespace?, metadata?) — index raw text (UUID generated when id is omitted)
rag_search(query, k=10, namespace?) — search indexed chunks and return the top‑k results
memory_upsert(namespace, id, text, metadata?) — upsert single chunk into vector memory
memory_get(namespace, id) — fetch stored chunk
memory_search(namespace, query, k=5) — semantic search within a namespace
memory_delete(namespace, id) — delete a chunk by id
memory_purge_namespace(namespace) — drop all chunks in a namespace

Overview

Stack: Rust 2021, Tokio, Clap
Vector store: Embedded LanceDB (no external DB needed)
Embeddings: Optional MLX HTTP bridge; automatic fastembed fallback
Caching/persistence: moka (in‑memory) + sled (local key/value)
IO: reqwest for HTTP; pdf-extract for PDF text
Transport: JSON‑RPC over stdin/stdout (compatible with MCP hosts)

Binary entry point: src/bin/mcp_memex.rs (binary name: mcp_memex). Library API exposes ServerConfig + run_stdio_server for embedding; server logs to stdout/stderr and reads JSON‑RPC requests from stdin.

Requirements

Rust toolchain with Cargo (stable)
macOS or Linux (Windows likely works but untested)
Protobuf compiler: required by some dependencies at build time. If you don’t have it, install it (e.g., macOS: brew install protobuf; Linux: apt install protobuf-compiler).
Optional MLX bridge: HTTP server with /v1/embeddings, /v1/rerank, /v1/models.

Quick start

# build
cargo build --release

# run (uses local fastembed by default; LanceDB at ~/.mcp-servers/mcp_memex/lancedb)
cargo run --release -- --log-level info
# logs go to stderr; stdout is reserved for JSON-RPC responses

Embed as a library

use mcp_memex::{run_stdio_server, ServerConfig};

# async context
let config = ServerConfig::default()
    .with_db_path("/tmp/lancedb"); // override as needed
run_stdio_server(config).await?;

Configuration CLI flags (from src/lib.rs)

--features string (default "filesystem,memory,search")
--cache-mb usize (default 4096)
--db-path string (default "~/.mcp-servers/mcp_memex/lancedb")
--log-level trace|debug|info|warn|error (default info)

Environment variables

DISABLE_MLX — if set, disables MLX bridge; fastembed only
DRAGON_BASE_URL — base URL for MLX HTTP (default http://localhost)
MLX_JIT_MODE — "true" to use a single port for all models (default false)
MLX_JIT_PORT — JIT mode port (default 1234)
EMBEDDER_PORT — non‑JIT embeddings port (default 12345)
RERANKER_PORT — non‑JIT rerank port (default 12346)
EMBEDDER_MODEL — embeddings model id (default Qwen/Qwen3-Embedding-4B)
RERANKER_MODEL — reranker model id (default Qwen/Qwen3-Reranker-4B)
FASTEMBED_CACHE_PATH / HF_HUB_CACHE — if unset, the server sets both to $HOME/.cache/fastembed to avoid .fastembed_cache in each cwd
LANCEDB_PATH — overrides the --db-path for the embedded DB (default ~/.mcp-servers/mcp_memex/lancedb)
PROTOC — path to protoc if build.rs cannot find the vendored binary

Example (MLX non‑JIT)

export DRAGON_BASE_URL=http://localhost
export EMBEDDER_PORT=5555
 export RERANKER_PORT=5556

Tools (RPC)

rag_index(path: string, namespace?: string)
- Extracts text (PDF via pdf-extract; others as UTF‑8)
- Chunks to size 512 with overlap 128; embeds (MLX or fastembed)
- Inserts into LanceDB table mcp_documents (auto‑created), default namespace "rag"
rag_index_text(text: string, id?: string, namespace?: string, metadata?: object)
- Single-chunk insert with optional custom id (UUID generated when missing)
- Default namespace "rag"
rag_search(query: string, k: number=10, namespace?: string)
- Embeds the query, searches LanceDB, reranks with MLX if available (cosine fallback)
- Returns id, namespace, text, score, metadata
memory_upsert(namespace: string, id: string, text: string, metadata?: object)
- Convenience wrapper to store a single chunk in a namespace
memory_get(namespace: string, id: string)
- Returns the stored chunk (id, namespace, text, metadata)
memory_search(namespace: string, query: string, k: number=5)
- Semantic search constrained to the namespace (rerank + cosine fallback)
memory_delete(namespace: string, id: string)
memory_purge_namespace(namespace: string)

Scripts

build-macos.sh — builds release and creates a minimal app bundle at ~/.mcp-servers/MCPServer.app with CFBundleExecutable=mcp_memex
install.sh — builds the release binary; pass --bundle-macos to also create the app bundle

Project structure

src/main.rs • src/lib.rs • src/handlers • src/embeddings • src/rag • src/storage
build.rs • build-macos.sh • install.sh • Cargo.toml

Tests

cargo test

License MIT — see LICENSE.

Known limitations

Only text and PDF ingestion are supported (no HTML/Markdown parsing yet)
LanceDB collection name is fixed to mcp_documents (namespacing is handled via a column)
Minimal JSON‑RPC loop intended for MCP hosts; richer transport may be added