mcp-pdf-reader

3.4

If you are the rightful owner of mcp-pdf-reader and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

A PDF file reading server based on FastMCP, supporting PDF text extraction, OCR recognition, and image extraction via the MCP protocol.

📄 MCP PDF Server

A PDF file reading server based on FastMCP.

Supports PDF text extraction, OCR recognition, and image extraction via the MCP protocol, with a built-in web debugger for easy testing.

🚀 Features

read_pdf_text
Extracts normal text from a PDF (page by page).
read_by_ocr
Uses OCR to recognize text from scanned or image-based PDFs.
read_pdf_images
Extracts all images from a specified PDF page (Base64 encoded output).

📂 Project Structure

mcp-pdf-server/
├── pdf_resources/        # Directory for uploaded and processed PDF files
├── txt_server.py         # Main server entry point
└── README.md             # Project documentation

⚙️ Installation

Recommended Python version: 3.9+

pip install pymupdf mcp

Note: To use OCR features, you may need a MuPDF build with OCR support or external OCR libraries.

🔦 Start the Server

Run the following command:

python txt_server.py

You should see logs like:

Serving on http://127.0.0.1:6231

🌐 Web Debugging Interface

Open your browser and visit:

http://127.0.0.1:6231

Select a tool from the left panel
Fill in parameters on the right panel
Click "Run" to test the tool

No coding required — easily debug and test via the web UI.

🛠️ API Tool List

Tool	Description	Input Parameters	Returns
`read_pdf_text`	Extracts normal text from PDF pages	`file_path`, `start_page`, `end_page`	List of page texts
`read_by_ocr`	Recognizes text via OCR	`file_path`, `start_page`, `end_page`, `language`, `dpi`	OCR extracted text
`read_pdf_images`	Extracts images from a PDF page	`file_path`, `page_number`	List of images (Base64 encoded)

📝 Example Usage

Extract text from pages 1 to 5:

mcp run read_pdf_text --args '{"file_path": "pdf_resources/example.pdf", "start_page": 1, "end_page": 5}'

Perform OCR recognition on page 1:

mcp run read_by_ocr --args '{"file_path": "pdf_resources/example.pdf", "start_page": 1, "end_page": 1, "language": "eng"}'

Extract all images from page 3:

mcp run read_pdf_images --args '{"file_path": "pdf_resources/example.pdf", "page_number": 3}'

📢 Notes

Files must be placed inside the pdf_resources/ directory, or an absolute path must be provided.
OCR functionality requires appropriate OCR support in the environment.
When processing large files, adjust memory and timeout settings as needed.

📜 License

This project is licensed under the MIT License.
For commercial use, please credit the original source.

Related MCP Servers

View all file_systems servers →

markdownify-mcp

4.6

by zcaceres

Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format.

file_systems

DesktopCommanderMCP

4.4

by wonderwhy-er

Desktop Commander MCP is a tool that allows users to search, update, manage files, and run terminal commands using AI, without incurring API token costs.

file_systems

Filesystem MCP Server

4.3

by modelcontextprotocol

Node.js server implementing Model Context Protocol (MCP) for filesystem operations.

file_systems

Office-Word-MCP-Server

4.3

by GongRzhe

A Model Context Protocol (MCP) server for creating, reading, and manipulating Microsoft Word documents.

file_systems

edgeone-pages-mcp

4.2

by TencentEdgeOne

An MCP service for deploying HTML content, folder, and zip file to EdgeOne Pages and obtaining a publicly accessible URL.

cloud_platforms

mcp-obsidian

3.8

by MarkusPfundstein

MCP server to interact with Obsidian via the Local REST API community plugin.

file_systems

filesystem

3.7

by javillegasna

Python server implementing Model Context Protocol (MCP) for filesystem operations.

file_systems

mcp-storage-server

3.7

by storacha

Storacha MCP Storage Server is a Model Context Protocol server implementation for decentralized storage, enabling AI applications to store and retrieve files through a standardized interface.

file_systems

mcp-claude-code

3.6

by SDGLBL

An implementation of Claude Code capabilities using the Model Context Protocol (MCP).

developer_tools

mcp-apple-notes

3.6

by RafalWilinski

A Model Context Protocol (MCP) server that enables semantic search and RAG over Apple Notes, allowing AI assistants like Claude to search and reference notes during conversations.

knowledge_and_memory

applescript-mcp

3.6

by joshrutkowski

A Model Context Protocol server that enables LLM applications to interact with macOS through AppleScript, providing a standardized interface for AI applications to control system functions, manage files, handle notifications, and more.

os_automation

gistpad-mcp

3.6

by lostintangent

GistPad MCP is a server for managing and sharing personal knowledge, daily notes, and reusable prompts via GitHub Gists, integrated with VS Code and GistPad.dev.

knowledge_and_memory

mcp-text-editor

3.6

by tumf

A Model Context Protocol (MCP) server that provides line-oriented text file editing capabilities through a standardized API. Optimized for LLM tools with efficient partial file access to minimize token usage.

file_systems

google-workspace-mcp

3.6

by aaronsb

The Google Workspace MCP Server is a Model Context Protocol server that allows users to manage their Google Workspace, including Gmail, Calendar, and Drive, through a secure and efficient interface.

cloud_platforms

UnityMCPIntegration

3.6

by quazaai

This package provides a seamless integration between Model Context Protocol (MCP) and Unity Editor, allowing AI assistants to understand and interact with Unity projects in real-time.

developer_tools

ultimate_mcp_server

3.5

by Dicklesworthstone

Ultimate MCP Server is a comprehensive AI agent operating system that provides advanced capabilities for cognitive augmentation, tool use, and intelligent orchestration.

ai_chatbot

obsidian-mcp

3.5

by newtype-01

Obsidian MCP (Model Context Protocol) 服务器用于连接 AI 模型与 Obsidian 知识库，支持笔记和文件夹的管理操作。

knowledge_and_memory

obsidian-mcp-rest

3.5

by PublikPrinciple

An MCP server implementation that provides access to Obsidian vaults through a local REST API.

file_systems

vertex-ai-mcp-server

3.5

by shariqriazz

This project implements a Model Context Protocol (MCP) server that provides a comprehensive suite of tools for interacting with Google Cloud's Vertex AI Gemini models, focusing on coding assistance and general query answering.

ai_chatbot

aws-sa-tools-mcp-server

3.5

by Havoc24k

A Model Context Protocol (MCP) server that provides tools to interact with AWS services.

cloud_platforms

mcp-server-multiverse

3.5

by lamemind

A middleware server that enables multiple isolated instances of the same MCP servers to coexist independently with unique namespaces and configurations.

file_systems

pdf-reader-mcp

3.5

by sylphxltd

Empower your AI agents with the ability to securely read and extract information from PDF files using the PDF Reader MCP Server.

file_systems

mcp-server

3.5

by Apillon

The Apillon MCP Server provides modules for Storage, Hosting, and NFT functionalities using the Model Context Protocol (MCP).

cloud_storage

choturobo

3.5

by vishalmysore

Chotu Robo Server is an MCP server designed for Arduino-based robotics, integrating AI with hardware components for remote control and automation.

ai_chatbot

ebook-mcp

3.5

by onebirdrocks

Ebook-MCP is a Model Context Protocol server designed for processing electronic books, supporting EPUB and PDF formats.

file_systems

wordpress-mcp-server

3.5

by prathammanocha

A comprehensive Model Context Protocol (MCP) server for interacting with WordPress sites via the REST API.

communication

rust-mcp-filesystem

3.5

by rust-mcp-stack

Rust MCP Filesystem is a high-performance, asynchronous MCP server for efficient filesystem operations, rewritten in Rust for enhanced capabilities.

file_systems