pdfbox-mcp by amannm - MCP Server

PDFBox MCP Server

A Model Context Protocol (MCP) server that provides PDF processing capabilities using Apache PDFBox.

extract_text - Extract text content from PDF files with optional page range support
get_metadata - Extract PDF metadata (title, author, creation date, etc.)
get_page_count - Get the total number of pages in a PDF file

Build the project:

mvn compile

Run the server:

mvn exec:java

Extract text content from a PDF file.

Parameters:

Extract metadata from a PDF file including title, author, creation date, etc.

Parameters:

Get the number of pages in a PDF file.

Parameters:

Currently in development. The core functionality is implemented but compilation requires MCP SDK API adjustments.