mcp-pdf2md

MCP-PDF2MD is a PDF-to-Markdown conversion service built on the MinerU API. It supports batch processing of both local files and URL links.

MCP-PDF2MD converts PDF documents into structured Markdown. It uses the MinerU API to extract text, images, and layout information from PDF files, and it accepts both local files and URL links. Several documents can be converted in a single batch, which helps when handling large volumes of files. The service integrates with LLM clients such as Claude Desktop, and its intelligent processing feature automatically selects the conversion method for each input. The output preserves the original document structure, including headings, paragraphs, and lists, so the Markdown stays accurate and human-readable. The service also converts formulas to LaTeX, extracts tables, and applies cleanup optimization to remove unnecessary elements such as headers and footers.

Features

  • Format Conversion: Convert PDF files to structured Markdown format.
  • Multi-source Support: Process both local PDF files and URL links.
  • Batch Processing: Support multi-file batch conversion for efficient handling of large volumes of PDF files.
  • Structure Preservation: Maintain the original document structure, including headings, paragraphs, lists, etc.
  • Formula Conversion: Automatically recognize and convert formulas in the document to LaTeX format.
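
Because the service accepts both local files and URL links and can process them in batches, a client typically needs to sort a mixed list of inputs before conversion. The following is a minimal sketch of that step, not the service's own logic: the helper name split_sources is hypothetical, and the rule "only http(s) sources count as URLs" is an assumption.

```python
from urllib.parse import urlparse


def split_sources(sources):
    """Partition a mixed batch into (urls, local_files).

    Hypothetical helper: assumes only http(s) sources are URLs and
    treats every other non-empty entry as a local file path.
    """
    urls, files = [], []
    for src in sources:
        src = src.strip()
        if not src:
            continue  # skip blank entries
        if urlparse(src).scheme in ("http", "https"):
            urls.append(src)
        else:
            files.append(src)
    return urls, files


urls, files = split_sources([
    "https://example.com/paper.pdf",
    "reports/q1.pdf",
    "  ",
    "http://example.org/manual.pdf",
])
```

Each resulting list can then be passed to whichever conversion tool handles that kind of source.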

Tools

  1. convert_pdf_url

    Convert a PDF at a URL to Markdown.

  2. convert_pdf_file

    Convert a local PDF file to Markdown.
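
To illustrate how an MCP client might invoke these two tools, the sketch below builds the JSON-RPC "tools/call" requests that the Model Context Protocol uses for tool invocation. The tool names come from the list above; the argument names url and file_path, the example URL and path, and the helper build_tool_call are assumptions, since the listing does not document the tools' parameters. Check the server's published tool schema before relying on them.

```python
import json
from itertools import count

_request_ids = count(1)  # simple incrementing JSON-RPC request ids


def build_tool_call(tool_name, arguments):
    """Build an MCP tools/call request (hypothetical helper)."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Argument names below are assumed, not taken from the server's schema.
url_request = build_tool_call(
    "convert_pdf_url", {"url": "https://example.com/paper.pdf"}
)
file_request = build_tool_call(
    "convert_pdf_file", {"file_path": "/path/to/paper.pdf"}
)

print(json.dumps(url_request, indent=2))
```

In practice an MCP client library or an LLM client such as Claude Desktop constructs and sends these messages itself; the sketch only shows the shape of the request.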