openai-ocr-mcp
If you are the rightful owner of openai-ocr-mcp and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
A Model Context Protocol (MCP) server that provides OCR functionality using OpenAI's vision capabilities.
The OpenAI OCR MCP Server is designed to facilitate Optical Character Recognition (OCR) by leveraging OpenAI's advanced vision models. It integrates seamlessly with Cursor IDE, allowing users to extract text from images efficiently. The server supports multiple image formats and ensures organized file management through content-based hashing. It also features robust error handling and detailed logging to aid in troubleshooting. The server is optimized for high-detail image analysis and processes images through OpenAI's vision API, making it a powerful tool for text extraction tasks.
Features
- Image Text Extraction: Extract text from various image formats using OpenAI's GPT-4.1-mini vision model.
- Automatic Text File Creation: Automatically saves extracted text alongside the source image.
- Content-Based File Naming: Uses unique content hashing for organized file management.
- Multiple Image Format Support: Supports JPG, PNG, GIF, and WebP formats.
- Robust Error Handling: Comprehensive validation and error reporting.