openai-ocr-mcp

3.2

If you are the rightful owner of openai-ocr-mcp and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

A Model Context Protocol (MCP) server that provides OCR functionality using OpenAI's vision capabilities.

The OpenAI OCR MCP Server is designed to facilitate Optical Character Recognition (OCR) by leveraging OpenAI's advanced vision models. It integrates seamlessly with Cursor IDE, allowing users to extract text from images efficiently. The server supports multiple image formats and ensures organized file management through content-based hashing. It also features robust error handling and detailed logging to aid in troubleshooting. The server is optimized for high-detail image analysis and processes images through OpenAI's vision API, making it a powerful tool for text extraction tasks.

Features

Image Text Extraction: Extract text from various image formats using OpenAI's GPT-4.1-mini vision model.
Automatic Text File Creation: Automatically saves extracted text alongside the source image.
Content-Based File Naming: Uses unique content hashing for organized file management.
Multiple Image Format Support: Supports JPG, PNG, GIF, and WebP formats.
Robust Error Handling: Comprehensive validation and error reporting.

Related MCP Servers

View all ai_chatbot servers →

biomcp

4.6

by genomoncology

BioMCP is an open-source toolkit designed to enhance AI assistants with specialized biomedical knowledge by connecting them to authoritative biomedical data sources.

research_and_data

elevenlabs-mcp

4.6

by elevenlabs

Official ElevenLabs Model Context Protocol (MCP) server for interaction with Text to Speech and audio processing APIs.

ai_chatbot

repomix

4.5

by yamadashy

Repomix is a tool that packs your codebase into AI-friendly formats, making it easier to use with AI tools like LLMs.

developer_tools

mindsdb

4.5

by mindsdb

MindsDB is an open-source server that enables seamless interaction with large-scale federated data using the Model Context Protocol (MCP).

databases

mcp-notion-server

4.3

by suekou

MCP Server for the Notion API, enabling LLM to interact with Notion workspaces. Additionally, it employs Markdown conversion to reduce context size when communicating with LLMs, optimizing token usage and making interactions more efficient.

databases

MiniMax-MCP

4.3

by MiniMax-AI

Official MiniMax Model Context Protocol (MCP) server for interaction with Text to Speech and video/image generation APIs.

ai_chatbot

whois-mcp

4.3

by bharathvaj-ganesan

The Whois MCP server allows AI agents to perform WHOIS lookups to retrieve domain details.

research_and_data

mcp-server-reddit

4.1

by Hawstein

A Model Context Protocol server providing access to Reddit public API for LLMs.

ai_chatbot

rapidocr-mcp

4.0

by z4none

RapidOCR MCP Server is a Model Context Protocol server that provides an easy-to-use OCR interface.

ai_chatbot

gateway

4.0

by centralmind

CentralMind Gateway is a tool designed to expose databases to AI agents via MCP or OpenAPI protocols, providing secure, LLM-optimized APIs.

databases

runno MCP

3.9

by taybenlor

`@runno/mcp` is a Model Context Protocol server that provides a secure code execution environment for AI assistants.

ai_chatbot

blender-mcp

3.8

by ahujasid

BlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), enabling prompt-assisted 3D modeling, scene creation, and manipulation.

entertainment_and_media

unity-mcp

3.8

by justinpbarnett

Unity MCP connects your Unity Editor to LLMs using the Model Context Protocol, enabling AI assistants to interact with Unity for asset management, scene control, script editing, and task automation.

ai_chatbot

Cua Agent

3.8

by trycua

cua-mcp-server is a Model Context Protocol (MCP) server for the Computer-Use Agent (CUA), enabling integration with Claude Desktop and other MCP clients.

developer_tools

mcp-server-whisper

3.8

by arcaputo3

MCP Server Whisper is a Model Context Protocol server designed for advanced audio transcription and processing using OpenAI's Whisper and GPT-4o models.

ai_chatbot

azure-mcp

3.8

by Azure

The Azure MCP Server implements the MCP specification to create a seamless connection between AI agents and Azure services.

cloud_platforms

unreal-mcp

3.8

by chongdashu

This project enables AI assistant clients like Cursor, Windsurf, and Claude Desktop to control Unreal Engine through natural language using the Model Context Protocol (MCP).

developer_tools

hyper-mcp

3.7

by tuananh

hyper-mcp is a fast, secure MCP server that extends its capabilities through WebAssembly plugins.

ai_chatbot

Tianji

3.7

by msgbyte

A server based on Model Context Protocol (MCP) that provides tools for interacting with the Tianji platform.

ai_chatbot

codemcp

3.7

by ezyang

codemcp is a tool that integrates with Claude Desktop to provide a pair programming assistant, allowing direct code editing and testing.

developer_tools

minima

3.7

by dmayboroda

Minima is an open-source RAG on-premises container solution that integrates with ChatGPT and MCP, allowing for fully local or hybrid installations.

ai_chatbot

MCP-Chinese-Getting-Started-Guide

3.7

by liaokongVFX

The Model Context Protocol (MCP) is a groundbreaking open-source protocol enabling large language models to seamlessly connect with various external data sources and tools. It provides a standardized method for AI models to interact with their environment, much like a USB-C interface for AI applications.

ai_chatbot

MCP Component Server

3.7

by baidubce

The MCP Component Server is a FastMCP server-based implementation that converts AppBuilder Components into FastMCP tools, enabling seamless integration of Baidu cloud AI services into MCP-compatible environments.

cloud_platforms

chroma-mcp

3.7

by chroma-core

Chroma MCP Server is an open-source embedding database server that facilitates the integration of LLM applications with external data sources using the Model Context Protocol.

databases

rails-mcp-server

3.7

by maquina-app

A Ruby implementation of a Model Context Protocol (MCP) server for Rails projects, allowing LLMs to interact with Rails projects.

developer_tools

jinni

3.7

by smat-dev

Jinni is a tool designed to efficiently provide Large Language Models (LLMs) with the context of your projects by consolidating relevant project files.

developer_tools

obsidian-mcp-tools

3.6

by jacksteamdev

MCP Tools for Obsidian enables AI applications like Claude Desktop to securely access and work with your Obsidian vault through the Model Context Protocol (MCP).

ai_chatbot