vision-agent-mcp

landing-ai/vision-agent-mcp

VisionAgent MCP Server is a lightweight sidecar server that connects MCP-compatible clients to Landing AI’s VisionAgent REST APIs, so computer-vision and document-analysis tasks can be issued as natural-language commands.

Tools

Functions exposed to the LLM to take actions

agentic-document-analysis

Parses PDFs and images to extract text, tables, charts, and diagrams.

text-to-object-detection

Detects objects from free-form text prompts and outputs bounding boxes.

text-to-instance-segmentation

Produces pixel-perfect instance masks for objects in images.

activity-recognition

Recognizes multiple activities in video with start/end timestamps.

depth-pro

High-resolution monocular depth estimation for single images.
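
As a rough illustration of how an MCP client might address one of the tools listed above, the sketch below builds a `tools/call` request in the JSON-RPC 2.0 shape that MCP uses. The tool names come from this listing; the argument names (`imagePath`, `prompts`) and the helper `build_tool_call` are assumptions made for the example, not the server's documented schema. A real client would read each tool's input schema from the server's `tools/list` response.

```python
import json
from itertools import count

# Tool names exactly as they appear in this listing.
TOOLS = {
    "agentic-document-analysis",
    "text-to-object-detection",
    "text-to-instance-segmentation",
    "activity-recognition",
    "depth-pro",
}

_request_ids = count(1)  # JSON-RPC requests carry a unique id


def build_tool_call(tool: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message.

    Hypothetical helper: it only builds the message and does not send it.
    """
    if tool not in TOOLS:
        raise ValueError(f"unknown tool: {tool}")
    return json.dumps(
        {
            "jsonrpc": "2.0",
            "id": next(_request_ids),
            "method": "tools/call",
            "params": {"name": tool, "arguments": arguments},
        }
    )


# Argument names below are illustrative assumptions, not the real schema.
request = build_tool_call(
    "text-to-object-detection",
    {"imagePath": "/tmp/street.jpg", "prompts": ["bicycle"]},
)
print(request)
```

In practice an MCP client library handles this framing and the transport; the sketch only shows which tool name and arguments end up in the message.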

Prompts

Interactive templates invoked by user choice

No prompts

Resources

Contextual data attached and managed by the client

No resources