DINO-X-MCP

IDEA-Research/DINO-X-MCP

3.5

If you are the rightful owner of DINO-X-MCP and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

DINO-X MCP enables large language models to perform fine-grained object detection and image understanding, powered by DINO-X and Grounding DINO 1.6 API.

Tools

Functions exposed to the LLM to take actions

detect-all-objects

Detects and localizes all recognizable objects in an image.

object-detection-by-text

Detects and localizes objects in an image based on a natural language prompt.

detect-human-pose-keypoints

Detects 17 human body keypoints per person in an image for pose estimation.

visualize-detections

Visualizes detection results by drawing bounding boxes and labels on the image.

Prompts

Interactive templates invoked by user choice

No prompts

Resources

Contextual data attached and managed by the client

No resources