IDEA-Research/DINO-X-MCP
If you are the rightful owner of DINO-X-MCP and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
DINO-X MCP enables large language models to perform fine-grained object detection and image understanding, powered by DINO-X and Grounding DINO 1.6 API.
Tools
Functions exposed to the LLM to take actions
detect-all-objects
Detects and localizes all recognizable objects in an image.
object-detection-by-text
Detects and localizes objects in an image based on a natural language prompt.
detect-human-pose-keypoints
Detects 17 human body keypoints per person in an image for pose estimation.
visualize-detections
Visualizes detection results by drawing bounding boxes and labels on the image.
Prompts
Interactive templates invoked by user choice
No prompts
Resources
Contextual data attached and managed by the client