MiniMax-AI/MiniMax-MCP

4.3

MiniMax-MCP is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.

If you are the rightful owner of MiniMax-MCP and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to dayong@mcphub.com.

Official MiniMax Model Context Protocol (MCP) server for interaction with Text to Speech and video/image generation APIs.

Try MiniMax-MCP with chat:

Tools

Functions exposed to the LLM to take actions

text_to_audio

Convert text to audio with a given voice and save the output audio file to a given directory. Directory is optional, if not provided, the output file will be saved to $HOME/Desktop. Voice id is optional, if not provided, the default voice will be used.

COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user.

Args:
    text (str): The text to convert to speech.
    voice_id (str, optional): The id of the voice to use. For example, "male-qn-qingse"/"audiobook_female_1"/"cute_boy"/"Charming_Lady"...
    model (string, optional): The model to use.
    speed (float, optional): Speed of the generated audio. Controls the speed of the generated speech. Values range from 0.5 to 2.0, with 1.0 being the default speed. 
    vol (float, optional): Volume of the generated audio. Controls the volume of the generated speech. Values range from 0 to 10, with 1 being the default volume.
    pitch (int, optional): Pitch of the generated audio. Controls the speed of the generated speech. Values range from -12 to 12, with 0 being the default speed.
    emotion (str, optional): Emotion of the generated audio. Controls the emotion of the generated speech. Values range ["happy", "sad", "angry", "fearful", "disgusted", "surprised", "neutral"], with "happy" being the default emotion.
    sample_rate (int, optional): Sample rate of the generated audio. Controls the sample rate of the generated speech. Values range [8000,16000,22050,24000,32000,44100] with 32000 being the default sample rate.
    bitrate (int, optional): Bitrate of the generated audio. Controls the bitrate of the generated speech. Values range [32000,64000,128000,256000] with 128000 being the default bitrate.
    channel (int, optional): Channel of the generated audio. Controls the channel of the generated speech. Values range [1, 2] with 1 being the default channel.
    format (str, optional): Format of the generated audio. Controls the format of the generated speech. Values range ["pcm", "mp3","flac"] with "mp3" being the default format.
    language_boost (str, optional): Language boost of the generated audio. Controls the language boost of the generated speech. Values range ['Chinese', 'Chinese,Yue', 'English', 'Arabic', 'Russian', 'Spanish', 'French', 'Portuguese', 'German', 'Turkish', 'Dutch', 'Ukrainian', 'Vietnamese', 'Indonesian', 'Japanese', 'Italian', 'Korean', 'Thai', 'Polish', 'Romanian', 'Greek', 'Czech', 'Finnish', 'Hindi', 'auto'] with "auto" being the default language boost.
    output_directory (str): The directory to save the audio to.

Returns:
    Text content with the path to the output file and name of the voice used.

list_voices

List all voices available.

Args:
    voice_type (str, optional): The type of voices to list. Values range ["all", "system", "voice_cloning"], with "all" being the default.
Returns:
    Text content with the list of voices.

voice_clone

Clone a voice using provided audio files. The new voice will be charged upon first use.

COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user.

 Args:
    voice_id (str): The id of the voice to use.
    file (str): The path to the audio file to clone or a URL to the audio file.
    text (str, optional): The text to use for the demo audio.
    is_url (bool, optional): Whether the file is a URL. Defaults to False.
    output_directory (str): The directory to save the demo audio to.
Returns:
    Text content with the voice id of the cloned voice.

play_audio

Play an audio file. Supports WAV and MP3 formats. Not supports video.

 Args:
    input_file_path (str): The path to the audio file to play.
    is_url (bool, optional): Whether the audio file is a URL.
Returns:
    Text content with the path to the audio file.

generate_video

Generate a video from a prompt.

COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user.

 Args:
    model (str, optional): The model to use. Values range ["T2V-01", "T2V-01-Director", "I2V-01", "I2V-01-Director", "I2V-01-live", "MiniMax-Hailuo-02"]. "Director" supports inserting instructions for camera movement control. "I2V" for image to video. "T2V" for text to video. "MiniMax-Hailuo-02" is the latest model with best effect, ultra-clear quality and precise response.
    prompt (str): The prompt to generate the video from. When use Director model, the prompt supports 15 Camera Movement Instructions (Enumerated Values)
        -Truck: [Truck left], [Truck right]
        -Pan: [Pan left], [Pan right]
        -Push: [Push in], [Pull out]
        -Pedestal: [Pedestal up], [Pedestal down]
        -Tilt: [Tilt up], [Tilt down]
        -Zoom: [Zoom in], [Zoom out]
        -Shake: [Shake]
        -Follow: [Tracking shot]
        -Static: [Static shot]
    first_frame_image (str): The first frame image. The model must be "I2V" Series.
    duration (int, optional): The duration of the video. The model must be "MiniMax-Hailuo-02". Values can be 6 and 10.
    resolution (str, optional): The resolution of the video. The model must be "MiniMax-Hailuo-02". Values range ["768P", "1080P"]
    output_directory (str): The directory to save the video to.
    async_mode (bool, optional): Whether to use async mode. Defaults to False. If True, the video generation task will be submitted asynchronously and the response will return a task_id. Should use `query_video_generation` tool to check the status of the task and get the result.
Returns:
    Text content with the path to the output video file.

query_video_generation

Query the status of a video generation task.

Args:
    task_id (str): The task ID to query. Should be the task_id returned by `generate_video` tool if `async_mode` is True.
    output_directory (str): The directory to save the video to.
Returns:
    Text content with the status of the task.

text_to_image

Generate a image from a prompt.

COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user.

 Args:
    model (str, optional): The model to use. Values range ["image-01"], with "image-01" being the default.
    prompt (str): The prompt to generate the image from.
    aspect_ratio (str, optional): The aspect ratio of the image. Values range ["1:1", "16:9","4:3", "3:2", "2:3", "3:4", "9:16", "21:9"], with "1:1" being the default.
    n (int, optional): The number of images to generate. Values range [1, 9], with 1 being the default.
    prompt_optimizer (bool, optional): Whether to optimize the prompt. Values range [True, False], with True being the default.
    output_directory (str): The directory to save the image to.
Returns:
    Text content with the path to the output image file.

music_generation

Create a music generation task using AI models. Generate music from prompt and lyrics.

COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user.

Args:
    prompt (str): Music creation inspiration describing style, mood, scene, etc.
        Example: "Pop music, sad, suitable for rainy nights". Character range: [10, 300]
    lyrics (str): Song lyrics for music generation.
        Use newline (\n) to separate each line of lyrics. Supports lyric structure tags [Intro][Verse][Chorus][Bridge][Outro] 
        to enhance musicality. Character range: [10, 600] (each Chinese character, punctuation, and letter counts as 1 character)
    stream (bool, optional): Whether to enable streaming mode. Defaults to False
    sample_rate (int, optional): Sample rate of generated music. Values: [16000, 24000, 32000, 44100]
    bitrate (int, optional): Bitrate of generated music. Values: [32000, 64000, 128000, 256000]
    format (str, optional): Format of generated music. Values: ["mp3", "wav", "pcm"]. Defaults to "mp3"
    output_directory (str, optional): Directory to save the generated music file
    
Note: Currently supports generating music up to 1 minute in length.

Returns:
    Text content with the path to the generated music file or generation status.

voice_design

Generate a voice based on description prompts.

COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user.

 Args:
    prompt (str): The prompt to generate the voice from.
    preview_text (str): The text to preview the voice.
    voice_id (str, optional): The id of the voice to use. For example, "male-qn-qingse"/"audiobook_female_1"/"cute_boy"/"Charming_Lady"...
    output_directory (str, optional): The directory to save the voice to.
Returns:
    Text content with the path to the output voice file.

Prompts

Interactive templates invoked by user choice

No prompts

Resources

Contextual data attached and managed by the client

No resources

Author

MiniMax-AI

Claim Ownership

Verify you have write access to the repository

Repository

https://github.com/MiniMax-AI/MiniMax-MCP

Homepage

https://www.minimax.io/platform

GitHub Stars

1,153

License

MIT License

Last publish date

2025-04-10

Last update date

2026-01-06

Server configs

in claude desktop

{
  "mcpServers": {
    "MiniMax": {
      "command": "uvx",
      "args": [
        "minimax-mcp",
        "-y"
      ],
      "env": {
        "MINIMAX_API_KEY": "insert-your-api-key-here",
        "MINIMAX_MCP_BASE_PATH": "local-output-dir-path, such as /User/xxx/Desktop",
        "MINIMAX_API_HOST": "api host, https://api.minimax.io | https://api.minimaxi.com",
        "MINIMAX_API_RESOURCE_MODE": "optional, [url|local], url is default, audio/image/video are downloaded locally or provided in URL format"
      }
    }
  }
}

in cursor

Go to `Cursor -> Preferences -> Cursor Settings -> MCP -> Add new global MCP Server` to add above config.

Top Comments

Related MCP Servers

View all ai_chatbot servers →

google-docs-mcp

4.8

by a-bonus

The Ultimate Google Docs MCP Server connects Claude Desktop or other MCP clients to Google Docs, enabling advanced document manipulation through the Model Context Protocol (MCP) and the fastmcp library.

ai_chatbot

biomcp

4.7

by genomoncology

BioMCP is an open-source toolkit designed to enhance AI assistants with specialized biomedical knowledge by connecting them to authoritative biomedical data sources.

research_and_data

elevenlabs-mcp

4.6

by elevenlabs

Official ElevenLabs Model Context Protocol (MCP) server for interaction with Text to Speech and audio processing APIs.

ai_chatbot

mindsdb

4.6

by mindsdb

MindsDB is an open-source server that enables seamless interaction with large-scale federated data using the Model Context Protocol (MCP).

databases

tavily-mcp

4.6

by tavily-ai

The Tavily MCP server is a Model Context Protocol server that integrates with AI systems to provide real-time web search and data extraction capabilities.

browser_automation

mcp-notion-server

4.5

by suekou

MCP Server for the Notion API, enabling LLM to interact with Notion workspaces. Additionally, it employs Markdown conversion to reduce context size when communicating with LLMs, optimizing token usage and making interactions more efficient.

databases

mcp-hfspace

4.5

by evalstate

mcp-hfspace MCP Server connects to Hugging Face Spaces with minimal setup, providing Image Generation capabilities to Claude Desktop.

ai_chatbot

mcp-trends-hub

4.4

by baranwang

Trends Hub is a one-stop aggregation service for trending topics across the web, based on the Model Context Protocol (MCP).

ai_chatbot

pg-mcp-server

4.4

by stuzero

unknown

databases

HowToCook-mcp

4.3

by worryzyy

HowToCook-MCP Server is a Model Context Protocol server that transforms AI assistants into personal chefs, helping users plan meals and recommend recipes.

ai_chatbot

runno MCP

4.3

by taybenlor

`@runno/mcp` is a Model Context Protocol server that provides a secure code execution environment for AI assistants.

ai_chatbot

jinni

4.2

by smat-dev

Jinni is a tool designed to efficiently provide Large Language Models (LLMs) with the context of your projects by consolidating relevant project files.

developer_tools

mcp-server-gemini

4.2

by aliargun

Gemini MCP Server is a Model Context Protocol server implementation that allows Claude Desktop to interact with Google's Gemini AI models.

ai_chatbot

think-mcp-server

4.1

by PhillipRt

The Think Tool MCP Server is an official implementation of Anthropic's 'think' tool, designed to enhance Claude's reasoning capabilities by providing a structured space for complex problem-solving.

ai_chatbot

python-mcp-server-client

4.1

by GobinFan

MCP Server is a server implementing the Model Context Protocol (MCP) to provide a standardized interface for AI models, connecting external data sources and tools like file systems, databases, or APIs.

ai_chatbot

rails-mcp-server

4.1

by maquina-app

A Ruby implementation of a Model Context Protocol (MCP) server for Rails projects, allowing LLMs to interact with Rails projects.

developer_tools

ai-agent-marketplace-index-mcp

4.1

by AI-Agent-Hub

MCP Server for AI Agent Marketplace Index from DeepNLP, allowing AI assistants to search available AI agents by keywords or categories.

browser_automation

backlog-mcp-server

4.1

by nulab

A Model Context Protocol (MCP) server for interacting with the Backlog API, providing tools for managing projects, issues, wiki pages, and more through AI agents.

developer_tools

rapidocr-mcp

4.1

by z4none

RapidOCR MCP Server is a Model Context Protocol server that provides an easy-to-use OCR interface.

ai_chatbot

mcp-perplexity

4.1

by daniel-lxs

The Perplexity MCP Server provides a Python-based interface to the Perplexity API, offering tools for querying responses, maintaining chat history, and managing conversations.

communication

vertex-ai-mcp-server

4.0

by shariqriazz

This project implements a Model Context Protocol (MCP) server that provides a comprehensive suite of tools for interacting with Google Cloud's Vertex AI Gemini models, focusing on coding assistance and general query answering.

ai_chatbot

mem0-mcp

4.0

by pinkpixel-dev

A Model Context Protocol (MCP) server that integrates with Mem0.ai to provide persistent memory capabilities for LLMs.

ai_chatbot

google-search

4.0

by web-agent-master

A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches and extract results.

browser_automation

drawio-mcp-server

4.0

by lgazo

The Draw.io MCP server is a Model Context Protocol implementation that integrates Draw.io's diagramming capabilities with AI agentic systems.

ai_chatbot

gateway

4.0

by centralmind

CentralMind Gateway is a tool designed to expose databases to AI agents via MCP or OpenAPI protocols, providing secure, LLM-optimized APIs.

databases

mcp-tavily

3.9

by RamXX

Tavily MCP Server is a Model Context Protocol server that provides AI-powered web search capabilities using Tavily's search API.

research_and_data

magic-mcp

3.8

by 21st-dev

Magic Component Platform (MCP) is a powerful AI-driven tool that helps developers create beautiful, modern UI components instantly through natural language descriptions.

developer_tools