MiniMax-MCP

4.3

MiniMax-MCP is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.

If you are the rightful owner of MiniMax-MCP and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

Official MiniMax Model Context Protocol (MCP) server for interaction with Text to Speech and video/image generation APIs.

text_to_audio

Convert text to audio with a given voice and save the output audio file to a given directory. Directory is optional, if not provided, the output file will be saved to $HOME/Desktop. Voice id is optional, if not provided, the default voice will be used. COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user. Args: text (str): The text to convert to speech. voice_id (str, optional): The id of the voice to use. For example, "male-qn-qingse"/"audiobook_female_1"/"cute_boy"/"Charming_Lady"... model (string, optional): The model to use. speed (float, optional): Speed of the generated audio. Controls the speed of the generated speech. Values range from 0.5 to 2.0, with 1.0 being the default speed. vol (float, optional): Volume of the generated audio. Controls the volume of the generated speech. Values range from 0 to 10, with 1 being the default volume. pitch (int, optional): Pitch of the generated audio. Controls the speed of the generated speech. Values range from -12 to 12, with 0 being the default speed. emotion (str, optional): Emotion of the generated audio. Controls the emotion of the generated speech. Values range ["happy", "sad", "angry", "fearful", "disgusted", "surprised", "neutral"], with "happy" being the default emotion. sample_rate (int, optional): Sample rate of the generated audio. Controls the sample rate of the generated speech. Values range [8000,16000,22050,24000,32000,44100] with 32000 being the default sample rate. bitrate (int, optional): Bitrate of the generated audio. Controls the bitrate of the generated speech. Values range [32000,64000,128000,256000] with 128000 being the default bitrate. channel (int, optional): Channel of the generated audio. Controls the channel of the generated speech. Values range [1, 2] with 1 being the default channel. format (str, optional): Format of the generated audio. Controls the format of the generated speech. Values range ["pcm", "mp3","flac"] with "mp3" being the default format. language_boost (str, optional): Language boost of the generated audio. Controls the language boost of the generated speech. Values range ['Chinese', 'Chinese,Yue', 'English', 'Arabic', 'Russian', 'Spanish', 'French', 'Portuguese', 'German', 'Turkish', 'Dutch', 'Ukrainian', 'Vietnamese', 'Indonesian', 'Japanese', 'Italian', 'Korean', 'Thai', 'Polish', 'Romanian', 'Greek', 'Czech', 'Finnish', 'Hindi', 'auto'] with "auto" being the default language boost. output_directory (str): The directory to save the audio to. Returns: Text content with the path to the output file and name of the voice used.

Try it

Result:

list_voices

List all voices available. Args: voice_type (str, optional): The type of voices to list. Values range ["all", "system", "voice_cloning"], with "all" being the default. Returns: Text content with the list of voices.

Try it

Result:

voice_clone

Clone a voice using provided audio files. The new voice will be charged upon first use. COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user. Args: voice_id (str): The id of the voice to use. file (str): The path to the audio file to clone or a URL to the audio file. text (str, optional): The text to use for the demo audio. is_url (bool, optional): Whether the file is a URL. Defaults to False. output_directory (str): The directory to save the demo audio to. Returns: Text content with the voice id of the cloned voice.

Try it

Result:

play_audio

Play an audio file. Supports WAV and MP3 formats. Not supports video. Args: input_file_path (str): The path to the audio file to play. is_url (bool, optional): Whether the audio file is a URL. Returns: Text content with the path to the audio file.

Try it

Result:

generate_video

Generate a video from a prompt. COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user. Args: model (str, optional): The model to use. Values range ["T2V-01", "T2V-01-Director", "I2V-01", "I2V-01-Director", "I2V-01-live", "MiniMax-Hailuo-02"]. "Director" supports inserting instructions for camera movement control. "I2V" for image to video. "T2V" for text to video. "MiniMax-Hailuo-02" is the latest model with best effect, ultra-clear quality and precise response. prompt (str): The prompt to generate the video from. When use Director model, the prompt supports 15 Camera Movement Instructions (Enumerated Values) -Truck: [Truck left], [Truck right] -Pan: [Pan left], [Pan right] -Push: [Push in], [Pull out] -Pedestal: [Pedestal up], [Pedestal down] -Tilt: [Tilt up], [Tilt down] -Zoom: [Zoom in], [Zoom out] -Shake: [Shake] -Follow: [Tracking shot] -Static: [Static shot] first_frame_image (str): The first frame image. The model must be "I2V" Series. duration (int, optional): The duration of the video. The model must be "MiniMax-Hailuo-02". Values can be 6 and 10. resolution (str, optional): The resolution of the video. The model must be "MiniMax-Hailuo-02". Values range ["768P", "1080P"] output_directory (str): The directory to save the video to. async_mode (bool, optional): Whether to use async mode. Defaults to False. If True, the video generation task will be submitted asynchronously and the response will return a task_id. Should use `query_video_generation` tool to check the status of the task and get the result. Returns: Text content with the path to the output video file.

Try it

Result:

query_video_generation

Query the status of a video generation task. Args: task_id (str): The task ID to query. Should be the task_id returned by `generate_video` tool if `async_mode` is True. output_directory (str): The directory to save the video to. Returns: Text content with the status of the task.

Try it

Result:

text_to_image

Generate a image from a prompt. COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user. Args: model (str, optional): The model to use. Values range ["image-01"], with "image-01" being the default. prompt (str): The prompt to generate the image from. aspect_ratio (str, optional): The aspect ratio of the image. Values range ["1:1", "16:9","4:3", "3:2", "2:3", "3:4", "9:16", "21:9"], with "1:1" being the default. n (int, optional): The number of images to generate. Values range [1, 9], with 1 being the default. prompt_optimizer (bool, optional): Whether to optimize the prompt. Values range [True, False], with True being the default. output_directory (str): The directory to save the image to. Returns: Text content with the path to the output image file.

Try it

Result:

music_generation

Create a music generation task using AI models. Generate music from prompt and lyrics. COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user. Args: prompt (str): Music creation inspiration describing style, mood, scene, etc. Example: "Pop music, sad, suitable for rainy nights". Character range: [10, 300] lyrics (str): Song lyrics for music generation. Use newline (\n) to separate each line of lyrics. Supports lyric structure tags [Intro][Verse][Chorus][Bridge][Outro] to enhance musicality. Character range: [10, 600] (each Chinese character, punctuation, and letter counts as 1 character) stream (bool, optional): Whether to enable streaming mode. Defaults to False sample_rate (int, optional): Sample rate of generated music. Values: [16000, 24000, 32000, 44100] bitrate (int, optional): Bitrate of generated music. Values: [32000, 64000, 128000, 256000] format (str, optional): Format of generated music. Values: ["mp3", "wav", "pcm"]. Defaults to "mp3" output_directory (str, optional): Directory to save the generated music file Note: Currently supports generating music up to 1 minute in length. Returns: Text content with the path to the generated music file or generation status.

Try it

Result:

voice_design

Generate a voice based on description prompts. COST WARNING: This tool makes an API call to Minimax which may incur costs. Only use when explicitly requested by the user. Args: prompt (str): The prompt to generate the voice from. preview_text (str): The text to preview the voice. voice_id (str, optional): The id of the voice to use. For example, "male-qn-qingse"/"audiobook_female_1"/"cute_boy"/"Charming_Lady"... output_directory (str, optional): The directory to save the voice to. Returns: Text content with the path to the output voice file.

Try it

Result:

Related MCP Servers

View all ai_chatbot servers →

claude-task-master

4.8

by eyaltoledano

Task Master is a task management system for AI-driven development with Claude, designed to work seamlessly with Cursor AI.

ai_chatbot

biomcp

4.6

by genomoncology

BioMCP is an open-source toolkit designed to enhance AI assistants with specialized biomedical knowledge by connecting them to authoritative biomedical data sources.

research_and_data

repomix

4.5

by yamadashy

Repomix is a tool that packs your codebase into AI-friendly formats, making it easier to use with AI tools like LLMs.

developer_tools

pg-mcp-server

4.4

by stuzero

unknown

databases

mcp-notion-server

4.3

by suekou

MCP Server for the Notion API, enabling LLM to interact with Notion workspaces. Additionally, it employs Markdown conversion to reduce context size when communicating with LLMs, optimizing token usage and making interactions more efficient.

databases

Sequential Thinking

4.3

by modelcontextprotocol

An MCP server implementation that provides a tool for dynamic and reflective problem-solving through a structured thinking process.

research_and_data

ai-agent-marketplace-index-mcp

4.1

by AI-Agent-Hub

MCP Server for AI Agent Marketplace Index from DeepNLP, allowing AI assistants to search available AI agents by keywords or categories.

browser_automation

github-mcp-server

4.1

by github

The GitHub MCP Server is a Model Context Protocol server that integrates with GitHub APIs for automation and interaction.

developer_tools

gateway

4.0

by centralmind

CentralMind Gateway is a tool designed to expose databases to AI agents via MCP or OpenAPI protocols, providing secure, LLM-optimized APIs.

databases

unity-mcp

3.8

by justinpbarnett

Unity MCP connects your Unity Editor to LLMs using the Model Context Protocol, enabling AI assistants to interact with Unity for asset management, scene control, script editing, and task automation.

ai_chatbot

whatsapp-mcp

3.8

by lharries

This is a Model Context Protocol (MCP) server for WhatsApp, allowing users to interact with their WhatsApp messages and contacts through an LLM like Claude.

communication

modelcontextprotocol

3.8

by ppl-ai

The Perplexity Ask MCP Server integrates the Sonar API to provide Claude with real-time, web-wide research capabilities.

research_and_data

mcp-crawl4ai-rag

3.8

by coleam00

Crawl4AI RAG MCP Server is a powerful implementation of the Model Context Protocol (MCP) integrated with Crawl4AI and Supabase, providing AI agents and AI coding assistants with advanced web crawling and RAG capabilities.

browser_automation

mcp-server-whisper

3.8

by arcaputo3

MCP Server Whisper is a Model Context Protocol server designed for advanced audio transcription and processing using OpenAI's Whisper and GPT-4o models.

ai_chatbot

tabby-mcp-server

3.8

by thuanpham582002

Tabby-MCP-Server is a powerful plugin that implements a Model Context Protocol (MCP) server, enabling AI-powered terminal control and automation.

ai_chatbot

azure-mcp

3.8

by Azure

The Azure MCP Server implements the MCP specification to create a seamless connection between AI agents and Azure services.

cloud_platforms

unreal-mcp

3.8

by chongdashu

This project enables AI assistant clients like Cursor, Windsurf, and Claude Desktop to control Unreal Engine through natural language using the Model Context Protocol (MCP).

developer_tools

anyquery

3.7

by julien040

Anyquery is a versatile SQL query engine that allows users to run SQL queries on various data sources, including files, databases, and applications, with support for LLMs and MySQL server functionality.

databases

UnityMCP

3.7

by Arodoid

UnityMCP is a Unity Editor plugin that integrates the Model Context Protocol (MCP) for seamless interaction between Unity and AI assistants.

developer_tools

obsidian-mcp

3.7

by StevenStavrakis

Obsidian MCP Server is a Model Context Protocol server that allows AI assistants to interact with Obsidian vaults, providing tools for managing notes and tags.

ai_chatbot

enrichmcp

3.7

by featureform

MCPEngine is a production-grade, HTTP-first implementation of the Model Context Protocol (MCP), providing a secure, scalable, and modern framework for exposing data, tools, and prompts to Large Language Models (LLMs) via MCP.

developer_tools

mcp-youtube

3.7

by anaisbetts

The YouTube MCP Server utilizes `yt-dlp` to download subtitles from YouTube videos and connects to Claude.ai using the Model Context Protocol.

ai_chatbot

spotify-mcp

3.7

by varunneal

MCP project to connect Claude with Spotify using spotipy-dev's API.

entertainment_and_media

actors-mcp-server

3.7

by apify

Implementation of an MCP server for all Apify Actors, enabling interaction with one or more Apify Actors defined in the MCP Server configuration.

research_and_data

jinni

3.7

by smat-dev

Jinni is a tool designed to efficiently provide Large Language Models (LLMs) with the context of your projects by consolidating relevant project files.

developer_tools

mcp-server-mas-sequential-thinking

3.6

by FradSer

The Sequential Thinking Multi-Agent System (MAS) is an advanced problem-solving tool using a Multi-Agent System architecture built with the Agno framework, designed to enhance analysis and problem decomposition through coordinated, specialized agents.

research_and_data

google-search

3.6

by web-agent-master

A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches and extract results.

browser_automation