BigUncle/Fast-Whisper-MCP-Server

Whisper Speech Recognition MCP Server

A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities.

Features

Integrated with Faster Whisper for efficient speech recognition
Batch processing acceleration for improved transcription speed
Automatic CUDA acceleration (if available)
Support for multiple model sizes (tiny to large-v3)
Output formats include VTT subtitles, SRT, and JSON
Support for batch transcription of audio files in a folder
Model instance caching to avoid repeated loading
Dynamic batch size adjustment based on GPU memory
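
To illustrate the subtitle output formats listed above, here is a minimal sketch of turning timed segments into SRT and VTT text. The Segment tuple and the function names are assumptions made for illustration; they are not taken from the server's formatters.py.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    """Hypothetical container for one transcribed segment (times in seconds)."""
    start: float
    end: float
    text: str


def format_timestamp(seconds: float, decimal_marker: str) -> str:
    """Render seconds as HH:MM:SS<marker>mmm (SRT uses ',', VTT uses '.')."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{decimal_marker}{ms:03d}"


def to_srt(segments) -> str:
    """Build SRT text: numbered cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg.start, ",")
        end = format_timestamp(seg.end, ",")
        blocks.append(f"{index}\n{start} --> {end}\n{seg.text.strip()}\n")
    return "\n".join(blocks)


def to_vtt(segments) -> str:
    """Build WebVTT text: a WEBVTT header followed by unnumbered cues."""
    lines = ["WEBVTT", ""]
    for seg in segments:
        start = format_timestamp(seg.start, ".")
        end = format_timestamp(seg.end, ".")
        lines += [f"{start} --> {end}", seg.text.strip(), ""]
    return "\n".join(lines)
```

The two formats differ mainly in the header line, the cue numbering, and the decimal marker in timestamps, which is why a single timestamp helper can serve both.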

Installation

Dependencies

Python 3.10+
faster-whisper>=0.9.0
torch==2.6.0+cu126
torchaudio==2.6.0+cu126
mcp[cli]>=1.2.0

Installation Steps

Clone or download this repository
Create and activate a virtual environment (recommended)
Install dependencies:

pip install -r requirements.txt

PyTorch Installation Guide

Install the appropriate version of PyTorch based on your CUDA version:

CUDA 12.6:

pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://download.pytorch.org/whl/cu126

CUDA 12.1:

pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu121

CPU version:

pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://download.pytorch.org/whl/cpu

You can check your CUDA version with nvcc --version (the installed CUDA toolkit) or nvidia-smi (the highest CUDA version the driver supports).
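
To confirm which PyTorch build was actually installed, a short check like the following may help. It is only a sketch: pick_device is a hypothetical helper name, and it falls back to CPU when PyTorch cannot be imported.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch sees a usable GPU, otherwise "cpu"."""
    try:
        import torch  # imported lazily so the check also works without PyTorch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print(f"Selected device: {pick_device()}")
```

If this reports cpu on a machine with an NVIDIA GPU, the CPU-only wheel was probably installed, or the driver does not support the CUDA version of the wheel.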

Usage

Starting the Server

On Windows, simply run start_server.bat.

On other platforms, run:

python whisper_server.py

Configuring Claude Desktop

Open the Claude Desktop configuration file:
- Windows: %APPDATA%\Claude\claude_desktop_config.json
- macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Add the Whisper server configuration:

{
  "mcpServers": {
    "whisper": {
      "command": "python",
      "args": ["D:/path/to/whisper_server.py"],
      "env": {}
    }
  }
}

Restart Claude Desktop
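
If the configuration file already lists other servers, the entry has to be merged rather than pasted over the file. The following sketch shows one way to do that; the function names are hypothetical, and the script path is whatever location you cloned the repository to.

```python
import json
from pathlib import Path


def add_whisper_server(config: dict, server_script: str, python_cmd: str = "python") -> dict:
    """Return a copy of config with a "whisper" entry under "mcpServers"."""
    updated = dict(config)
    servers = dict(updated.get("mcpServers", {}))
    servers["whisper"] = {"command": python_cmd, "args": [server_script], "env": {}}
    updated["mcpServers"] = servers
    return updated


def update_config_file(config_path: Path, server_script: str) -> None:
    """Read the JSON config (if present), add the entry, and write it back."""
    if config_path.exists():
        existing = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        existing = {}
    merged = add_whisper_server(existing, server_script)
    config_path.write_text(json.dumps(merged, indent=2), encoding="utf-8")
```

Existing entries under mcpServers are preserved; only the whisper key is added or overwritten.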

Available Tools

The server provides the following tools:

get_model_info - Get information about available Whisper models
transcribe - Transcribe a single audio file
batch_transcribe - Batch transcribe audio files in a folder
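
MCP clients invoke these tools with a JSON-RPC 2.0 request whose method is tools/call. The sketch below builds such a request for the transcribe tool. The argument name audio_path and the file path are assumptions for illustration; the real parameter names come from the tool's schema, which MCP Inspector displays.

```python
import json


def build_tool_call(request_id: int, tool_name: str, arguments: dict) -> str:
    """Serialize an MCP "tools/call" JSON-RPC 2.0 request."""
    request = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# Hypothetical argument name and path; check the tool schema for the real ones.
payload = build_tool_call(1, "transcribe", {"audio_path": "D:/audio/meeting.mp3"})
```

Clients such as Claude Desktop construct these requests automatically, so the sketch is mainly useful for understanding logs or writing a custom client.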

Performance Optimization Tips

Using CUDA acceleration significantly improves transcription speed
Batch processing mode is more efficient for large numbers of short audio files
Batch size is automatically adjusted based on GPU memory size
Using VAD (Voice Activity Detection) filtering improves accuracy for long audio
Specifying the correct language can improve transcription quality
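
As a rough illustration of adjusting batch size to GPU memory, a tiered lookup is one simple approach. The thresholds and batch sizes below are hypothetical values chosen for the example, not the ones the server uses.

```python
from typing import Optional

# Hypothetical tiers: (minimum free GPU memory in GiB, batch size).
BATCH_SIZE_TIERS = [(16.0, 32), (12.0, 16), (8.0, 12), (4.0, 8)]
DEFAULT_BATCH_SIZE = 4  # used for small GPUs and when running on CPU


def choose_batch_size(free_gpu_memory_gib: Optional[float]) -> int:
    """Pick the largest batch size whose memory threshold is satisfied.

    Pass None when no GPU is available.
    """
    if free_gpu_memory_gib is None:
        return DEFAULT_BATCH_SIZE
    for min_gib, batch_size in BATCH_SIZE_TIERS:
        if free_gpu_memory_gib >= min_gib:
            return batch_size
    return DEFAULT_BATCH_SIZE
```

For example, choose_batch_size(10.0) falls into the 8 GiB tier and returns 12 under these assumed values.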

Local Testing Methods

Use MCP Inspector for quick testing:

mcp dev whisper_server.py

Use Claude Desktop for integration testing
Invoke the server directly from the command line (requires mcp[cli]):

mcp run whisper_server.py

Error Handling

The server implements the following error handling mechanisms:

Audio file existence check
Model loading failure handling
Transcription process exception catching
GPU memory management
Batch processing parameter adaptive adjustment
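
The first three mechanisms can be sketched as a validation step followed by a guarded call. This is an illustrative outline only: the function names, the extension list, and the error messages are assumptions, and transcribe_fn stands in for whatever performs the actual transcription.

```python
from pathlib import Path

# Assumed set of extensions; the formats the server accepts may differ.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}


def validate_audio_file(path_str: str) -> Path:
    """Raise a descriptive error if the file is missing or has an unexpected type."""
    path = Path(path_str)
    if not path.is_file():
        raise FileNotFoundError(f"Audio file not found: {path}")
    if path.suffix.lower() not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"Unsupported audio format: {path.suffix or '(none)'}")
    return path


def safe_transcribe(path_str: str, transcribe_fn) -> str:
    """Run transcribe_fn and turn failures into a readable error message."""
    try:
        audio_path = validate_audio_file(path_str)
        return transcribe_fn(audio_path)
    except (FileNotFoundError, ValueError) as exc:
        return f"Error: {exc}"
    except Exception as exc:  # any failure raised during transcription itself
        return f"Error during transcription: {exc}"
```

Returning an error string instead of raising keeps the tool call from failing outright, so the client can show the message to the user.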

Project Structure

whisper_server.py: Main server code
model_manager.py: Whisper model loading and caching
audio_processor.py: Audio file validation and preprocessing
formatters.py: Output formatting (VTT, SRT, JSON)
transcriber.py: Core transcription logic
start_server.bat: Windows startup script
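
To show what model loading and caching can look like, here is a minimal sketch of a cache keyed by model name, device, and compute type. The ModelCache class is hypothetical and is not the contents of model_manager.py; the loader is injected so the example runs without Faster Whisper installed.

```python
from typing import Any, Callable, Dict, Tuple

CacheKey = Tuple[str, str, str]


class ModelCache:
    """Keep one loaded model per (model_name, device, compute_type) key."""

    def __init__(self, loader: Callable[[str, str, str], Any]) -> None:
        self._loader = loader
        self._models: Dict[CacheKey, Any] = {}

    def get(self, model_name: str, device: str, compute_type: str) -> Any:
        """Return the cached model, loading it on first request only."""
        key = (model_name, device, compute_type)
        if key not in self._models:
            self._models[key] = self._loader(model_name, device, compute_type)
        return self._models[key]


# With Faster Whisper installed, the loader could wrap its model class:
#   from faster_whisper import WhisperModel
#   cache = ModelCache(lambda name, device, ct:
#                      WhisperModel(name, device=device, compute_type=ct))
```

Because loading a large model can take many seconds, repeated calls with the same settings reuse the instance instead of reloading it.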

License

MIT

Acknowledgements

This project was developed with the assistance of these amazing AI tools and models:

GitHub Copilot - AI pair programmer
Trae - Agentic AI coding assistant
Cline - AI coding agent for the editor
DeepSeek - Advanced AI model
Claude-3.7-Sonnet - Anthropic's powerful AI assistant
Gemini-2.0-Flash - Google's multimodal AI model
VS Code - Powerful code editor
Whisper - OpenAI's speech recognition model
Faster Whisper - Optimized Whisper implementation

Special thanks to these incredible tools and the teams behind them.
