audio-transcriber by Knuckles-Team - MCP Server

Audio-Transcriber

Version: 0.5.52

Transcribe your .wav .mp4 .mp3 .flac files to text or record your own audio!

This repository is actively maintained - Contributions are welcome!

Contribution Opportunities:

Support new models

Usage:

Short Flag	Long Flag	Description
-h	--help	See Usage
-b	--bitrate	Bitrate to use during recording
-c	--channels	Number of channels to use during recording
-d	--directory	Directory to save recording
-e	--export	Export txt, srt, and vtt files
-f	--file	File to transcribe
-l	--language	Language to transcribe
-m	--model	Model to use: <tiny, base, small, medium, large>
-n	--name	Name of recording
-r	--record	Specify number of seconds to record to record from microphone

Example:

audio-transcriber --file '~/Downloads/Federal_Reserve.mp4' --model 'large'
audio-transcriber --record 60 --directory '~/Downloads/' --name 'my_recording.wav' --model 'tiny'

Model Information:

Courtesy of and Credits to OpenAI: Whisper.ai

Size	Parameters	English-only model	Multilingual model	Required VRAM	Relative speed
tiny	39 M	`tiny.en`	`tiny`	~1 GB	~32x
base	74 M	`base.en`	`base`	~1 GB	~16x
small	244 M	`small.en`	`small`	~2 GB	~6x
medium	769 M	`medium.en`	`medium`	~5 GB	~2x
large	1550 M	N/A	`large`	~10 GB	1x

Installation Instructions:

Use with AI

Configure mcp.json

{
  "mcpServers": {
    "audio_transcriber": {
      "command": "uv",
      "args": [
        "run",
        "--with",
        "audio-transcriber",
        "audio-transcriber-mcp"
      ],
      "env": {
        "WHISPER_MODEL": "medium",            // Optional
        "TRANSCRIBE_DIRECTORY": "~/Downloads" // Optional
      },
      "timeout": 200000
    }
  }
}

Deploy MCP Server as a container

docker pull knucklessg1/audio-transcriber:latest

Modify the compose.yml

services:
  audio-transcriber:
    image: knucklessg1/audio-transcriber:latest
    environment:
      - HOST=0.0.0.0
      - PORT=8021
    ports:
      - 8021:8021

Install Python Package

python -m pip install audio-transcriber

uv pip install --upgrade audio-transcriber

Ubuntu Dependencies

sudo apt-get update
sudo apt-get install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0 ffmpeg gcc -y

Repository Owners: