brucehe3/video-sum-mcp
If you are the rightful owner of video-sum-mcp and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
A Model Context Protocol (MCP) server that extracts content from multiple video platforms and generates intelligent knowledge graphs.
Video Content Summarization MCP Server
A Model Context Protocol (MCP) server that extracts content from multiple video platforms and generates intelligent knowledge graphs.
Features
š Multi-Platform Support
- Douyin (TikTok China) - Short video content extraction
- Bilibili - Video and live streaming content
- Xiaohongshu (Little Red Book) - Social media posts with OCR support
- Zhihu - Q&A platform content
⨠Advanced Capabilities
- OCR Text Recognition - Extract text from images using PaddleOCR
- Knowledge Graph Generation - Intelligent content structuring
- Chinese Content Optimization - Specialized processing for Chinese text
- Context-Aware Extraction - Smart content understanding and quality control
Installation
Prerequisites
- Python 3.8 or higher
- Anaconda (recommended for dependency management)
Setup
- Clone the repository:
git clone https://github.com/fakad/video-sum-mcp.git
cd video-sum-mcp
- Create and activate conda environment:
conda create -n vsc python=3.8
conda activate vsc
- Install dependencies:
pip install -r requirements.txt
Configuration
For Claude Desktop
Add this configuration to your Claude Desktop config file:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%/Claude/claude_desktop_config.json
{
"mcpServers": {
"video-sum-mcp": {
"command": "python",
"args": ["/path/to/video-sum-mcp/main.py"],
"cwd": "/path/to/video-sum-mcp",
"env": {
"CONDA_DEFAULT_ENV": "vsc"
}
}
}
}
For Other MCP Clients
The server can be started directly:
python main.py
Usage
Basic Video Processing
# Example: Process a Bilibili video
result = process_video(
url="https://www.bilibili.com/video/BV1234567890",
output_format="markdown"
)
Supported URL Formats
- Douyin:
https://v.douyin.com/...
or full URLs - Bilibili:
https://www.bilibili.com/video/...
- Xiaohongshu:
https://www.xiaohongshu.com/discovery/item/...
- Zhihu:
https://www.zhihu.com/question/...
Context-Enhanced Processing
For platforms with anti-crawling measures, you can provide context:
result = process_video(
url="https://...",
context_text="Additional context information..."
)
Features in Detail
OCR Integration
- Automatic image text extraction from Xiaohongshu posts
- PaddleOCR for accurate Chinese character recognition
- Batch processing for multiple images
Knowledge Graph Generation
- Structured content analysis
- Intelligent relationship mapping
- Quality control and validation
Anti-Crawling Strategies
- Smart fallback mechanisms
- Context-based extraction
- User guidance for optimal results
Development
Project Structure
video-sum-mcp/
āāā core/ # Core functionality modules
ā āāā extractors/ # Platform-specific extractors
ā āāā processors/ # Content processing logic
ā āāā knowledge_graph/ # Knowledge graph generation
ā āāā managers/ # Resource management
āāā scripts/ # MCP server implementation
āāā main.py # Main entry point
āāā requirements.txt # Python dependencies
āāā pyproject.toml # Project configuration
Running Tests
python -m pytest
Dependencies
Key dependencies include:
bilibili-api-python
- Bilibili API integrationyt-dlp
- Video downloading capabilitiesPaddleOCR
- OCR text recognitionbeautifulsoup4
- Web scrapingrequests
- HTTP requests
See requirements.txt
for complete list.
Contributing
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature
) - Commit your changes (
git commit -m 'Add some amazing feature'
) - Push to the branch (
git push origin feature/amazing-feature
) - Open a Pull Request
License
This project is licensed under the MIT License - see the file for details.
Acknowledgments
- Built using the Model Context Protocol
- OCR powered by PaddleOCR
- Platform integrations using various open-source APIs