lashineLaw/alloy-automation-RAG
If you are the rightful owner of alloy-automation-RAG and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to dayong@mcphub.com.
This document provides a comprehensive guide to configuring and using the Zilliz Cloud MCP server with Claude Code for vector database operations.
Zilliz Cloud MCP Server Configuration
This directory contains configuration for the Zilliz Cloud MCP (Model Context Protocol) server integration with Claude Code.
🎯 Overview
Zilliz Cloud is a fully managed vector database service (managed Milvus). This MCP server enables Claude Code to interact with your vector database using natural language commands.
Cluster: lastlora-vector
Region: GCP US West 1 (Serverless)
MCP Package: @zilliz/claude-context-mcp
🚀 Quick Start
Prerequisites
- Node.js installed
- Claude Code CLI or Claude Desktop
- Zilliz Cloud account with active cluster
Installation
The MCP server is already installed globally:
npm install -g @zilliz/claude-context-mcp
Configuration
Credentials are stored in .env.local (not committed to git):
# Load environment variables
source .env.local # Linux/Mac
# or
export $(cat .env.local | xargs) # Alternative
🔑 Credentials
All credentials are stored in .env.local file in this directory. Never commit this file to version control.
Required Credentials
- ZILLIZ_API_KEY: API authentication key
- ZILLIZ_TOKEN: Database access token
- ZILLIZ_ENDPOINT: Cluster endpoint URL
Optional Credentials
- ZILLIZ_USER: Database username
- ZILLIZ_PASSWORD: Database password
- ZILLIZ_ORG_ID: Organization identifier
- ZILLIZ_CLUSTER_ID: Cluster identifier
- ZILLIZ_CLUSTER_NAME: Human-readable cluster name
💻 Usage with Claude Code
MCP Server Tools
Once configured, you can use natural language commands like:
Collection Management
"List all collections in my Zilliz database"
"Create a collection called 'product_embeddings' with 1536 dimensions"
"Describe the 'documents' collection"
"Delete the 'old_data' collection"
Vector Operations
"Insert vectors with metadata into the 'embeddings' collection"
"Search for the top 5 most similar vectors to [query_vector]"
"Delete vectors where id in [1, 2, 3]"
Hybrid Search
"Search for vectors similar to [query] where category='real-estate' and price > 500000"
"Find documents similar to [text] published after 2024-01-01"
Cluster Information
"Show cluster statistics"
"Get storage usage information"
"List all indexes in the database"
🏗️ Cluster Details
Cluster Information
- Name: lastlora-vector
- ID: in05-520fac19203081e
- Type: Serverless (auto-scaling)
- Region: GCP US West 1
- Provider: Google Cloud Platform
Endpoint
https://in05-520fac19203081e.serverless.gcp-us-west1.cloud.zilliz.com
Features
- ✅ Serverless architecture (auto-scaling)
- ✅ Full text search (Milvus 2.5+)
- ✅ Hybrid search (vector + metadata)
- ✅ Multiple similarity metrics (L2, IP, COSINE)
- ✅ Dynamic index optimization
📚 Common Use Cases
1. Semantic Search
Store and search document embeddings for semantic similarity matching.
2. Image Similarity
Store image embeddings and find visually similar images.
3. Recommendation Systems
Store user/product embeddings for personalized recommendations.
4. Question Answering
Store knowledge base embeddings for RAG (Retrieval Augmented Generation).
5. Anomaly Detection
Store normal patterns and identify anomalies via vector distance.
🔧 Configuration Files
.env.local
Contains all sensitive credentials. EXCLUDED FROM GIT.
CLAUDE.md
Project-specific guidance for Claude Code when working with Zilliz operations.
.gitignore
Ensures sensitive files are not committed to version control.
🛡️ Security Best Practices
DO ✅
- Store credentials in
.env.local - Add
.env.localto.gitignore - Rotate API keys regularly
- Use environment variables in scripts
- Limit API key permissions to minimum required
DON'T ❌
- Commit
.env.localto git - Share credentials in plain text
- Hardcode credentials in source files
- Use production credentials in development
- Expose credentials in logs or error messages
📖 Documentation
Official Resources
- Zilliz Cloud Docs: https://docs.zilliz.com/
- MCP Server Docs: https://docs.zilliz.com/docs/zilliz-mcp-server
- Milvus Documentation: https://milvus.io/docs
- MCP Protocol: https://modelcontextprotocol.io/
GitHub Repositories
- Zilliz MCP Server: https://github.com/zilliztech/zilliz-mcp-server
- Milvus MCP Server: https://github.com/zilliztech/mcp-server-milvus
Tutorials
- Getting Started: https://zilliz.com/blog/introducing-zilliz-mcp-server
- MCP + Milvus Guide: https://milvus.io/docs/milvus_and_mcp.md
🧪 Testing the Connection
Quick Test
Load environment variables and test connection:
# Load credentials
export $(cat .env.local | grep -v '^#' | xargs)
# Test connection (via Claude Code MCP)
# In Claude Code, ask:
"List all collections in my Zilliz database"
Expected Response
You should see a list of collections in your lastlora-vector cluster.
🔍 Troubleshooting
Connection Issues
Problem: "Failed to connect to Zilliz Cloud"
Solutions:
- Verify
.env.localcredentials are correct - Check cluster is active in Zilliz Cloud console
- Verify endpoint URL is accessible
- Ensure API key has proper permissions
Authentication Issues
Problem: "Invalid API key or token"
Solutions:
- Regenerate API key in Zilliz Cloud console
- Update
.env.localwith new credentials - Reload environment variables
- Restart Claude Code if needed
MCP Server Issues
Problem: "MCP server not responding"
Solutions:
- Verify
@zilliz/claude-context-mcpis installed globally - Check Node.js version compatibility
- Restart Claude Code
- Check MCP server logs for errors
📊 Cluster Status
To check your cluster status:
- Visit https://cloud.zilliz.com/
- Navigate to your organization:
org-lseftrpmtsrqnczohrvsts - View cluster:
lastlora-vector - Monitor usage, storage, and performance
🎓 Example Workflows
Workflow 1: Create and Populate Collection
1. "Create a collection called 'property_embeddings' with 1536 dimensions using COSINE similarity"
2. "Insert vectors with property metadata (address, price, sqft) into property_embeddings"
3. "Search for properties similar to [embedding_vector]"
4. "Get statistics for the property_embeddings collection"
Workflow 2: Hybrid Search
1. "Search property_embeddings for vectors similar to [query] where price < 2000000 and city='San Francisco'"
2. "Find top 10 most similar properties with at least 3 bedrooms"
Workflow 3: Collection Management
1. "List all collections"
2. "Describe the 'documents' collection schema"
3. "Get row count for all collections"
4. "Drop the 'old_embeddings' collection"
🔗 Related Projects
If you're working with embeddings, consider:
- OpenAI Embeddings: text-embedding-3-small (1536 dims)
- Cohere Embeddings: embed-english-v3.0 (1024 dims)
- Voyage AI: voyage-2 (1024 dims)
- Custom Models: Any model that outputs vector embeddings
📝 Notes
- Serverless clusters auto-scale based on usage
- Free tier available for testing and development
- Production workloads may require capacity unit reservations
- Monitor usage to avoid unexpected costs
- Backup important collections regularly
Created: 2025-10-25 Last Updated: 2025-10-25 Cluster: lastlora-vector (in05-520fac19203081e) Status: ✅ Configured and ready for use