rag-mcp-server
RAG MCP Server is a Retrieval-Augmented Generation server using AWS Lambda, OpenSearch Serverless, and OpenAI for embeddings and generation.
RAG MCP Server performs retrieval-augmented generation on a serverless AWS stack. AWS Lambda provides compute, API Gateway receives HTTP requests, OpenSearch Serverless stores and searches vectors, OpenAI supplies embeddings and text generation, and Amazon S3 stores documents. The infrastructure is defined and deployed with AWS CDK in Python. The server receives MCP calls, processes each request, and calls the AWS services needed for retrieval and generation.
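The request path described above can be sketched as a Lambda handler behind an API Gateway proxy integration. This is a minimal illustration, not the project's actual code: it assumes MCP calls arrive as JSON-RPC 2.0 `tools/call` requests in the event body, and the tool name `rag_query`, the `query` argument, and the `retrieve` stub are hypothetical names chosen for the example. Base64-encoded bodies are not handled.

```python
import json


def retrieve(query, top_k=3):
    # Hypothetical stand-in for the retrieval step. A real implementation
    # would embed the query and run a k-NN search against the vector index.
    passages = [{"text": f"stub passage for: {query}", "score": 1.0}]
    return passages[:top_k]


def _reply(status, payload):
    # Shape expected by an API Gateway Lambda proxy integration.
    return {
        "statusCode": status,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(payload),
    }


def _error(request_id, code, message, status=200):
    # JSON-RPC 2.0 error object; codes follow the JSON-RPC specification.
    payload = {
        "jsonrpc": "2.0",
        "id": request_id,
        "error": {"code": code, "message": message},
    }
    return _reply(status, payload)


def handler(event, context=None):
    # Parse the request body delivered by API Gateway.
    try:
        request = json.loads(event.get("body") or "{}")
    except json.JSONDecodeError:
        return _error(None, -32700, "Parse error", status=400)
    if not isinstance(request, dict):
        return _error(None, -32600, "Invalid Request", status=400)

    request_id = request.get("id")
    if request.get("method") != "tools/call":
        return _error(request_id, -32601, "Method not found")

    # Tolerate missing or malformed params by treating them as empty.
    params = request.get("params")
    params = params if isinstance(params, dict) else {}
    arguments = params.get("arguments")
    arguments = arguments if isinstance(arguments, dict) else {}

    query = arguments.get("query")
    if params.get("name") != "rag_query" or not isinstance(query, str):
        return _error(request_id, -32602, "Invalid params")

    # Retrieval only; the generation step is omitted from this sketch.
    passages = retrieve(query)
    text = "\n".join(p["text"] for p in passages)
    result = {"content": [{"type": "text", "text": text}]}
    return _reply(200, {"jsonrpc": "2.0", "id": request_id, "result": result})
```

Keeping `retrieve` as a separate function means the handler can be exercised locally with a plain dictionary event, without AWS credentials or network access.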
Features
- Serverless Architecture: Utilizes AWS Lambda and API Gateway for scalable and cost-effective deployment.
- OpenSearch Integration: Employs OpenSearch Serverless for efficient vector storage and k-NN search capabilities.
- OpenAI Embeddings: Uses OpenAI to generate embeddings and text.
- AWS CDK Management: Defines and deploys infrastructure with the AWS Cloud Development Kit (CDK) in Python.
- Secure Secrets Management: Uses AWS Secrets Manager to securely store API keys and other sensitive information.
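To make the k-NN search feature concrete, here is a small sketch of how a search request body could be built from a query embedding. It uses the OpenSearch k-NN query shape; the field name `embedding` and the helper `build_knn_query` are assumptions for illustration, since the project's actual index mapping is not shown here.

```python
import json


def build_knn_query(vector, k=5, field="embedding"):
    # Build an OpenSearch k-NN search body for a query embedding.
    # "embedding" is an assumed field name; use the one in your index mapping.
    if k < 1:
        raise ValueError("k must be at least 1")
    if not vector:
        raise ValueError("vector must not be empty")
    return {
        "size": k,
        "query": {
            "knn": {
                field: {
                    "vector": [float(x) for x in vector],
                    "k": k,
                }
            }
        },
        # Leave the stored vector out of the hits to keep responses small.
        "_source": {"excludes": [field]},
    }


if __name__ == "__main__":
    # A toy 3-dimensional vector; real embeddings have far more dimensions.
    print(json.dumps(build_knn_query([0.1, 0.2, 0.3], k=2), indent=2))
```

The resulting dictionary would be sent as the body of a search request against the vector index; the length of the query vector has to match the dimension declared for that field.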