Doc-Ask-Engine-MCP

RishulGupta/Doc-Ask-Engine-MCP

🗨️ Doc Ask Engine MCP

A customizable, agent-powered GenAI chatbot for querying your own documents and data, originally built for Naukri.com to automate HR policy queries.


🔍 What is Doc Ask Engine MCP?

Doc Ask Engine MCP is an AI-powered question-answering engine that lets you "talk" to your internal documents. It combines Generative AI with agentic reasoning and runs on an MCP (Modular Command Processing) server, answering user queries from your own data.

Originally developed during my internship at Naukri.com, it helped answer questions like:

  • 🕒 What are the office timings?
  • 📝 How many days of leave can I take in a year?
  • 💰 What's the reimbursement process?

Now open-sourced and fully customizable: plug in your own PDFs to build a Q&A engine for any domain (HR, legal, medical, etc.).


⚑ Features

βœ… GenAI + Agentic reasoning
πŸ“„ PDF-based ingestion
🧠 FAISS-powered similarity search
πŸ–₯️ API served via MCP Server
πŸ”„ Fully customizable with your data


🚀 Getting Started

🧾 Step 1: Clone the Repository

git clone https://github.com/RishulGupta/Doc-Ask-Engine-MCP.git
cd Doc-Ask-Engine-MCP

📁 Step 2: Add Your PDFs

Create a folder inside the repository and add your documents:

Doc-Ask-Engine-MCP/
└── my_data/
    ├── handbook.pdf
    └── code_of_conduct.pdf

⚙️ Step 3: Modify data_ingestion.py

Edit the file and set this line to the name of your folder:

SOURCE_DIRECTORY = "my_data"

Then run:

python data_ingestion.py

This generates a FAISS index from your documents.
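
The README does not show how `data_ingestion.py` splits documents before indexing. As a rough illustration of the chunking step, here is a sketch that splits already-extracted text into overlapping character windows. `chunk_text`, the window size, and the overlap are all assumptions chosen for the example, not values taken from the repository.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into windows of chunk_size characters that overlap by `overlap`."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reaches the end of the text
    return chunks
```

Overlapping windows keep a sentence that straddles a boundary visible in both neighbouring chunks, which usually helps retrieval at the cost of a slightly larger index.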


🧠 Step 4: Start the MCP Server

python mainwithmcp.py

You'll see the server starting up.


🗨️ Step 5: Ask Questions

📡 Endpoint: http://127.0.0.2:8000/mcp/ask_faiss

Send a request with this JSON body:

{
  "ask_query": "What are the office working hours?"
}

You'll get an AI-generated response based on the documents you ingested.
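
As a sketch of how a client might call the endpoint, the example below uses only the Python standard library. The URL and the `ask_query` field come from this README. The HTTP method (POST), the `Content-Type` header, and the helper names `build_request` and `ask` are assumptions, since the README does not specify them.

```python
import json
import urllib.request

# Endpoint as given in this README.
ENDPOINT = "http://127.0.0.2:8000/mcp/ask_faiss"


def build_request(question: str, url: str = ENDPOINT) -> urllib.request.Request:
    """Build a JSON request carrying the question in the "ask_query" field."""
    body = json.dumps({"ask_query": question}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",  # assumption: the README does not state the HTTP method
    )


def ask(question: str, url: str = ENDPOINT, timeout: float = 30.0) -> str:
    """Send the question and return the raw response body as text."""
    with urllib.request.urlopen(build_request(question, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

With the server from Step 4 running, `print(ask("What are the office working hours?"))` would print whatever the server returns. The response format is not documented here, so the sketch leaves the body unparsed.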


🧩 How It Works

  1. 📚 Parse and chunk PDFs
  2. ✨ Embed text using GenAI
  3. 🧭 Store embeddings with FAISS
  4. 🔎 Retrieve relevant chunks
  5. 🤖 Agent synthesizes final answer
  6. 🌐 Answer served via MCP endpoint
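
Steps 2 to 4 above can be sketched without any external dependencies. The example below is only an illustration: it swaps the GenAI embedding model for a toy bag-of-words vector and FAISS for a brute-force cosine-similarity scan. The names `embed`, `cosine`, and `top_k` are hypothetical, and the sample chunks are invented. It shows the shape of embed, store, and retrieve, not the repository's actual code.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (a stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query (brute force, no FAISS)."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, embed(chunk)), chunk) for chunk in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:k]]


# Invented sample chunks, for illustration only.
sample_chunks = [
    "Office timings are 9:30 AM to 6:30 PM on weekdays.",
    "Employees may take paid leave after manager approval.",
    "Submit reimbursement claims together with receipts.",
]
```

In a full pipeline, the chunks returned by `top_k` would be handed to the agent (step 5) as context for composing the final answer. A real index such as FAISS replaces the linear scan so that search stays fast as the number of chunks grows.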

👨‍💻 Author

Rishul Gupta


📌 Notes

  • 🔓 Built for Naukri, generalized for everyone