veritas-mcp

movibe-ai/veritas-mcp

3.1

If you are the rightful owner of veritas-mcp and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

The Multi-Platform Verification MCP Server is a robust solution for automated application verification across Android, iOS, and Web platforms, ensuring high reliability and comprehensive validation.

Tools
7
Resources
0
Prompts
0

🛡️ Veritas MCP - AgentGuard

Prevents AI agents from reporting false success by actually running tests, builds, and verification

Tests Coverage TypeScript

🎯 Problem Solved

AI agents frequently lie to users about work quality:

  • ❌ "Tests pass" → Tests never actually ran
  • ❌ "Feature works" → Code doesn't compile
  • ❌ "Build succeeds" → Build was never attempted
  • ❌ "No breaking changes" → System completely broken

🛡️ AgentGuard Solution

MANDATORY verification before any agent success claim:

  • Actually executes tests, builds, and validation
  • Evidence capture with screenshots and logs
  • Comprehensive reporting with failure analysis
  • Blocks false positive claims automatically

🚀 Quick Start

Installation

npm install @movibe-ai/veritas-mcp

Basic Usage

import { AgentGuard } from '@movibe-ai/veritas-mcp';

const guard = new AgentGuard();

// Verify agent claims before reporting to user
const result = await guard.verifyAgentClaim({
  claim: {
    type: 'feature_implementation',
    description: 'Added user authentication with JWT tokens',
    changedFiles: ['src/auth.ts', 'src/middleware.ts'],
    expectedBehavior: 'Users can login and access protected routes',
    testCommands: ['npm test', 'npm run test:integration'],
    buildCommands: ['npm run build']
  },
  evidenceLevel: 'comprehensive'
});

if (result.success && result.result.verified) {
  console.log('✅ Agent claim verified - safe to report to user');
  console.log(`Confidence: ${result.result.confidence}%`);
} else {
  console.log('🚨 Agent claim FAILED - do not report success!');
  console.log('Failures:', result.result?.failures);
}

🏗️ Architecture

Core Components

AgentGuard.verifyAgentClaim(claim)VerificationResult
├── CodeChangeVerifier     // Syntax, compilation, imports validation
├── TestExecutionEngine    // ACTUALLY runs tests (Jest/pytest/XCTest)
├── BuildSystemIntegrator  // ACTUALLY builds (npm/gradle/xcode)
└── VerificationOrchestrator // Coordinates all verification steps

Verification Process

  1. Code Verification: Syntax ✓ Compilation ✓ Imports ✓
  2. Test Execution: Unit ✓ Integration ✓ E2E ✓
  3. Build Verification: npm ✓ gradle ✓ xcode ✓
  4. Evidence Collection: Screenshots, logs, metrics
  5. Comprehensive Report: Pass/fail with detailed analysis

🎖️ Features

Multi-Platform Support

  • Node.js: npm, yarn, pnpm builds with Jest/Mocha tests
  • Android: Gradle builds with JUnit/Espresso tests
  • iOS: Xcode builds with XCTest
  • Web: Webpack/Vite builds with Playwright/Cypress E2E

Framework Support

  • Test Frameworks: Jest, pytest, XCTest, Mocha, JUnit
  • Build Systems: npm, yarn, pnpm, gradle, xcodebuild, maven
  • Code Analysis: TypeScript, JavaScript, Python, Java, Swift

Evidence Collection

  • Screenshots: Visual proof of functionality
  • Videos: E2E test recordings
  • Logs: Detailed execution logs
  • Performance: Metrics and timing data

📊 Verification Results

Success Response

{
  success: true,
  result: {
    verified: true,
    confidence: 95,
    codeVerification: { /* detailed results */ },
    testResults: [ /* all test executions */ ],
    buildResults: [ /* all build attempts */ ],
    evidence: { /* screenshots, logs, etc */ },
    failures: [] // Empty on success
  }
}

Failure Response

{
  success: true, // Process completed
  result: {
    verified: false, // Verification failed
    confidence: 25,
    failures: [
      {
        type: 'compilation',
        severity: 'critical', 
        message: 'TypeScript errors in auth.ts:45',
        details: 'Cannot find name express'
      },
      {
        type: 'test',
        severity: 'high',
        message: '3/10 tests failing',
        details: 'Login tests failed with 401 errors'
      }
    ]
  }
}

🧪 Testing

This project follows strict Test-Driven Development (TDD):

# Run all tests
npm test

# Run with coverage
npm run test:coverage

# Run specific test suite
npm test -- tests/unit/AgentGuard.test.ts

Test Statistics

  • 104 tests passing across 7 test suites
  • 91.66% code coverage (statements)
  • TDD methodology throughout

🔧 Development

Prerequisites

  • Node.js ≥18.0.0
  • TypeScript 5.3+

Setup

git clone https://github.com/movibe-ai/veritas-mcp.git
cd veritas-mcp
npm install
npm run build
npm test

Scripts

npm run build          # Compile TypeScript
npm run test           # Run all tests
npm run test:coverage  # Coverage report
npm run lint           # ESLint check
npm run clean          # Clean dist/

🎯 Use Cases

1. Claude Code Integration

Prevent Claude from claiming success without actual verification:

// Before reporting "Feature implemented successfully"
const verification = await agentGuard.verifyAgentClaim({
  claim: {
    type: 'feature_implementation',
    description: 'User dashboard with real-time data',
    changedFiles: ['src/dashboard.tsx', 'src/api.ts'],
    expectedBehavior: 'Dashboard shows live user metrics',
    testCommands: ['npm test', 'npm run test:e2e'],
    buildCommands: ['npm run build']
  }
});

// Only report success if verification passes
if (verification.result?.verified) {
  reportToUser('✅ Feature implemented and verified');
} else {
  reportToUser('🚨 Implementation incomplete - fixing issues...');
}

2. CI/CD Pipeline Integration

Ensure deployment readiness:

const result = await agentGuard.verifyAgentClaim({
  claim: {
    type: 'bug_fix',
    description: 'Fixed authentication timeout issue',
    changedFiles: ['src/auth/session.ts'],
    expectedBehavior: 'Sessions persist correctly',
    testCommands: ['npm run test:auth'],
    buildCommands: ['npm run build:production']
  },
  evidenceLevel: 'comprehensive'
});

// Deploy only if verification passes
if (result.result?.verified && result.result?.confidence > 90) {
  triggerDeployment();
}

3. Code Review Automation

Validate PR claims:

// Validate PR description claims
const verification = await agentGuard.verifyAgentClaim({
  claim: {
    type: 'refactor', 
    description: 'Optimized database queries',
    changedFiles: pullRequest.changedFiles,
    expectedBehavior: 'Same functionality, better performance',
    testCommands: ['npm test', 'npm run test:performance'],
    buildCommands: ['npm run build']
  }
});

// Auto-approve if verification passes
if (verification.result?.verified) {
  approvePullRequest(pullRequest.id);
}

📈 Benefits

For Development Teams

  • Increased Confidence: Trust AI agent outputs completely
  • Time Savings: Automated verification vs manual checking
  • Quality Assurance: Every claim is verified with evidence
  • Risk Reduction: No broken deployments from false claims

For CI/CD Pipelines

  • Deployment Safety: Only verified code reaches production
  • Automated Quality Gates: Built-in verification checkpoints
  • Evidence Trail: Complete audit log of all verifications
  • Failure Prevention: Catch issues before they impact users

for AI Safety

  • Prevents Hallucination: Agents can't claim false success
  • Evidence-Based: All claims backed by actual execution
  • Transparency: Clear reporting of what was verified
  • Accountability: Complete log of verification attempts

🤝 Contributing

We welcome contributions! Please follow TDD methodology:

  1. RED: Write failing tests first
  2. GREEN: Implement minimum code to pass
  3. REFACTOR: Improve while keeping tests green

See for details.


📄 License

MIT License - see file for details.


🙏 Acknowledgments


⚡ Stop AI agents from lying about work quality. Use AgentGuard.