DeepSpringAI_parquet_mcp_server

DeepSpringAI_parquet_mcp_server

3.1

If you are the rightful owner of DeepSpringAI_parquet_mcp_server and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

A powerful MCP server for manipulating and analyzing Parquet files, designed to work with Claude Desktop.

The Parquet MCP Server is a robust Model Control Protocol server that provides a suite of tools for handling Parquet files. It is particularly useful for data scientists and applications that require efficient data processing and analysis. The server integrates with Claude Desktop and offers functionalities such as text embedding generation, Parquet file analysis, and conversion to DuckDB and PostgreSQL formats. It also supports markdown processing, making it versatile for various data workflows. The server is designed to enhance data manipulation capabilities, especially for large datasets, and supports vector similarity search with PostgreSQL and pgvector.

Features

  • Text Embedding Generation: Converts text columns in Parquet files into vector embeddings using Ollama models.
  • Parquet File Analysis: Extracts detailed information about Parquet files including schema, row count, and file size.
  • DuckDB Integration: Converts Parquet files to DuckDB databases for efficient querying and analysis.
  • PostgreSQL Integration: Converts Parquet files to PostgreSQL tables with pgvector support for vector similarity search.
  • Markdown Processing: Converts markdown files into chunked text with metadata, preserving document structure and links.