Google PageRank for AI agents. 25,000+ tools indexed.

Data Processing Tools

35 tools · avg score 33.2 · sorted by AgentRank score

Data processing MCP servers and tools connect AI agents to data pipeline infrastructure: Apache Kafka streams, Spark clusters, dbt models, Airflow DAGs, and ETL workflows. These tools enable a new class of AI-assisted data engineering where agents can monitor, debug, and even modify data pipelines in response to natural language instructions.

Beyond traditional data engineering tools, this category includes document processing utilities (PDF extraction, OCR, format conversion) and data transformation tools that let agents reshape structured data between formats. These are the plumbing that makes AI-powered analytics workflows possible.

Data processing tools often require careful permissioning — an agent with write access to a production Kafka topic or dbt project can cause serious downstream damage. Look for tools with dry-run modes, explicit schema validation, and audit logging before granting write access in production environments.

# Tool Score Stars Last Commit Language
1 dbt-labs/dbt-mcp 76.0 507 5d ago Python
2 kubeflow/mcp-apache-spark-history-server 73.3 136 5d ago Python
3 yangkyeongmo/mcp-server-apache-airflow 71.1 148 16d ago Python
4 Astoriel/dbt-doctor 59.8 36 7d ago Python
5 call518/MCP-Airflow-API 54.3 44 10d ago Python
6 gabcoyne/airflow-unfactor 54.3 9 14d ago TypeScript
7 wklee610/kafka-mcp 53.3 10 11d ago Python
8 nic01asFr/QgisStreamMCP 50.8 2 21d ago Python
9 NiclasOlofsson/dbt-core-mcp 47.0 11 10d ago Python
10 tuannvm/kafka-mcp-server 46.5 45 10d ago Go
11 pragunbhutani/dbt-llm-agent 44.0 168 7mo ago Python
12 ShreyasDasari/SkyDelay-Intelligence 40.4 1 22d ago Python
13 madamak/apache-airflow-mcp-server 39.7 2 21d ago Python
14 lmfor/LeaguePredictionModelPipeline 34.1 1 1mo ago Python
15 CharlieDigital/runjs 30.9 30 8mo ago C#
16 dbt-labs/streamlit_mcp_cortex 29.3 6 4mo ago Python
17 c-cf/BI-Chart-MCP-Server 28.9 13 11mo ago Python
18 mattijsdp/dbt-docs-mcp 25.3 24 7mo ago Python
19 abhishekbhakat/airflow-mcp-server 23.5 31 5mo ago Python
20 jairus-m/dbt-mcp-claude-devcontainer 23.1 1 7mo ago Shell
21 aswinayyolath/kafka-mcp-server 22.2 2 7mo ago Python
22 brandon-powers/mcp-kafka 21.7 3 1y ago Python
23 dabblefish-solutions/dabblefish__mcp__dbt-core-tools 20.4 1 9mo ago Python
24 armalite/data-product-hub 20.2 8 4mo ago Python
25 SteveZhuhaobo/dbt-for-dm 19.8 1 6mo ago
26 kannandreams/dbt-mcp-server 18.1 3 11mo ago Python
27 CDataSoftware/cdata-sync-mcp-server 17.4 3 7mo ago TypeScript
28 astronomer/astro-airflow-mcp archived 16.6 8 1mo ago Python
29 nehbehl/pandas-mcp-server 16.6 1 11mo ago
30 MammothGrowth/dbt-cli-mcp 16.4 19 8mo ago Python
31 raibid-labs/dgx-spark-mcp 16.0 1 3mo ago TypeScript
32 marcoeg/mcp-bauplan 15.2 3 8mo ago Python
33 TommyBez/dbt-semantic-layer-mcp-server 14.9 11 1y ago TypeScript
34 grovesjosephn/pokemcp 11.5 3 8mo ago TypeScript
35 nikhil-ganage/mcp-server-airflow-token 10.1 1 8mo ago Python