Google PageRank for AI agents. 25,000+ tools indexed.

Data Processing Tools

41 tools · avg score 34.6

Data processing MCP servers and tools connect AI agents to data pipeline infrastructure: Apache Kafka streams, Spark clusters, dbt models, Airflow DAGs, and ETL workflows. These tools enable a new class of AI-assisted data engineering where agents can monitor, debug, and even modify data pipelines in response to natural language instructions.

Beyond traditional data engineering tools, this category includes document processing utilities (PDF extraction, OCR, format conversion) and data transformation tools that let agents reshape structured data between formats. These are the plumbing that makes AI-powered analytics workflows possible.

Data processing tools often require careful permissioning — an agent with write access to a production Kafka topic or dbt project can cause serious downstream damage. Look for tools with dry-run modes, explicit schema validation, and audit logging before granting write access in production environments.

# Name Score Stars Last Commit Language
1 kubeflow/mcp-apache-spark-history-server 76.5 142 1d ago Python
2 dbt-labs/dbt-mcp 76.4 519 1d ago Python
3 yangkyeongmo/mcp-server-apache-airflow 67.8 150 24d ago Python
4 call518/MCP-Airflow-API 53.6 44 1d ago Python
5 pragunbhutani/dbt-llm-agent 50.8 168 7mo ago Python
6 gabcoyne/airflow-unfactor 50.7 9 23d ago TypeScript
7 diffbot/diffbot-mcp 50.1 1 15d ago Python
8 NiclasOlofsson/dbt-core-mcp 50.0 11 2d ago Python
9 wklee610/kafka-mcp 49.8 10 19d ago Python
10 therevenueengineer/polytomic-mcp 49.7 1 10d ago Python
11 Astoriel/dbt-doctor 48.9 80 4d ago Python
12 citronlegacy/deepghs-mcp 48.7 1 19d ago Python
13 pablixnieto2/etld-mcp-server 48.0 2 1d ago JavaScript
14 kyle-chalmers/dbt-agentic-development 47.9 4 17d ago Shell
15 gAmUssA/mcp-kafka 47.6 9 2d ago Java
16 tuannvm/kafka-mcp-server 43.0 46 18d ago Go
17 nic01asFr/QgisStreamMCP 38.0 2 1mo ago Python
18 ShreyasDasari/SkyDelay-Intelligence 36.9 1 1mo ago Python
19 madamak/apache-airflow-mcp-server 36.2 2 1mo ago Python
20 CharlieDigital/runjs 30.9 31 9mo ago C#
21 lmfor/LeaguePredictionModelPipeline 30.6 1 2mo ago Python
22 dbt-labs/streamlit_mcp_cortex 29.1 6 4mo ago Python
23 c-cf/BI-Chart-MCP-Server 28.8 13 1y ago Python
24 brandon-powers/mcp-kafka 26.5 3 1y ago Python
25 abhishekbhakat/airflow-mcp-server 24.0 32 5mo ago Python
26 jairus-m/dbt-mcp-claude-devcontainer 23.1 1 7mo ago Shell
27 aswinayyolath/kafka-mcp-server 22.2 2 7mo ago Python
28 dabblefish-solutions/dabblefish__mcp__dbt-core-tools 20.4 1 9mo ago Python
29 armalite/data-product-hub 20.0 8 5mo ago Python
30 mattijsdp/dbt-docs-mcp 19.9 24 7mo ago Python
31 SteveZhuhaobo/dbt-for-dm 19.7 1 6mo ago
32 kannandreams/dbt-mcp-server 18.1 3 11mo ago Python
33 CDataSoftware/cdata-sync-mcp-server 17.3 3 7mo ago TypeScript
34 astronomer/astro-airflow-mcp archived 16.9 9 2mo ago Python
35 marcoeg/mcp-bauplan 16.8 3 9mo ago Python
36 nehbehl/pandas-mcp-server 16.5 1 11mo ago
37 MammothGrowth/dbt-cli-mcp 16.3 19 9mo ago Python
38 raibid-labs/dgx-spark-mcp 15.7 1 3mo ago TypeScript
39 TommyBez/dbt-semantic-layer-mcp-server 14.9 11 1y ago TypeScript
40 grovesjosephn/pokemcp 11.5 3 9mo ago TypeScript
41 nikhil-ganage/mcp-server-airflow-token 10.1 1 8mo ago Python

Get the weekly AgentRank digest

Top movers, new tools, ecosystem insights — straight to your inbox.