Google for AI agents. 25,000+ tools indexed.

sisig-ai/doctor

Score: 23.2 Rank #9500

Doctor is a tool for discovering, crawling, and indexing websites to be exposed as an MCP server for LLM agents.

Overview

sisig-ai/doctor is a Python MCP server licensed under MIT. Doctor is a tool for discovering, crawling, and indexing websites to be exposed as an MCP server for LLM agents.

Ranked #9500 out of 25632 indexed tools.

Ecosystem

Python MIT

Signal Breakdown

Stars: 463
Freshness: 9mo ago
Issue Health: 10%
Contributors: 1
Dependents: 0
Forks: 65
Description: Good
License: MIT

How to Improve

Description (low impact)

Expand your description to 150+ characters for better discoverability.

Freshness (high impact)

Last commit was 296 days ago; a recent commit would boost your freshness score.

Issue Health (high impact)

You have 9 open vs 1 closed issues; triaging stale issues improves health.

Badge

AgentRank score for sisig-ai/doctor
[![AgentRank](https://agentrank-ai.com/api/badge/tool/sisig-ai--doctor)](https://agentrank-ai.com/tool/sisig-ai--doctor)
<a href="https://agentrank-ai.com/tool/sisig-ai--doctor"><img src="https://agentrank-ai.com/api/badge/tool/sisig-ai--doctor" alt="AgentRank"></a>

Matched Queries

"mcp server", "mcp-server"

From the README

<div align="center">
<br>

<h2>🩺 Doctor</h2>

A tool for discovering, crawling, and indexing websites to be exposed as an MCP server for LLM agents, for better and more up-to-date reasoning and code generation.

</div>

---

### 🔍 Overview

Doctor provides a complete stack for:
- Crawling web pages using crawl4ai with hierarchy tracking
- Chunking text with LangChain
- Creating embeddings with OpenAI via litellm
- Storing data in DuckDB with vector search support
- Exposing search functionality via a FastAPI web service
- Making these capabilities available to LLMs through an MCP server
- Navigating crawled sites with hierarchical site maps
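
The chunk, embed, and search steps listed above can be sketched in a few lines. This is only an illustration under loud assumptions: it uses the Python standard library, a word-window splitter in place of LangChain, a bag-of-words counter in place of OpenAI embeddings, and an in-memory list in place of DuckDB. The names `chunk_text`, `embed`, `build_index`, and `search` are hypothetical and are not taken from Doctor's code.

```python
import math
from collections import Counter


def chunk_text(text: str, size: int = 8, overlap: int = 2) -> list[str]:
    """Split text into overlapping windows of `size` words (stand-in for a real splitter)."""
    words = text.split()
    if not words:
        return []
    step = size - overlap
    stop = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, stop, step)]


def embed(text: str) -> Counter:
    """Toy 'embedding': lowercase word counts (stand-in for a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(pages: dict[str, str]) -> list[tuple[str, str, Counter]]:
    """Chunk every page and keep (url, chunk, vector) rows (stand-in for a database table)."""
    return [
        (url, chunk, embed(chunk))
        for url, text in pages.items()
        for chunk in chunk_text(text)
    ]


def search(index: list[tuple[str, str, Counter]], query: str, k: int = 1) -> list[tuple[str, str]]:
    """Return the k chunks most similar to the query, best match first."""
    q = embed(query)
    ranked = sorted(index, key=lambda row: cosine(q, row[2]), reverse=True)
    return [(url, chunk) for url, chunk, _ in ranked[:k]]


# Hypothetical pages standing in for crawled content.
pages = {
    "https://example.com/a": "DuckDB stores embeddings for vector search",
    "https://example.com/b": "Redis is a message broker for task queues",
}
index = build_index(pages)
top_hit = search(index, "message broker")[0]
```

A real deployment would replace each stand-in with the corresponding component from the list above; the flow of data (page text to chunks to vectors to ranked results) is the part this sketch is meant to show.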

---

### 🏗️ Core Infrastructure

#### 🗄️ DuckDB
- Database for storing document data and embeddings with vector search capabilities
- Managed by a unified `Database` class

#### 📨 Redis
- Message broker for asynchronous task processing

#### 🕸️ Crawl Worker
- Processes crawl jobs
- Chunks text
- Creates embeddings
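
The broker-plus-worker pattern described above can be sketched as follows. This is an assumption-heavy illustration, not Doctor's implementation: the standard library's `queue.Queue` stands in for Redis, the page text is supplied directly instead of being crawled, and the "embedding" is a two-number placeholder. `CrawlJob`, `process_job`, `worker`, and `run_jobs` are hypothetical names.

```python
import queue
import threading
from dataclasses import dataclass


@dataclass
class CrawlJob:
    url: str
    text: str  # stand-in for page content a real crawler would fetch


def process_job(job: CrawlJob, store: list, chunk_words: int = 50) -> int:
    """Chunk the page text, attach a placeholder embedding, and store each chunk."""
    words = job.text.split()
    chunks = [" ".join(words[i:i + chunk_words]) for i in range(0, len(words), chunk_words)]
    for chunk in chunks:
        placeholder_embedding = [float(len(chunk)), float(len(chunk.split()))]
        store.append((job.url, chunk, placeholder_embedding))
    return len(chunks)


def worker(jobs: queue.Queue, store: list, chunk_words: int) -> None:
    """Consume jobs until a None sentinel arrives."""
    while True:
        job = jobs.get()
        try:
            if job is None:
                return
            process_job(job, store, chunk_words)
        finally:
            jobs.task_done()


def run_jobs(job_list: list[CrawlJob], chunk_words: int = 50) -> list:
    """Enqueue jobs, run one background worker, and return the stored rows."""
    jobs: queue.Queue = queue.Queue()
    store: list = []
    thread = threading.Thread(target=worker, args=(jobs, store, chunk_words))
    thread.start()
    for job in job_list:
        jobs.put(job)
    jobs.put(None)  # sentinel tells the worker to stop
    thread.join()
    return store


rows = run_jobs([CrawlJob("https://example.com", "one two three")], chunk_words=2)
```

Separating the producer (whatever enqueues crawl requests) from the worker lets slow steps such as crawling and embedding run asynchronously, which is the role the message broker plays in the stack described here.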
