sisig-ai/doctor
Doctor is a tool for discovering, crawl, and indexing web sites to be exposed as an MCP server for LLM agents.
Overview
sisig-ai/doctor is a Python MCP server licensed under MIT. Doctor is a tool for discovering, crawl, and indexing web sites to be exposed as an MCP server for LLM agents.
Ranked #9500 out of 25632 indexed tools.
Ecosystem
Python MIT
Signal Breakdown
Stars 463
Freshness 9mo ago
Issue Health 10%
Contributors 1
Dependents 0
Forks 65
Description Good
License MIT
How to Improve
Description low impact
Freshness high impact
Issue Health high impact
Matched Queries
From the README
<div align="center">
<picture>
</picture>
<br>
<h2>🩺 Doctor</h2>
A tool for discovering, crawl, and indexing web sites to be exposed as an MCP server for LLM agents for better and more up-to-date reasoning and code generation.
</div>
---
### 🔍 Overview
Doctor provides a complete stack for:
- Crawling web pages using crawl4ai with hierarchy tracking
- Chunking text with LangChain
- Creating embeddings with OpenAI via litellm
- Storing data in DuckDB with vector search support
- Exposing search functionality via a FastAPI web service
- Making these capabilities available to LLMs through an MCP server
- Navigating crawled sites with hierarchical site maps
---
### 🏗️ Core Infrastructure
#### 🗄️ DuckDB
- Database for storing document data and embeddings with vector search capabilities
- Managed by unified Database class
#### 📨 Redis
- Message broker for asynchronous task processing
#### 🕸️ Crawl Worker
- Processes crawl jobs
- Chunks text
- Creates embeddings
#### Read full README on GitHub →
Are you the maintainer? Claim this listing