rouapps/caret
Terminal tool for inspecting and cleaning large LLM training datasets. Handles JSONL, Parquet, and CSV with memory-mapped I/O, near-duplicate detection, token visualization, dataset linting, and an MCP server.
Overview
rouapps/caret is a Rust MCP server licensed under MIT.
Ranked #3115 out of 25632 indexed tools.
Ecosystem
Rust MIT
Signal Breakdown
Stars: 10
Freshness: 1mo ago
Issue Health: 50%
Contributors: 0
Dependents: 0
Forks: 2
Description: Detailed
License: MIT
How to Improve
Freshness: high impact
Contributors: medium impact
Dependents: medium impact