#Model Evaluation MCP Servers
Discover 3 MCP servers tagged with Model Evaluation on the Vinkius App Catalog.
Comet ML
6 toolsManage machine learning experiments via Comet. Track model metrics, audit project workspaces, and inspect ML run parameters directly from any AI agent.
ROC AUC Evaluator MCP
1 toolsCompute the exact Area Under the ROC Curve for binary classification predictions. Local, mathematically perfect, zero LLM estimation.
Confusion Matrix Engine MCP
1 toolsDeterministically calculate True Positives, FP, Precision, Recall, F1-Score, and Accuracy local. Stop LLM hallucinations when evaluating model metrics.