Braintrust MCP Server
Automate AI evaluations with Braintrust: organize projects, manage test datasets, run experiments, and manage prompts from any AI agent.
Vinkius AI Gateway supports streamable HTTP and SSE.
Works with all the AI agents you already use
…and any MCP-compatible client

Braintrust MCP Server: watch your AI agent in action
Built-in capabilities (10)
create_experiment
Create a new experiment to record the results of LLM pipeline tests
create_project
Create a new project for tracking AI evaluations and datasets
get_dataset
Retrieve a specific dataset, including the schema that constrains expected LLM outputs
get_prompt
Retrieve a prompt's text template and the variables it uses
insert_dataset_row
Append a new test case to a dataset used for a specific evaluation
list_datasets
List the ground-truth datasets used for automated evaluation scoring
list_env_vars
List the environment variables configured in the Braintrust AI gateway, which manages model API keys securely
list_experiments
Retrieve all evaluation experiments, with their model test scores and metrics
list_projects
Retrieve the list of all AI evaluation projects in Braintrust
list_prompts
Retrieve the version-controlled system prompts stored in Braintrust
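As a rough illustration of how an agent might invoke one of these tools, the sketch below builds an MCP `tools/call` request, which is a JSON-RPC 2.0 message. The helper name `make_tool_call` is hypothetical, and calling `list_projects` with no arguments is an assumption about that tool's input schema, which this page does not document.

```python
import json
from typing import Any, Optional


def make_tool_call(
    name: str,
    arguments: Optional[dict[str, Any]] = None,
    request_id: int = 1,
) -> dict[str, Any]:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments or {}},
    }


# `list_projects` comes from the capability list above; passing no
# arguments is an assumption, not something the listing confirms.
request = make_tool_call("list_projects")
print(json.dumps(request, indent=2))
```

In practice an MCP client library sends this message for you; the sketch only shows the shape of what travels over the wire.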
What this connector unlocks
Connect your Braintrust AI observability platform to any agent and run evaluations directly from a conversation.
What you can do
- Project Analytics: retrieve projects and the separate AI test sets they contain
- Experiments: create experiments that record regression tests and each new round of LLM scoring
- Datasets: query ground-truth datasets and insert new test cases that measure your system's accuracy
- Prompt Versioning: fetch version-controlled prompts without editing application code
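To make the dataset workflow more concrete, here is a minimal sketch of a request that appends one test case through `insert_dataset_row`. The argument names `dataset_id`, `input`, and `expected` are assumptions chosen for illustration, as is the helper `build_insert_row_call`; the tool's real input schema is not given on this page and should be read from the server's tool listing.

```python
import json
from typing import Any


def build_insert_row_call(
    dataset_id: str,
    input_value: Any,
    expected: Any,
    request_id: int = 1,
) -> dict[str, Any]:
    """Build a `tools/call` request for insert_dataset_row.

    The keys inside "arguments" are hypothetical; check the tool's
    published input schema before relying on them.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "insert_dataset_row",
            "arguments": {
                "dataset_id": dataset_id,  # assumed parameter name
                "input": input_value,      # assumed parameter name
                "expected": expected,      # assumed parameter name
            },
        },
    }


# A placeholder dataset id and a single question/answer test case.
row_call = build_insert_row_call(
    dataset_id="example-dataset-id",
    input_value={"question": "What is the capital of France?"},
    expected="Paris",
)
print(json.dumps(row_call, indent=2))
```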
How it works
1. Add this server to your AI agent setup
2. Bind your personal Braintrust API credentials
3. Query evaluation results and model regressions from chat as part of your model tuning pipeline
Automate LLM regression analysis. Rather than scrolling through tables, your agent runs the checks directly through Braintrust.
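Once the server is added and credentials are bound, one way to sanity-check the setup is to compare the tools the server advertises against the ten capabilities listed on this page. The sketch below assumes a response shaped like an MCP `tools/list` result (a `tools` array whose entries carry a `name`); `missing_tools` and the trimmed `sample_result` are hypothetical and stand in for a real response.

```python
from typing import Any

# The ten tool names listed under "Built-in capabilities".
EXPECTED_TOOLS = {
    "create_experiment",
    "create_project",
    "get_dataset",
    "get_prompt",
    "insert_dataset_row",
    "list_datasets",
    "list_env_vars",
    "list_experiments",
    "list_projects",
    "list_prompts",
}


def missing_tools(tools_list_result: dict[str, Any]) -> set[str]:
    """Return expected tool names absent from a `tools/list` result."""
    advertised = {tool["name"] for tool in tools_list_result.get("tools", [])}
    return EXPECTED_TOOLS - advertised


# Hypothetical, trimmed response used only to exercise the check.
sample_result = {"tools": [{"name": "list_projects"}, {"name": "get_prompt"}]}
missing = missing_tools(sample_result)
print(sorted(missing))
```

An empty set means every listed capability is available to the agent.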
Who is this for?
- AI Developers: push ground-truth evaluation datasets on the fly to test prompt differences
- Machine Learning Engineers: track specific variables and check for regressions remotely
- Product Teams: inspect the exact prompt text behind a feature and validate response styles
- Data Scientists: build large datasets and evaluate test runs without writing query scripts
Give your AI agents the power of Braintrust
Access Braintrust and more than 2,000 MCP servers, ready for your agents to use right now. No glue code. No custom integrations. Just connect the Vinkius AI Gateway and let your agents work.
More in this category

Amazon Marketing Cloud
10 tools
Advanced advertising analytics: execute SQL queries and monitor workflows via AI.

EIA Full Access — U.S. Energy Intelligence
34 tools
The ultimate U.S. energy data Mega-Server: 34 tools covering petroleum, electricity, natural gas, coal, energy forecasts, state data, and international comparisons. Every watt, barrel, and BTU from the federal government's energy agency.

FRED Series — U.S. Economic Time Series
5 tools
Search and retrieve data from 816,000+ official U.S. economic time series: GDP, inflation, unemployment, interest rates, and money supply, with built-in transformations, frequency aggregation, and vintage analysis.
You might also like

Chroma (Vector DB)
7 tools
Manage vector embeddings via Chroma: list collections, query embeddings, and audit document counts directly from any AI agent.

Grafana k6 Cloud (Load Testing)
10 tools
Manage load tests via k6 Cloud: run tests, monitor performance metrics, and audit thresholds.

GitBook
8 tools
Manage technical documentation via GitBook: list organizations and spaces, handle document pages, search content, and audit collections directly from any AI agent.
