Braintrust MCP Server
Automate AI evaluations with Braintrust — organize projects, test model datasets, run benchmarks, and manage prompts via any AI agent.
Vinkius AI Gateway suporta streamable HTTP e SSE.
Funciona com todos os agentes de IA que você já usa
…e qualquer cliente compatível com MCP


















Braintrust MCP Server: veja o seu AI Agent em ação
Capacidades integradas (10)
create_experiment
Establish a new historical experiment trace to record LLM pipeline tests
create_project
Create a new project environment for tracking AI evaluations and datasets
get_dataset
Retrieve a specific dataset containing exact schemas bounding LLM outputs
get_prompt
Retrieve exact variable contexts and literal text templates for a prompt
insert_dataset_row
Append new test cases into a dataset matrix targeting specific evaluations
list_datasets
List isolated Ground Truth text banks used for automated evaluation scoring
list_env_vars
Probe the Braintrust AI Gateway configurations managing model API keys securely
list_experiments
Retrieve all evaluation experiments mapping model test scores and metrics
list_projects
Retrieve the list of all AI evaluation projects in Braintrust
list_prompts
Retrieve explicitly version-controlled system prompts isolated in Braintrust
O que esse conector desbloqueia
Connect your Braintrust AI observation platform to any agent and maintain intense logic evaluation capabilities directly over conversation.
What you can do
- Project Analytics — Retrieve logic banks and branch isolated AI test sets
- Experiments — Create real trace regression tests appending unique LLM scoring iterations
- Datasets — Query accurate Ground Truth sets and insert new prompt templates mapping your system accuracy
- Prompt Versioning — Grab perfectly frozen semantic prompts without editing core code boundaries
How it works
1. Add this server to your AI cluster
2. Bind your personal Braintrust API ID variables
3. Leverage complex model tuning pipelines querying native AI logic regressions on chat
Automate LLM regression analyses effortlessly. Rather than scrolling tables, your bot handles strict semantic checking via Braintrust infrastructure logic directly.
Who is this for?
- AI Developers — push Ground Truth evaluation text datasets on the fly testing prompt differences
- Machine Learning Engineers — track specific variable distributions checking accurate regressions remotely
- Product Teams — observe exact string prompts dynamically pushing features validating response styles
- Data Scientists — construct massive matrices and evaluate test runs without pulling script queries
Perguntas frequentes
Dê aos seus agentes de IA o poder do Braintrust
Acesse o Braintrust e mais de 2.000 servidores MCP — prontos para seus agentes usarem, agora mesmo. Sem código cola. Sem integrações customizadas. Apenas plugue o Vinkius AI Gateway e deixe seus agentes trabalharem.
Mais nesta categoria

Semantic Scholar
4 ferramentasSearch 200M+ academic papers with AI-powered TLDR summaries, influential citation tracking, and researcher profiles from the Allen Institute for AI.

EIA Full Access — U.S. Energy Intelligence
34 ferramentasThe ultimate U.S. energy data Mega-Server: 34 tools covering petroleum, electricity, natural gas, coal, energy forecasts, state data, and international comparisons — every watt, barrel, and BTU from the federal government's energy agency.

Elastic Enterprise Search
6 ferramentasManage enterprise search via Elastic — search engines and documents, handle indexing, and monitor search analytics directly from any AI agent.
Você também pode gostar

Handwrytten
10 ferramentasAutomate handwritten notes via Handwrytten — manage cards, fonts, and send physical mail directly from any AI agent.

ChargeDesk
8 ferramentasManage billing and payments via ChargeDesk — track charges, refund payments, and manage customers across multiple gateways directly from any AI agent.

NOAA Climate — Historical Weather Records
5 ferramentasHistorical climate data from the planet's largest weather archive: GHCN-Daily temperature and precipitation records, monthly and yearly summaries, 30-year climate normals, and station search from NOAA's National Centers for Environmental Information.
