Conectores llm-benchmarking — despliega uno y tu agente opera al instante.
1 apps
Automate AI evaluations with Braintrust — organize projects, test model datasets, run benchmarks, and manage prompts via any AI agent.