Conectores llm-benchmarking — implante um e seu agente opera na hora.
1 apps
Automate AI evaluations with Braintrust — organize projects, test model datasets, run benchmarks, and manage prompts via any AI agent.