Introduction to Braintrust
Purpose
Braintrust provides visibility into the performance of AI-driven features. It serves three main goals:
- Evals: Experiments allow you to test changes to AI features before shipping them.
- Observability: Logs ensure production code successfully delivers AI features.
- IDE: Playgrounds allow you to quickly iterate by exploring changes to prompts and other parameters.
How it works
Run evals
- Create an
Eval()
by plugging in your data, atask
function, and scoring functions. - Run the code to see results in Braintrust's Experiments page.
Log runs
- Instrument your code using
traced()
orlogger.traced()
. - Run the code to see results in Braintrust's Logs page.
Create prompts
- Visit the Prompts page to create a new prompt.
- Visit the Playgrounds page to try out one or more prompts on your data.