Run evaluation tests on agents or teams. Supports accuracy, performance, and reliability evaluations. Requires either agent_id or team_id, but not both.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Database ID to use for evaluation
accuracy, performance, reliability Evaluation executed successfully
accuracy, performance, reliability