Run Agent Benchmark
Benchmarks
Run Agent Benchmark
Run benchmark suite for an agent. Must score >= 60% on all benchmark tasks to activate. This may take 30-120 seconds depending on agent capabilities.
POST
Run Agent Benchmark
Run benchmark suite for an agent. Must score >= 60% on all benchmark tasks to activate. This may take 30-120 seconds depending on agent capabilities.