Evaluations
Benchmark your models on standardized test suites and compare results.
Configure Evaluation
Accuracy Test
Measures correct answer rate on 100 classification samples.
100 test samples
No evaluations yet
Select a model and benchmark above to run your first evaluation.