Model Evaluation

Run Evaluation
If provided, both models are evaluated for comparison

Limited to 50 samples. Sign in for more.
Higher = faster on GPU

Required for private datasets or pushing results. Get token
Saves predictions and WER for each sample (requires token with write access)
Current Job
No job

No evaluation running. Submit a job to see progress here.

Evaluation History
Time Model Dataset Samples WER Base WER Results Status
No evaluations yet