Benchmark Finder
A space to view and inspect all the tasks in lighteval
LLM evaluation
Explore LLM benchmark trends over time
Explore and discover all leaderboards from the HF community
Explore and compare AI model scores across official benchmarks
Display and inspect log files
Launch and monitor model evaluation jobs
Generate a command to run model evaluations
Compare tokenization lengths across languages