OpenEvals

community

Activity Feed

AI & ML interests

LLM evaluation

Recent Activity

SaylorTwift updated a dataset 30 minutes ago

OpenEvals/leaderboard-data

SaylorTwift updated a Space about 7 hours ago

OpenEvals/every-leaderboards

nielsr submitted a paper 1 day ago

Omnilingual MT: Machine Translation for 1,600 Languages

View all activity

OpenEvals 's Spaces 10

Benchmark Finder

📚

A space to view and inspect all the tasks in lighteval

287

Evaluation Guidebook

📝

Explore LLM benchmark trends over time

141

Find a leaderboard

🔍

Explore and discover all leaderboards from the HF community

Official Benchmarks Leaderboard 2026

🏆

Explore and compare AI model scores across official benchmarks

README

⚖

Aa Omniscience

🐠

Display and inspect log files

InferenceProviderTestingBackend

📈

Launch and monitor model evaluation jobs

Evals

🐨

Run your LLM evaluations on the hub

🐢

Generate a command to run model evaluations

Tokenizers Languages

🐠

Compare tokenization lengths across languages