alessandrobondielli's Collections
Datasets-ScaleLLM
Updated Jul 1, 2025
truthfulqa/truthful_qa • Viewer • Updated Jan 4, 2024 • 1.63k • 61.1k • 273
allenai/qasc • Viewer • Updated Jan 4, 2024 • 9.98k • 1.93k • 23
Anthropic/model-written-evals • Viewer • Updated Dec 21, 2022 • 3.25k • 857 • 57
yesilhealth/Health_Benchmarks • Viewer • Updated Apr 20, 2025 • 7.54k • 1.16k • 8
maveriq/bigbenchhard • Viewer • Updated Sep 29, 2023 • 6.51k • 1.44k • 38
Note: Filter out the subsets that have no choice field.
tau/commonsense_qa • Viewer • Updated Jan 4, 2024 • 12.1k • 50.5k • 134
allenai/sciq • Viewer • Updated Jan 4, 2024 • 13.7k • 39.8k • 135
allenai/openbookqa • Viewer • Updated Jan 4, 2024 • 11.9k • 88.9k • 126
allenai/ai2_arc • Viewer • Updated Dec 21, 2023 • 7.79k • 281k • 312
TIGER-Lab/MMLU-Pro • Benchmark • Updated Jan 19 • 12.1k • 83.3k • 429
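
The note on maveriq/bigbenchhard asks to drop the subsets that have no choice field. One possible way to express that filter is sketched below. The helper `subsets_with_field`, the default field name `"choice"`, and the example column listings are all assumptions made for illustration; the real subset names and schema of the dataset are not stated here and should be checked before use.

```python
from typing import Dict, Iterable, List


def subsets_with_field(
    columns_by_subset: Dict[str, Iterable[str]],
    field: str = "choice",  # assumed field name, taken from the wording of the note
) -> List[str]:
    """Return the subset names whose columns include `field`, in input order."""
    return [
        name
        for name, columns in columns_by_subset.items()
        if field in set(columns)
    ]


# Hypothetical column listings, for illustration only (not the real schema).
example = {
    "subset_a": ["question", "choice", "answer"],
    "subset_b": ["question", "answer"],
    "subset_c": ["question", "choice", "answer"],
}

kept = subsets_with_field(example)
print(kept)
```

The mapping passed in would have to be built from the actual dataset, for example by collecting the column names of each subset after loading it.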