Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Reset Other
chain-of-thought
art
Synthetic
medical
code
biology
finance
legal
chemistry
agent
music
climate
Apply filters
Datasets
3,705
Full-text search
Edit filters
Sort: Trending
Active filters:
benchmark
Clear all
Qwen/Qwen-Image-Bench
Viewer
•
Updated
9 days ago
•
1k
•
13.1k
•
28
ServiceNow-AI/eva-bench
Viewer
•
Updated
23 days ago
•
213
•
24
•
17
openvivo/VINS-120K
Viewer
•
Updated
1 day ago
•
131k
•
555
•
10
microsoft/RHELM
Viewer
•
Updated
3 days ago
•
1.31k
•
2.18k
•
10
TuringEnterprises/Multimodal-STEM-HLE-plus-plus
Viewer
•
Updated
19 days ago
•
50
•
2.22k
•
22
nvidia/video-full-duplex-benchmark
Viewer
•
Updated
8 days ago
•
237
•
64
•
6
actava/chi-bench
Benchmark
•
Updated
3 days ago
•
101
•
6.93k
•
54
Modotte/CodeX-2M-Thinking
Viewer
•
Updated
Feb 10
•
2.19M
•
7.16k
•
125
markus-42/OccuFly
Updated
5 days ago
•
400
•
5
datacurve/deep-swe
Benchmark
•
Updated
4 days ago
•
113
•
41
•
5
Ujjwal-Tyagi/ai-ml-foundations-book-collection
Viewer
•
Updated
Apr 24
•
25
•
2.3k
•
47
meituan-longcat/WBench
Benchmark
•
Updated
8 days ago
•
867
•
2.21k
•
17
nyu-visionx/vstat
Viewer
•
Updated
3 days ago
•
530
•
1.52k
•
3
kaushik-harsh-99/Code-Language-Classification
Viewer
•
Updated
7 days ago
•
1.66M
•
145
•
3
fresnellll/ChemCoTBench-V2
Viewer
•
Updated
3 days ago
•
5.62k
•
176
•
3
MBZUAI/UrduMMLU
Viewer
•
Updated
about 10 hours ago
•
26.4k
•
72
•
3
IRVLUTD/RPX
Viewer
•
Updated
about 13 hours ago
•
922
•
3.23k
•
2
InternScience/SFE
Viewer
•
Updated
Dec 24, 2025
•
1.66k
•
912
•
18
ColamentosZJU/Drive-P2D
Preview
•
Updated
1 day ago
•
33
•
2
etri-vilab/MultihopSpatial
Viewer
•
Updated
Mar 20
•
11.3k
•
1.82k
•
4
JianhuiWei/UniVBench
Viewer
•
Updated
10 days ago
•
1.04k
•
21.6k
•
5
skylenage-ai/QwenClawBench
Viewer
•
Updated
Apr 10
•
100
•
221
•
12
llamaindex/ParseBench
Benchmark
•
Updated
Apr 19
•
169k
•
53.9k
•
90
tencent/VulnGym
Viewer
•
Updated
4 days ago
•
592
•
541
•
5
firm-review/FIRM
Viewer
•
Updated
18 days ago
•
678
•
1.38k
•
3
agents-last-exam/agents-last-exam
Viewer
•
Updated
1 day ago
•
149
•
31
•
2
xlangai/RoboFine-bench
Viewer
•
Updated
1 day ago
•
1k
•
1.7k
•
3
RogoAI/big-finance-benchmark
Viewer
•
Updated
3 days ago
•
50
•
263
•
2
jhying/OpenSkillEval
Viewer
•
Updated
5 days ago
•
677
•
2.74k
•
2
RoboStressBench/RoboStressBench-Dataset
Updated
4 days ago
•
1.07k
•
3
Previous
1
2
3
...
100
Next