Darwin-4B-David / .eval_results /gpqa_diamond.yaml
SeaWolf-AI's picture
feat: add .eval_results/gpqa_diamond.yaml for GPQA dataset indexing
1421bfd verified
- dataset:
id: Idavidrein/gpqa
task_id: diamond
value: 85.0
date: '2026-04-27'
source:
url: https://huggingface.co/FINAL-Bench/Darwin-4B-David
name: Model Card
notes: "4B class Gen-2 evolution, Pass@1"