YAML Metadata Error: Invalid content in Eval Result file .eval_results/hle.yaml

Task ID "hle" does not match any task in dataset "cais/hle". Available: none
Add evaluation results for HLE, GPQA, AIME, HMMT, SWE-Bench, and Terminal-Bench (#4)
d9cb81b
186 Bytes
- dataset:
    id: cais/hle
    task_id: hle
  value: 34.7
  date: '2026-04-20'
  source:
    url: https://huggingface.co/moonshotai/Kimi-K2.6
    name: Model Card
    user: SaylorTwift