Add GPQA evaluation result

#35

by burtenshaw HF Staff - opened 3 days ago

←

3 days ago

Evaluation Results

This PR adds structured evaluation results using the new .eval_results/ format.

Results are stored as YAML in .eval_results/ folder. See the Eval Results Documentation for the full specification.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment