HLE (Humanity's Last Exam)

About This Benchmark

Designed to test the frontier of human knowledge, HLE consists of extremely difficult questions spanning 50+ domains, including math, science, and engineering. The score is accuracy (%): the percentage of questions answered correctly.
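
As a rough illustration of the accuracy metric, the sketch below computes the percentage of correct answers from a list of graded results. The function name `accuracy_percent` and the sample data are hypothetical, and how answers are actually graded for this benchmark is not covered here.

```python
def accuracy_percent(graded):
    """Return the share of correct answers as a percentage."""
    if not graded:
        raise ValueError("no graded answers")
    # True counts as 1 and False as 0, so sum() gives the number correct.
    return 100.0 * sum(graded) / len(graded)


# Hypothetical grading results: True means the answer was judged correct.
results = [True, False, False, True, False]
print(f"{accuracy_percent(results):.1f}%")  # 2 of 5 correct -> 40.0%
```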

Source: Artificial Analysis