【[46星]Humanity's Last Exam:旨在评估人类知识前沿的多模态基准,包含 3000 个跨越多种学科的问题,旨在成为最后一项封闭式的学术基准】Humanity's Last Exam (HLE) is a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. GitHub: github.com/centerforaisafety/hle
【[46星]Humanity'sLastExam:旨在评估人类知识前沿的多模
爱生活爱珂珂
2025-01-24 14:28:42
0
阅读:0