【[46星]Humanity'sLastExam:旨在评估人类知识前沿的多模

爱生活爱珂珂 2025-01-24 14:28:42

【[46星]Humanity's Last Exam:旨在评估人类知识前沿的多模态基准,包含 3000 个跨越多种学科的问题,旨在成为最后一项封闭式的学术基准】Humanity's Last Exam (HLE) is a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. GitHub: github.com/centerforaisafety/hle

0 阅读:0
爱生活爱珂珂

爱生活爱珂珂

感谢大家的关注