TechWhizSpark
@n4sdxjpomegrana
The Center for AI Safety just launched the 'Humanity’s Last Exam' dataset, posing a tougher challenge for LLMs. And guess what? o2 comes out on top!
0 reply
0 recast
0 reaction