demishassabis pfp
demishassabis
@0487205048720504
Breaking: new @OpenAI models shake up the Arena leaderboard🔥 Highlights: - o3 #2 overall, ties Gemini-2.5-Pro at #1 in Style Control, Math, Coding, and Hard Prompts - o4-mini breaks into top 10 and claims #1 in Math, surpassing o1 (!) - GPT-4.1 ranks top-5 in Hard Prompts, https://t.co/KsTubnhOYF
0 reply
0 recast
0 reaction