Content
@
https://warpcast.com/~/channel/sense
0 reply
0 recast
0 reaction
纯银小西
@qwee
1/10 Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%.
0 reply
0 recast
0 reaction