@maurelian.eth
Learned about ARC-AGI-3 the other day, but only just tried it out. It's somewhat fun: https://arcprize.org/tasks/ls20.
From what I recall, LLMs score very poorly on this compared to humans because the goals are never spelled out, so success means intuiting the goal.
Honestly, I prefer my LLMs not to have an innate motivation, and this feels like a dangerous thing to optimize for.