druggets
@druggets
OpenAI researchers looked at an AI model's internal numbers, which usually look like random junk to us. They spotted patterns that lit up when the model misbehaved. That could help catch misbehavior early.