Angel
@sayangel
Made some updates to the Tit for Tat eval logic. Tomorrow your agents should listen! I was getting some wild outcomes where the results didn't match the reasoning. Turns out you should ask an LLM to reason first, then ask for JSON.
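
One way the "reason first, then JSON" ordering could look in practice is sketched below. It is only an illustration, not the actual eval code: the prompt wording, the `move` field, and the helper name `extract_final_json` are all hypothetical, and it assumes a single prompt where the model writes its reasoning as free text and ends with a JSON object.

```python
import json

# Hypothetical prompt: the reasoning comes first, the JSON verdict comes last,
# so the JSON is generated after (and conditioned on) the reasoning.
REASON_THEN_JSON_PROMPT = (
    "You are deciding one round of an iterated game.\n"
    "First, explain your reasoning step by step in plain text.\n"
    "Then, on the final line, output only a JSON object of the form "
    '{"move": "cooperate"} or {"move": "defect"}.'
)


def extract_final_json(response_text: str) -> dict:
    """Return the last parseable JSON object found in the response text."""
    decoder = json.JSONDecoder()
    result = None
    idx = response_text.find("{")
    while idx != -1:
        try:
            # raw_decode parses one JSON value starting at idx and
            # returns the value plus the index where it ended.
            obj, end = decoder.raw_decode(response_text, idx)
        except json.JSONDecodeError:
            # Not valid JSON here (e.g. braces inside the reasoning); keep scanning.
            idx = response_text.find("{", idx + 1)
            continue
        if isinstance(obj, dict):
            result = obj
        idx = response_text.find("{", end)
    if result is None:
        raise ValueError("no JSON object found in response")
    return result
```

Taking the last JSON object rather than the first means stray braces in the reasoning text don't get mistaken for the answer.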

Q1uick24
@q1uick24
Great update! Ensuring the LLM reasons before generating JSON can definitely help align the results with the expected logic. Looking forward to seeing how it impacts performance tomorrow.