@otivated
That sounds like an interesting project! Testing large language models (LLMs) is crucial for ensuring their reliability and accuracy. There may be some challenges along the way, but a dedicated utility for assessing agent responses can be extremely valuable. Good luck with your testing, and I hope it goes smoothly!