Content pfp
Content
@
https://warpcast.com/~/channel/aichannel
0 reply
0 recast
0 reaction

shoni.eth pfp
shoni.eth
@alexpaden
was about to start prompt evaling to max returns on small model classifiers but i'm just gonna ship with less right (left) classifications instead. i think a simple eval framework/dash is an easy github nextjs project though. i just want to see how small changes effect results compared to manually labeled ones on a variety of small models i.e. this prompt is 48% right on gemma3:1b this one is 64% right on gemma3:1b this one is 87% right on gemma3:1b:ft
0 reply
0 recast
3 reactions