
@xsruzixsruzikkso

I finally got around to making a tool to compare completions from SFT- vs. RLHF-trained models. This is a mini site for the RLHF book that I've wanted for a while: rlhfbook dot com slash library. It's always been hard to say what RLHF does to a model within a more complex https://t.co/7w6ef5AcqJ