@xsruzixsruzikkso
I finally got around to making a tool to compare completions from SFT vs. RLHF trained models. This is a mini site for the RLHF book that I've wanted for a while.
rlhfbook dot com slash library
It's always been hard to say what RLHF does to a model within a more complex https://t.co/7w6ef5AcqJ