The dream team has organized the TWO parties of NeurIPS '25. Yacht secured. Hosts: @swyx (@latentspacepod), @jordanschnyc (ChinaTalk), @dylan522p (@SemiAnalysis_), and me (@interconnectsai). Sponsored by @DecibelVC, @LambdaAPI, and @outshiftbycisco. Get your life jackets and GPU purses ready. https://t.co/mALpg06kXO
- 0 replies
- 0 recasts
- 0 reactions
I finally got around to making a tool to compare completions from SFT vs. RLHF trained models. This is a mini site for the RLHF book that I've wanted for a while: rlhfbook dot com slash library. It's always been hard to say what RLHF does to a model within a more complex https://t.co/7w6ef5AcqJ
- 0 replies
- 0 recasts
- 0 reactions
Just signed a book deal for The RLHF Book, excited to make improvements to it this fall and get physical copies in your hands soon :) (rlhfbook dot com)
- 0 replies
- 0 recasts
- 0 reactions