Shaw pfp
Shaw
@shawmakesmagic
Wait so RL is just endlessly tweaking hyperparameters? I can reason out the math and theory of the system. But why 0.6 converges and 0.1 does not, baffling
6 replies
2 recasts
37 reactions

WOO🎩 pfp
WOO🎩
@woo-x
Yup, welcome to RL. It’s math on paper, but vibes and hacks in practice.
1 reply
0 recast
2 reactions

Shaw pfp
Shaw
@shawmakesmagic
Banging my head against the wall trying to get a function to output values that seem reasonable is my jam tho
0 reply
0 recast
1 reaction