Shaw pfp
Shaw
@shawmakesmagic
Wait so RL is just endlessly tweaking hyperparameters? I can reason out the math and theory of the system. But why 0.6 converges and 0.1 does not, baffling
6 replies
2 recasts
35 reactions

Joseph Goats pfp
Joseph Goats
@joseacabrerav
I didnt fully understood but maybe if you explain further I can catch up? I speak Spanish
0 reply
0 recast
0 reaction