Shaw pfp
Shaw
@shawmakesmagic
Wait so RL is just endlessly tweaking hyperparameters? I can reason out the math and theory of the system. But why 0.6 converges and 0.1 does not, baffling
6 replies
2 recasts
37 reactions

@BestCryptoTwits pfp
@BestCryptoTwits
@bestcryptotwits
RL Grime?
0 reply
0 recast
0 reaction