web3 livz
171 Followers
recast:farcaster://casts/0xcf2e1ed3fb89fd09ed983d860a0e3853155250585bd75d57f213ef2fbc7e1262
zk大佬又自由了
recast:farcaster://casts/0xcd162efb1e1dff1b2d27f72bede916511e83aac881dceb6463f8cdc87cac8364
llm.c training stabilized and achieves parity with PyTorch training (but faster) 💪 During the last few iters of training in my previous post, there were increases in loss. it was due to gradient norm clipping, and @karpathy fixed the bug🎉. With the latest llm.c code, GPT-2 (124M) achieved 35.3% ac