agusti
@bleu.eth
training your own model hits diff fr fr
agusti
@bleu.eth
i don't even know if these numbers are good or bad rn. 25M param model, gpt2 base
shoni.eth
@alexpaden
What are you fine-tuning on? 25M params I thought would be pretty slow on a single GPU