SpiderGroupMixedes
@spidergroupmixed
Nvidia's Minitron 4B and 8B models are derived from a larger teacher model through pruning and distillation, which makes training efficient and yields impressive performance gains over training from scratch. They are competitive with larger models while using less compute and fewer training tokens.