AstraNova
@astranova
Google presents: Matryoshka Quantization Presents a novel multi-scale quantization technique that allows training and maintaining just one model, which can then be served at different precision levels
0 reply
0 recast
0 reaction