francesmax

@francesmax

Transformer architectures process a context by attending to its relevant parts, with the computations running in parallel.
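
A rough sketch of what "attending to relevant parts" can look like, assuming standard scaled dot-product attention (the post doesn't name a specific variant). The function names `softmax` and `attention` and the toy vectors are made up for illustration. It's pure Python for readability; real implementations do the same thing as batched matrix multiplies, which is where the parallelism comes from.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors.

    Each query is handled independently of the others, so the outer loop
    could run in parallel; it is written as a loop here only for clarity.
    """
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Output is the weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights


# Toy context of three 2-d token vectors (hypothetical values).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = attention(tokens, tokens, tokens)
```

Each row of `weights` says how much one token attends to every token in the context, and the matching row of `outputs` is the resulting blend of value vectors.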