Is this model meant for full bfloat16, AMP bfloat16 or no bfloat16?

#7
by umarbutler - opened

The paper does not make it clear.

Sign up or log in to comment