Adding `safetensors` variant of this model
#2 opened over 1 year ago
by
SFconvertbot
Mismatch in attention weights for causal masked tokens vs attention masked tokens
#1 opened almost 2 years ago
by
LakshyAAAgrawal