Padding on batch inference

#1
by cckm - opened

Thanks for the checkpoint!

Got a question on batched inference. The inference input is shaped [B, 80, seq_len]. If inputs within the same batch have different effective seq_len values, what do you expect us to pad the shorter inputs with?
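For context, here is a minimal sketch of what I mean, assuming zero-padding on the time axis (the `pad_batch` helper and the zero fill value are my own assumptions, not something from the checkpoint):

```python
import numpy as np

def pad_batch(features, pad_value=0.0):
    """Pad a list of [80, T_i] feature arrays into one [B, 80, max_T]
    batch, filling the trailing time steps with `pad_value`.
    Returns the batch plus the original lengths for masking."""
    max_len = max(f.shape[1] for f in features)
    batch = np.full((len(features), features[0].shape[0], max_len),
                    pad_value, dtype=features[0].dtype)
    for i, f in enumerate(features):
        batch[i, :, : f.shape[1]] = f  # copy real frames; rest stays padded
    lengths = np.array([f.shape[1] for f in features])
    return batch, lengths

# Two inputs with effective seq_len 5 and 3
a = np.random.randn(80, 5).astype(np.float32)
b = np.random.randn(80, 3).astype(np.float32)
batch, lengths = pad_batch([a, b])
print(batch.shape)  # (2, 80, 5)
print(lengths)      # [5 3]
```

Is zero the right fill value here, or does the model expect something else (e.g. a log-mel floor value), and should we pass a mask or the lengths so padded frames are ignored?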
