Avoiding RuntimeError when loading in 4-bit / 8-bit
#28 opened by mumeranwaar
Convert pixel_values to a HalfTensor (float16), since the model's weights are in half precision when it is loaded in 4-bit / 8-bit:
model_inputs["pixel_values"] = model_inputs["pixel_values"].half()
Without this cast, the forward pass fails with:
RuntimeError: Input type (torch.cuda.FloatTensor) and weight type (torch.cuda.HalfTensor) should be the same
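The same cast can be written so that it covers every floating-point entry in the inputs rather than only pixel_values. The sketch below is an illustration, not part of this change: match_input_dtype is a hypothetical helper name, and it assumes model_inputs behaves like a dict of tensors that expose is_floating_point() and to(), as torch tensors do. Integer tensors such as input_ids are left untouched, since casting them to half precision would break the embedding lookup.

```python
def match_input_dtype(model_inputs, dtype):
    """Return a new dict with floating-point tensors cast to `dtype`.

    Hypothetical helper for illustration. Values that are not
    floating-point tensors (integer tensors, strings, ...) are
    passed through unchanged.
    """
    converted = {}
    for name, value in model_inputs.items():
        is_float = getattr(value, "is_floating_point", None)
        if callable(is_float) and is_float():
            # e.g. pixel_values: float32 -> float16
            converted[name] = value.to(dtype)
        else:
            # e.g. input_ids / attention_mask stay as integers
            converted[name] = value
    return converted


# Usage, assuming torch is imported and the model weights are float16:
#   model_inputs = match_input_dtype(model_inputs, torch.float16)
```

If the model is loaded with a different compute dtype (for example bfloat16), the dtype argument would need to match that instead of torch.float16.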