runtime error
?B/s][A modeling_decilm.py: 100%|ββββββββββ| 14.5k/14.5k [00:00<00:00, 49.2MB/s] transformers_v4_35_2__modeling_llama.py: 0%| | 0.00/56.4k [00:00<?, ?B/s][A transformers_v4_35_2__modeling_llama.py: 100%|ββββββββββ| 56.4k/56.4k [00:00<00:00, 124MB/s] (β¦)ers_v4_35_2__modeling_attn_mask_utils.py: 0%| | 0.00/10.1k [00:00<?, ?B/s][A (β¦)ers_v4_35_2__modeling_attn_mask_utils.py: 100%|ββββββββββ| 10.1k/10.1k [00:00<00:00, 28.2MB/s] A new version of the following files was downloaded from https://huggingface.co./Deci/DeciCoder-6B: - transformers_v4_35_2__modeling_attn_mask_utils.py . Make sure to double-check they do not contain any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. A new version of the following files was downloaded from https://huggingface.co./Deci/DeciCoder-6B: - transformers_v4_35_2__modeling_llama.py - transformers_v4_35_2__modeling_attn_mask_utils.py . Make sure to double-check they do not contain any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. A new version of the following files was downloaded from https://huggingface.co./Deci/DeciCoder-6B: - modeling_decilm.py - transformers_v4_35_2__modeling_llama.py . Make sure to double-check they do not contain any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. Traceback (most recent call last): File "/home/user/app/app.py", line 14, in <module> model = AutoModelForCausalLM.from_pretrained(checkpoint, File "/home/user/.local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 561, in from_pretrained return model_class.from_pretrained( File "/home/user/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2897, in from_pretrained raise RuntimeError("No GPU found. A GPU is needed for quantization.") RuntimeError: No GPU found. A GPU is needed for quantization.
Container logs:
Fetching error logs...