runtime error

Exit code: 1. Reason: 1.00GB/s] Downloading shards: 100%|██████████| 8/8 [00:50<00:00, 5.98s/it] Downloading shards: 100%|██████████| 8/8 [00:50<00:00, 6.28s/it] Loading checkpoint shards: 0%| | 0/8 [00:00<?, ?it/s] Loading checkpoint shards: 12%|█▎ | 1/8 [00:07<00:54, 7.75s/it] Loading checkpoint shards: 25%|██▌ | 2/8 [00:21<01:08, 11.46s/it] Loading checkpoint shards: 38%|███▊ | 3/8 [00:33<00:57, 11.48s/it] Loading checkpoint shards: 50%|█████ | 4/8 [00:46<00:48, 12.06s/it] Loading checkpoint shards: 62%|██████▎ | 5/8 [01:03<00:41, 13.76s/it] Loading checkpoint shards: 75%|███████▌ | 6/8 [01:17<00:28, 14.06s/it] Loading checkpoint shards: 88%|████████▊ | 7/8 [01:36<00:15, 15.54s/it] Loading checkpoint shards: 100%|██████████| 8/8 [01:46<00:00, 13.76s/it] Loading checkpoint shards: 100%|██████████| 8/8 [01:46<00:00, 13.28s/it] generation_config.json: 0%| | 0.00/181 [00:00<?, ?B/s] generation_config.json: 100%|██████████| 181/181 [00:00<00:00, 1.27MB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 51, in <module> pipe = pipeline("text-generation", model=model, tokenizer=tokenizer, device="cuda") File "/usr/local/lib/python3.10/site-packages/transformers/pipelines/__init__.py", line 1178, in pipeline return pipeline_class(model=model, framework=framework, task=task, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/pipelines/text_generation.py", line 96, in __init__ super().__init__(*args, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/pipelines/base.py", line 874, in __init__ raise ValueError( ValueError: The model has been loaded with `accelerate` and therefore cannot be moved to a specific device. Please discard the `device` argument when creating your pipeline object.

Container logs:

Fetching error logs...