runtime error

Exit code: 1. Reason: 58.9MB/s] model-00002-of-00002.safetensors: 17%|█▋ | 384M/2.20G [00:02<00:09, 194MB/s]  model-00002-of-00002.safetensors: 32%|███▏ | 699M/2.20G [00:03<00:06, 240MB/s] model-00002-of-00002.safetensors: 50%|█████ | 1.11G/2.20G [00:04<00:03, 300MB/s] model-00002-of-00002.safetensors: 79%|███████▊ | 1.73G/2.20G [00:05<00:01, 402MB/s] model-00002-of-00002.safetensors: 100%|██████████| 2.20G/2.20G [00:05<00:00, 368MB/s] Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s] Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 29228.60it/s] generation_config.json: 0%| | 0.00/242 [00:00<?, ?B/s] generation_config.json: 100%|██████████| 242/242 [00:00<00:00, 1.52MB/s] Traceback (most recent call last): File "/app/app.py", line 59, in <module> from f5_tts.infer.infer_gradio import app as f5_app File "/usr/local/lib/python3.10/site-packages/f5_tts/infer/infer_gradio.py", line 763, in <module> load_chat_model(chat_model_name_list[0]) File "/usr/local/lib/python3.10/site-packages/f5_tts/infer/infer_gradio.py", line 756, in load_chat_model chat_model_state = AutoModelForCausalLM.from_pretrained(chat_model_name, torch_dtype="auto", device_map="auto") File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained return model_class.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 277, in _wrapper return func(*args, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5140, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.

Container logs:

Fetching error logs...