deepseek_pretrain_90k / trainer_state.json
Upload DeepSeek pretrained multilingual model (90k steps)
1aff7ae verified
1.65 MB