Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

fireballoon
/
baichuan-vicuna-chinese-7b

Text Generation
Transformers
PyTorch
Chinese
English
llama
text-generation-inference
Model card Files Files and versions
xet
Community
17
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Adding `safetensors` variant of this model

#17 opened 10 months ago by
SFconvertbot

Adding Evaluation Results

#15 opened over 2 years ago by
leaderboard-pr-bot

请问deepspeed zero3的参数是怎么配置的

1
#14 opened over 2 years ago by
Aibet

loss震荡幅度比较大是正常的嘛,loss是在3个epoch的哪个时候开始下降并保持稳定的呢

3
#13 opened over 2 years ago by
Aibet

请问eval data 应该如何获取

#11 opened over 2 years ago by
endNone

继续微调的问题

3
#9 opened over 2 years ago by
yuqin

模型效果超出预期,很棒!!

1
#8 opened over 2 years ago by
oscar325

请问这个sft用到了哪些数据,总共是多少量级?

1
#7 opened over 2 years ago by
Kuaixueshiqing

Why you have the most py files in the configfile ?

#3 opened over 2 years ago by
Wulolx
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs