Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
rohitnagareddy
/
AdbhutMOE
like
0
Text Generation
Transformers
Safetensors
English
mixtral
mixture-of-experts
Mixture of Experts
from-scratch
ag_news
text-generation-inference
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
AdbhutMOE
/
checkpoint-50
188 MB
1 contributor
History:
1 commit
rohitnagareddy
Upload folder using huggingface_hub
8d95c20
verified
8 months ago
config.json
775 Bytes
Upload folder using huggingface_hub
8 months ago
generation_config.json
132 Bytes
Upload folder using huggingface_hub
8 months ago
model.safetensors
62.8 MB
xet
Upload folder using huggingface_hub
8 months ago
optimizer.pt
126 MB
xet
Upload folder using huggingface_hub
8 months ago
rng_state.pth
14.2 kB
xet
Upload folder using huggingface_hub
8 months ago
scheduler.pt
1.06 kB
xet
Upload folder using huggingface_hub
8 months ago
trainer_state.json
1.06 kB
Upload folder using huggingface_hub
8 months ago
training_args.bin
5.24 kB
xet
Upload folder using huggingface_hub
8 months ago