zhixuan-lin/delta_net-760m-longcrawl64-48b
Tags: Text Generation, Transformers, Safetensors, delta_net-project_fox, long-context, forgetting-attention, deltanet
arXiv: 2503.02130, 2406.06484
License: mit
Branch: main, 3.34 GB, 2 contributors, History: 7 commits
Latest commit: "Update README.md" by zhixuan-lin (f9eb9c5, verified), 4 months ago
File                      Scan   Size            Last commit                   Updated
.gitattributes            Safe   1.52 kB         initial commit                9 months ago
README.md                 Safe   5.08 kB         Update README.md              4 months ago
config.json               Safe   891 Bytes       Upload DeltaNetForCausalLM    9 months ago
generation_config.json    Safe   69 Bytes        Upload DeltaNetForCausalLM    9 months ago
merges.txt                Safe   456 kB          Upload tokenizer              9 months ago
model.safetensors         Safe   3.34 GB (xet)   Upload DeltaNetForCausalLM    9 months ago
special_tokens_map.json   Safe   438 Bytes       Upload tokenizer              9 months ago
tokenizer_config.json     Safe   519 Bytes       Upload tokenizer              9 months ago
vocab.json                Safe   999 kB          Upload tokenizer              9 months ago