Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mjbommar
/
ogbert-tokenizer-16384
like
0
Transformers
English
tokenizer
bpe
ogbert
modernbert
opengloss
arxiv:
2511.18622
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
ogbert-tokenizer-16384
1.1 MB
1 contributor
History:
7 commits
mjbommar
Upload OGBERT tokenizer (vocab_size=16384)
f87e9d1
verified
29 days ago
.gitattributes
1.52 kB
initial commit
about 1 month ago
README.md
1.06 kB
Upload OGBERT tokenizer (vocab_size=16384)
about 1 month ago
special_tokens_map.json
188 Bytes
Add special_tokens_map
about 1 month ago
tokenizer.json
1.1 MB
Upload OGBERT tokenizer (vocab_size=16384)
29 days ago
tokenizer_config.json
422 Bytes
Upload OGBERT tokenizer (vocab_size=16384)
about 1 month ago