HuggingFaceFW/fineweb-edu
Viewer • Updated • 3.5B • 616k • 1.09k
This is a modernized, "Grok-style" decoder-only Transformer (nanoGPT evolution) trained on the FineWeb-Edu dataset.
tiktoken (GPT-2).You can use this model directly with the GPT class from this repository.
from model import GPT
import tiktoken
import torch
model = GPT.from_pretrained("geoffsee/auto-g-nano-2")
enc = tiktoken.get_encoding("gpt2")
# Generate text
prompt = "The future of AI is"
idx = torch.tensor(enc.encode(prompt)).unsqueeze(0)
completion = model.generate(idx, max_new_tokens=50)
print(enc.decode(completion[0].tolist()))
Trained on a sample of FineWeb-Edu.