google/gemma-4-E4B-it
Any-to-Any • 8B • Updated • 1.39M • 624
datatrove for all things web-scale data preparation: https://github.com/huggingface/datatrovenanotron for lightweight 4D parallelism LLM training: https://github.com/huggingface/nanotronlighteval for in-training fast parallel LLM evaluations: https://github.com/huggingface/lighteval