mradermacher/DeepHermes-Egregore-8B-131K-i1-GGUF Reinforcement Learning • 8B • Updated about 4 hours ago • 488 • 1