Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Nagi-ovo
's Collections
RL
Llama-3-8B-RLHF-Pipeline
RL
updated
10 days ago
Upvote
-
Nagi-ovo/alphazero-gomoku
Reinforcement Learning
•
Updated
Dec 13, 2024
•
1
Nagi-ovo/Qwen2.5-7B-Reasoning-Adapter
Text Generation
•
Updated
Feb 8, 2025
•
2
Nagi-ovo/Llama-3-8B-RM
Text Classification
•
8B
•
Updated
Jan 6, 2025
•
10
•
2
Nagi-ovo/Llama-3-8B-PPO
Text Generation
•
8B
•
Updated
Jan 21, 2025
•
9
Nagi-ovo/HOMIERL-loco
Robotics
•
Updated
10 days ago
Upvote
-
Share collection
View history
Collection guide
Browse collections