arxiv:2510.15862
Jiuqi Wang
LeonardoWjq
AI & ML interests
reinforcement learning
Recent Activity
authored a paper about 1 month ago
PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold
new activity 12 months ago
stable-diffusion-v1-5/stable-diffusion-v1-5: Update README.md