Spaces:
Running
Running
File size: 595 Bytes
9d80902 7ba132e c12507e 7ba132e c0830e9 7cbad8d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
---
title: README
emoji: 🏢
colorFrom: red
colorTo: indigo
sdk: static
pinned: false
---
This repo contains all the models for paper -
Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
https://arxiv.org/abs/2505.11595
Please cite
```bibtex
@inproceedings{chen2025spectral,
title = {Spectral Policy Optimization: Coloring your Incorrect Reasoning in {GRPO}},
author = {Peter Chen and Xiaopeng Li and Ziniu Li and Xi Chen and Tianyi Lin},
booktitle = {2nd AI for Math Workshop @ ICML 2025},
year = {2025},
url = {https://openreview.net/forum?id=IIBDElbi7s}
} |