Update README.md
Browse files
README.md
CHANGED
|
@@ -68,6 +68,15 @@ Results here are preliminary and reflect internal benchmarking on the same task
|
|
| 68 |
- GRPO (Rule base reward + self confidence reward)
|
| 69 |
- Evol Merging
|
| 70 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 71 |
### **License**
|
| 72 |
|
| 73 |
**MIT License** — free for research and non-commercial use with attribution.
|
|
|
|
| 68 |
- GRPO (Rule base reward + self confidence reward)
|
| 69 |
- Evol Merging
|
| 70 |
|
| 71 |
+
|
| 72 |
+
## Support me at:
|
| 73 |
+
|
| 74 |
+
<p align="center">
|
| 75 |
+
<a href="https://www.buymeacoffee.com/ductransa0g" target="_blank">
|
| 76 |
+
<img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" width="150px">
|
| 77 |
+
</a>
|
| 78 |
+
</p>
|
| 79 |
+
|
| 80 |
### **License**
|
| 81 |
|
| 82 |
**MIT License** — free for research and non-commercial use with attribution.
|