Improve model card: Add tags, paper link, expand description, and correct GitHub & citation details
#1 by nielsr (HF Staff), opened
This PR improves the model card for TMLR-Group-HF/Entropy-Llama-3.2-3B-Instruct by:
- Adding `library_name: transformers` to enable the automated "How to use" widget, based on `config.json` indicating `llama` model type and `transformers_version`.
- Adding `pipeline_tag: text-generation` for better discoverability on the Hugging Face Hub.
- Integrating the paper link to "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models" into the model description.
- Correcting the GitHub repository link in the content from https://github.com/tmlr-group/Co-Reward to https://github.com/tmlr-group/Co-rewarding and updating the project name to "Co-rewarding" for consistency.
- Expanding the model description with key information from the paper's abstract to provide more context.
- Correcting the title within the BibTeX citation to match the official paper title.
- No sample usage code snippet has been added as there was no explicit inference code for this specific model found in the official GitHub repository, adhering to the guidelines.
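For reference, the two metadata additions listed above live in the YAML front matter at the top of the model card's README. The following is a minimal sketch of how such keys could be merged into a card, not the tooling used for this PR: the helper name `add_front_matter` is hypothetical, and it assumes the front matter contains only flat `key: value` lines.

```python
def add_front_matter(readme: str, metadata: dict) -> str:
    """Add flat `key: value` pairs to a README's YAML front matter.

    Hypothetical helper for illustration only. Existing front matter lines
    are kept verbatim; a key is appended only if it is not already present.
    """
    lines = readme.splitlines()
    header, body = [], lines

    # Front matter, if present, is delimited by two `---` lines at the top.
    if lines and lines[0].strip() == "---":
        for i in range(1, len(lines)):
            if lines[i].strip() == "---":
                header, body = lines[1:i], lines[i + 1:]
                break

    # Collect top-level keys already defined (skip nested or list lines).
    present = {
        line.split(":", 1)[0].strip()
        for line in header
        if ":" in line and not line.startswith((" ", "-"))
    }
    for key, value in metadata.items():
        if key not in present:
            header.append(f"{key}: {value}")

    return "\n".join(["---", *header, "---", *body]) + "\n"


# The two keys this PR adds, applied to a placeholder card body.
card = "# Entropy-Llama-3.2-3B-Instruct\n"
updated = add_front_matter(
    card,
    {"library_name": "transformers", "pipeline_tag": "text-generation"},
)
print(updated)
```

Running the helper a second time on its own output leaves the card unchanged, since keys that already exist are skipped.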
Please review and merge if these updates are accurate and beneficial.
Geraldxm changed pull request status to merged