Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Shiyu's Lab

university

https://code-terminator.github.io/

AI & ML interests

None defined yet.

Shiyu-Lab 's collections 3

Shiyu-Lab/HarnessLLM_RL_Qwen3_4B

4B • Updated Oct 13, 2025 • 34
Shiyu-Lab/Inputoutput_RL_Qwen3_4B

4B • Updated Oct 29, 2025 • 42
Shiyu-Lab/HarnessLLM_RL_Llama3_3B

4B • Updated Oct 29, 2025 • 16
Shiyu-Lab/Inputoutput_RL_Llama3_3B

4B • Updated Oct 29, 2025 • 18

Prereq-Tune_Models

Trained models for the Prereq-Tune paper

Shiyu-Lab/Prereq-Tune_bio

Updated Jan 9, 2025
Shiyu-Lab/Prereq-Tune_popqa

Updated Jan 9, 2025
Shiyu-Lab/Prereq-Tune_hotpotqa

Updated Jan 9, 2025
Shiyu-Lab/Prereq-Tune_medical

Updated Jan 9, 2025

ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning

Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k

Text Generation • 2B • Updated Apr 8, 2025 • 12 • 1
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-2k

Text Generation • 2B • Updated Apr 8, 2025 • 7
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-4k

Text Generation • 2B • Updated Apr 8, 2025 • 5
Shiyu-Lab/QwQ-32B-thinkprune-4k

Text Generation • 33B • Updated Apr 8, 2025 • 1

Shiyu-Lab/HarnessLLM_RL_Qwen3_4B

4B • Updated Oct 13, 2025 • 34
Shiyu-Lab/Inputoutput_RL_Qwen3_4B

4B • Updated Oct 29, 2025 • 42
Shiyu-Lab/HarnessLLM_RL_Llama3_3B

4B • Updated Oct 29, 2025 • 16
Shiyu-Lab/Inputoutput_RL_Llama3_3B

4B • Updated Oct 29, 2025 • 18

ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning

Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k

Text Generation • 2B • Updated Apr 8, 2025 • 12 • 1
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-2k

Text Generation • 2B • Updated Apr 8, 2025 • 7
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-4k

Text Generation • 2B • Updated Apr 8, 2025 • 5
Shiyu-Lab/QwQ-32B-thinkprune-4k

Text Generation • 33B • Updated Apr 8, 2025 • 1

Prereq-Tune_Models

Trained models for the Prereq-Tune paper

Shiyu-Lab/Prereq-Tune_bio

Updated Jan 9, 2025
Shiyu-Lab/Prereq-Tune_popqa

Updated Jan 9, 2025
Shiyu-Lab/Prereq-Tune_hotpotqa

Updated Jan 9, 2025
Shiyu-Lab/Prereq-Tune_medical

Updated Jan 9, 2025

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs