
Zhenrui Yue

yueeeeeeee2837

https://yueeeeeeee.github.io/

AI & ML interests

NLP, RecSys & Data Mining

Recent Activity


liked a model 12 days ago

deepseek-ai/DeepSeek-Math-V2

Text Generation • 685B • Updated 12 days ago • 9.83k • 644

upvoted a paper 2 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1 • 58

liked a model 4 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 8.18M • 4.04k

authored a paper 7 months ago

Hybrid Latent Reasoning via Reinforcement Learning

Paper • 2505.18454 • Published May 24 • 6

upvoted a paper 7 months ago

Hybrid Latent Reasoning via Reinforcement Learning

Paper • 2505.18454 • Published May 24 • 6

commented on a paper 7 months ago

Hybrid Latent Reasoning via Reinforcement Learning

Paper • 2505.18454 • Published May 24 • 6

upvoted a paper 7 months ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36

liked a model 7 months ago

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30 • 300k • 311

liked a model 8 months ago

unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth

Any-to-Any • 109B • Updated Apr 12 • 42 • 17

liked a model 9 months ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 143k • 1.83k

authored a paper 9 months ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36

liked 2 models 10 months ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27 • 757k • 4k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 1.21M • 12.9k

liked 2 datasets 10 months ago

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated Jan 31 • 16.7k • 7.12k • 334

open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31 • 228k • 110k • 774

upvoted an article 10 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 887

liked a dataset 11 months ago

BrightData/IMDb-Media

Viewer • Updated Jun 20, 2024 • 249k • 172 • 8

liked a Space 11 months ago

📈 Scaling test-time compute

Space • 587 • Implement test-time compute scaling for math problems

liked a model about 1 year ago

1bitLLM/bitnet_b1_58-xl

Text Generation • 1B • Updated Mar 29, 2024 • 51 • 37

liked a dataset about 1 year ago

McAuley-Lab/Amazon-Reviews-2023

Updated Dec 8, 2024 • 55.1k • 240
