Zeyu Qin's picture

48 38

Zeyu Qin

qqqzzzyyy

·

https://alan-qin.github.io/

Alan-Qin

AI & ML interests

Scalable Oversight, AI safety

Recent Activity

upvoted a paper about 1 month ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

upvoted a paper 2 months ago

MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline

upvoted a collection 2 months ago

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30 • 115

upvoted a paper 2 months ago

MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline

Paper • 2510.07307 • Published Oct 8 • 5

upvoted a collection 2 months ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 6 days ago • 19

updated a collection 2 months ago

hahah

3 items • Updated Sep 29 • 1

upvoted 2 papers 2 months ago

UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios

Paper • 2509.21766 • Published Sep 26 • 23

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26 • 70

updated a collection 3 months ago

hahah

3 items • Updated Sep 29 • 1

upvoted a collection 4 months ago

agent

219 items • Updated 3 days ago • 18

upvoted 2 papers 4 months ago

AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Paper • 2508.09889 • Published Aug 13 • 32

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Paper • 2508.07976 • Published Aug 11 • 51

upvoted a collection 4 months ago

DataMan

4 items • Updated Aug 8 • 2

upvoted a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 82

liked a model 5 months ago

euclaise/Memphis-CoT-3B

Text Generation • 3B • Updated Feb 4, 2024 • 48 • 30

liked a dataset 5 months ago

euclaise/TinyCoT

Viewer • Updated Jan 23, 2024 • 27.7k • 70 • 11

upvoted 2 papers 5 months ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 42

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 123

upvoted a collection 6 months ago

hahah

3 items • Updated Sep 29 • 1

upvoted 2 articles 7 months ago

Article

BigCodeBench: The Next Generation of HumanEval

+7

Jun 18, 2024

•

52

Article

Open R1: Update #3

Mar 11

•

296

updated a model 8 months ago

qqqzzzyyy/qwen2.5-1.5b-simple-rl-math3to5-adaptive_s4

Updated Apr 14 • 1