Yan Yang's picture

2 6

Yan Yang PRO

HelloKKMe

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 1 day ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

updated a dataset about 2 months ago

HelloKKMe/h

View all activity

Organizations

upvoted 2 papers 1 day ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 4 days ago • 121

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 3 days ago • 69

upvoted a paper 5 months ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20, 2025 • 43

upvoted a paper 6 months ago

GTA1: GUI Test-time Scaling Agent

Paper • 2507.05791 • Published Jul 8, 2025 • 26

upvoted an article 7 months ago

Article

GRPO for GUI Grounding Done Right

Jun 11, 2025

•

36

upvoted a paper 10 months ago

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Paper • 2503.06885 • Published Mar 10, 2025 • 4