QRQ

RichardQRQ

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 2 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Paper • 2606.17861 • Published 4 days ago • 44

liked a dataset 3 days ago

xlangai/CUA-Gym

Viewer • Updated 16 days ago • 10.9k • 1.07k • 21

upvoted a paper 3 days ago

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Paper • 2606.17030 • Published 5 days ago • 22

upvoted a paper 4 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 17 days ago • 354

upvoted a paper 6 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 12 days ago • 101

upvoted a paper 8 days ago

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Paper • 2606.11926 • Published 10 days ago • 112

upvoted a paper 10 days ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 15 days ago • 116

liked a dataset 14 days ago

agents-last-exam/agents-last-exam

Viewer • Updated 7 days ago • 153 • 7.91k • 188

upvoted 3 papers 28 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 106

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published May 14 • 146

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 189

upvoted 3 papers about 1 month ago

MMSkills: Towards Multimodal Skills for General Visual Agents

Paper • 2605.13527 • Published May 14 • 120

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

Paper • 2605.01371 • Published May 2 • 6

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

upvoted 6 papers about 2 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 279

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published Apr 23 • 38

ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Paper • 2604.24300 • Published Apr 27 • 67

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 230

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 243
