dooho lee's picture

233 18

dooho lee

BlueYellowGreen

·

https://leedooho.com

BlueYellowGreen

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago

Action100M: A Large-scale Video Action Dataset

upvoted a paper about 3 hours ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a paper about 3 hours ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

View all activity

Organizations

None yet

upvoted 5 papers about 3 hours ago

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published 4 days ago • 21

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 5 days ago • 76

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published 4 days ago • 23

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 5 days ago • 28

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 5 days ago • 168

upvoted 6 papers 4 days ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published 26 days ago • 34

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 27 days ago • 34

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published 17 days ago • 53

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published 10 days ago • 39

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 11 days ago • 45

User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

Paper • 2601.08225 • Published 6 days ago • 48

upvoted 3 papers 7 days ago

AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published 11 days ago • 26

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Paper • 2601.05175 • Published 11 days ago • 32

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published 11 days ago • 39

upvoted a paper 10 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 11 days ago • 194

upvoted a paper 11 days ago

Recursive Language Models

Paper • 2512.24601 • Published 19 days ago • 68

upvoted a paper 13 days ago

K-EXAONE Technical Report

Paper • 2601.01739 • Published 14 days ago • 83

upvoted 3 papers 14 days ago

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Paper • 2512.17220 • Published Dec 19, 2025 • 111

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published 21 days ago • 94

End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published 21 days ago • 20