Li Dong

unilm

AI & ML interests

Language Model Pre-Training

Recent Activity

authored a paper 2 days ago

VIBEVOICE-ASR Technical Report

authored a paper 2 days ago

On-Policy Context Distillation for Language Models

authored a paper 2 days ago

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

View all activity

Organizations

upvoted 2 papers 2 days ago

On-Policy Context Distillation for Language Models

Paper • 2602.12275 • Published Feb 12 • 3

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 3 days ago • 46

upvoted an article 20 days ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

upvoted 2 papers about 2 months ago

VIBEVOICE-ASR Technical Report

Paper • 2601.18184 • Published Jan 26 • 23

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86

upvoted an article about 2 months ago

Article

Differential Transformer V2

Jan 20

•

upvoted 2 papers about 2 months ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published Jan 13 • 39

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Paper • 2510.23027 • Published Oct 27, 2025 • 2

upvoted a collection 3 months ago

VibeVoice Models

Collection

3 items • Updated Dec 6, 2025 • 6

upvoted a collection 4 months ago

GAD-Models

Collection

Model checkpoints of Black-Box On-Policy Distillation of Large Language Models • 5 items • Updated Nov 17, 2025 • 6

upvoted a paper 4 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 52

upvoted 9 papers 5 months ago

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30, 2025 • 29

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 64

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 117

Li Dong

AI & ML interests

Recent Activity

Organizations

unilm's activity

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Differential Transformer V2