TokenRouter: Efficient Serving System for Token-Level LLM Routing
Papers
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
models 16
nics-efc/CoLLM_Qwen3_0_6B
0.8B • Updated • 16
nics-efc/CITER_Qwen3_0_6B_Qwen3_32B
nics-efc/VPR-Tic-Tac-Toe
Text Generation • 4B • Updated • 19
nics-efc/VPR-Sudoku
Text Generation • 4B • Updated • 16
nics-efc/VPR-Minesweeper
Text Generation • 4B • Updated • 17
nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B
Text Generation • 4B • Updated • 5
nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B
Text Generation • 4B • Updated • 63 • 1
nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B
Text Generation • 4B • Updated • 25
nics-efc/MARSHAL-Generalist-Qwen3-8B
Text Generation • 8B • Updated • 14
nics-efc/MARSHAL-Generalist-Qwen3-4B
Text Generation • 4B • Updated • 14
datasets 8
nics-efc/R2R_Router_Training_Qwen3-0.6B_Qwen3-30B-A3B
Viewer • Updated • 9.3M • 1.51k
nics-efc/R2R_Router_Training_Qwen3-4B_Qwen3-32B
Viewer • Updated • 18.3M • 1.35k
nics-efc/R2R_Router_Training_Qwen3-1.7B_Qwen3-8B
Viewer • Updated • 21.9M • 830
nics-efc/R2R_Router_Training_Qwen3-0.6B_Qwen3-8B
Viewer • Updated • 22.2M • 838
nics-efc/R2R_query
Viewer • Updated • 2.93k • 56
nics-efc/R2R_Router_Training
Viewer • Updated • 8.19M • 484 • 4
nics-efc/MoA_Long_HumanQA
Viewer • Updated • 3.5k • 129 • 4
nics-efc/MoA_Long_Retrieval
Viewer • Updated • 4.4k • 52 • 4