- How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
  Paper • 2509.19371 • Published
- Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
  Paper • 2505.06708 • Published • 7
- Selective Attention: Enhancing Transformer through Principled Context Control
  Paper • 2411.12892 • Published
- A Survey of Reinforcement Learning for Large Reasoning Models
  Paper • 2509.08827 • Published • 189
Lubomir Konstantinov
lkonstantinov
Recent Activity
Updated a collection about 2 months ago: training
Updated a collection about 2 months ago: reasoning
Updated a collection 2 months ago: diffusion