Lubomir Konstantinov (lkonstantinov)

AI & ML interests: None yet
Organizations: None yet

Collections
agents

reasoning
- OpenThoughts: Data Recipes for Reasoning Models
  Paper • 2506.04178 • Published • 50
- NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
  Paper • 2504.13941 • Published • 11
- Retrieval-augmented reasoning with lean language models
  Paper • 2508.11386 • Published • 5
- Language Models that Think, Chat Better
  Paper • 2509.20357 • Published • 1
training
- How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
  Paper • 2509.19371 • Published
- Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
  Paper • 2505.06708 • Published • 10
- Selective Attention: Enhancing Transformer through Principled Context Control
  Paper • 2411.12892 • Published
- A Survey of Reinforcement Learning for Large Reasoning Models
  Paper • 2509.08827 • Published • 190
diffusion

mixed precision