7 88 44

Denis Akhiyarov

dtanow

AI & ML interests

AI Code Generation with LLMs

Recent Activity

upvoted a paper 4 days ago

Therefore I am. I Think

submitted a paper 4 days ago

Therefore I am. I Think

upvoted a paper 6 days ago

Terminal Agents Suffice for Enterprise Automation

View all activity

Organizations

upvoted a paper 4 days ago

Therefore I am. I Think

Paper • 2604.01202 • Published 6 days ago • 28

submitted a paper to Daily Papers 4 days ago

Therefore I am. I Think

Paper • 2604.01202 • Published 6 days ago • 28

upvoted a paper 6 days ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published 7 days ago • 87

upvoted a paper 12 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 13 days ago • 94

liked a dataset 14 days ago

ServiceNow-AI/eva

Viewer • Updated 14 days ago • 50 • 5.48k • 68

upvoted an article 14 days ago

Article

A New Framework for Evaluating Voice Agents (EVA)

15 days ago

•

upvoted a paper 15 days ago

Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

Paper • 2603.08462 • Published 29 days ago • 21

upvoted a paper 20 days ago

Attention Residuals

Paper • 2603.15031 • Published 22 days ago • 176

commented a paper 20 days ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published 23 days ago • 416 •

upvoted 3 papers 21 days ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 28 days ago • 149

In-Context Reinforcement Learning for Tool Use in Large Language Models

Paper • 2603.08068 • Published 30 days ago • 42

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published 26 days ago • 64

liked a dataset 21 days ago

ServiceNow-AI/EnterpriseOps-Gym

Viewer • Updated 17 days ago • 2.56k • 5.73k • 86

upvoted a paper 21 days ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published 25 days ago • 147

upvoted a paper 25 days ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 179

New activity in MiniMaxAI/VIBE about 1 month ago

split by language and more categories

#9 opened about 1 month ago by

dtanow

upvoted a paper 2 months ago

Privileged Information Distillation for Language Models

Paper • 2602.04942 • Published Feb 4 • 26

liked a model 3 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.7M • • 4.66k

liked a dataset 3 months ago

MiniMaxAI/VIBE

Viewer • Updated Dec 23, 2025 • 200 • 314 • 274

upvoted an article 4 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Dec 9, 2025

•

Denis Akhiyarov

AI & ML interests

Recent Activity

Organizations

dtanow's activity

A New Framework for Evaluating Voice Agents (EVA)

split by language and more categories

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance