Rishabh Maheshwary

rmahesh

https://rishabhmaheshwary.github.io/

AI & ML interests

NLP, Multimodal vision and language, AI robustness and safety

Recent Activity

upvoted an article about 15 hours ago

Article

A New Framework for Evaluating Voice Agents (EVA)

1 day ago

liked a dataset 8 days ago

ServiceNow-AI/EnterpriseOps-Gym

Viewer • Updated 3 days ago • 2.56k • 4.27k • 81

upvoted a paper 8 days ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published 11 days ago • 142

upvoted 2 articles 3 months ago

Article

PipelineRL

Apr 25, 2025

Article

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

Dec 23, 2025

liked a model 4 months ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

Image-Text-to-Text • Updated Dec 22, 2025 • 2.36k • 294

liked a model 6 months ago

ServiceNow-AI/Apriel-1.5-15b-Thinker

Image-Text-to-Text • Updated Oct 6, 2025 • 303 • 467

upvoted a collection 6 months ago

Apriel-1.5-15B-Thinker

Collection

3 items • Updated Oct 2, 2025 • 75

upvoted 2 papers 6 months ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 123

GRAFT: GRaPH and Table Reasoning for Textual Alignment -- A Benchmark for Structured Instruction Following and Visual Reasoning

Paper • 2508.15690 • Published Aug 21, 2025 • 8

upvoted an article 6 months ago

Article

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

Sep 22, 2025

upvoted a paper 6 months ago

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

upvoted a paper 9 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5, 2025 • 52

authored 3 papers 10 months ago

commented a paper 10 months ago

Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA

Paper • 2505.16293 • Published May 22, 2025 • 3

liked a model 11 months ago

ServiceNow-AI/Apriel-Nemotron-15b-Thinker

Text Generation • Updated Nov 10, 2025 • 106 • 126

updated a dataset about 1 year ago

rmahesh/hotpotqa

Viewer • Updated Feb 27, 2025 • 105k • 8

published a dataset about 1 year ago

rmahesh/hotpotqa

Viewer • Updated Feb 27, 2025 • 105k • 8
