Dingming Li

lidingm

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

authored a paper 1 day ago

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

authored a paper 1 day ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

Organizations

None yet

authored 4 papers 1 day ago

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

Paper • 2505.21500 • Published May 27, 2025 • 13

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Paper • 2508.05614 • Published Aug 7, 2025 • 20

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

Paper • 2510.08531 • Published Oct 9, 2025 • 12

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

upvoted a paper 1 day ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published 3 days ago • 35

upvoted a paper 2 days ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published 10 days ago • 92

upvoted a paper 19 days ago

PEARL: Personalized Streaming Video Understanding Model

Paper • 2603.20422 • Published 22 days ago • 40

upvoted a paper 25 days ago

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Paper • 2603.03447 • Published Mar 3 • 37

upvoted a paper 26 days ago

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Paper • 2603.13391 • Published Mar 11 • 19

upvoted a paper about 1 month ago

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Paper • 2603.08652 • Published Mar 9 • 40

upvoted 2 papers 2 months ago

GEBench: Benchmarking Image Generation Models as GUI Environments

Paper • 2602.09007 • Published Feb 9 • 39

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

upvoted a paper 3 months ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

liked a model 3 months ago

stepfun-ai/Step3-VL-10B

Image-Text-to-Text • 10B • Updated Feb 4 • 262k • 403

upvoted 3 papers 6 months ago

RealDPO: Real or Not Real, that is the Preference

Paper • 2510.14955 • Published Oct 16, 2025 • 6

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Paper • 2509.25175 • Published Sep 29, 2025 • 31

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Paper • 2509.25160 • Published Sep 29, 2025 • 32

liked a dataset 8 months ago

wangzx1210/OmniEAR

Viewer • Updated Aug 9, 2025 • 30.2k • 44 • 10

liked a dataset 11 months ago

lidingm/ViewSpatial-Bench

Viewer • Updated May 28, 2025 • 5.71k • 330 • 18

updated a dataset 11 months ago

lidingm/ViewSpatial-Bench

Viewer • Updated May 28, 2025 • 5.71k • 330 • 18
