diaomuxi

LeonDiao0427

AI & ML interests

LLM & MLLM

authored 6 papers 3 days ago

CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation

Paper • 2505.15145 • Published May 21, 2025 • 1

OJBench: A Competition Level Code Benchmark For Large Language Models

Paper • 2506.16395 • Published Jun 19, 2025 • 4

Towards Generalizable Forgery Detection and Reasoning

Paper • 2503.21210 • Published Mar 27, 2025

DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving

Paper • 2505.20665 • Published May 27, 2025 • 2

MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

Paper • 2508.08177 • Published Aug 11, 2025

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 5 days ago • 85

authored 5 papers about 1 year ago

DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning

Paper • 2402.09136 • Published Feb 14, 2024 • 1

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Paper • 2406.08587 • Published Jun 12, 2024 • 16

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?

Paper • 2407.01284 • Published Jul 1, 2024 • 81

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5, 2024 • 35

SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models

Paper • 2408.02632 • Published Aug 5, 2024 • 1