Dongrui Liu's picture

Dongrui Liu

shenqiorient

·

https://shenqildr.github.io/

AI & ML interests

Trustworthy AI

Recent Activity

upvoted a paper 19 days ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

upvoted a paper 26 days ago

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

liked a model about 1 month ago

InternScience/StructTable-InternVL2-1B

View all activity

Organizations

upvoted a paper 19 days ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published 20 days ago • 148

upvoted a paper 26 days ago

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Paper • 2504.15585 • Published Apr 22, 2025 • 14

upvoted 9 papers about 1 month ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30, 2025 • 90

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27, 2025 • 43

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28, 2025 • 85

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Paper • 2507.16534 • Published Jul 22, 2025 • 9

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Paper • 2602.14457 • Published Feb 16 • 29

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 244

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 261

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 349

A Trajectory-Based Safety Audit of Clawdbot (OpenClaw)

Paper • 2602.14364 • Published Feb 16 • 24

upvoted 2 papers about 2 months ago

DeepSight: An All-in-One LM Safety Toolkit

Paper • 2602.12092 • Published Feb 12 • 16

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Paper • 2602.08990 • Published Feb 9 • 78

upvoted a paper 2 months ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

upvoted a collection 2 months ago

AgentDoG

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 9 items • Updated 12 days ago • 107

upvoted 3 papers 6 months ago

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15, 2025 • 64

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9, 2025 • 22

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Paper • 2509.26354 • Published Sep 30, 2025 • 18

upvoted a paper 10 months ago

Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning

Paper • 2506.02867 • Published Jun 3, 2025 • 2

upvoted a paper about 1 year ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22, 2025 • 61