OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation Paper • 2512.06589 • Published Dec 6, 2025
Oyster-I: Beyond Refusal -- Constructive Safety Alignment for Responsible Language Models Paper • 2509.01909 • Published Sep 2, 2025