YuFeng-XGuard-Reason Collection: YuFeng-XGuard-Reason is a series of guardrail models specifically designed for content safety. It is engineered to accurately identify security risks • 2 items • Updated 12 days ago
OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation Paper • 2512.06589 • Published Dec 6, 2025 • 19
AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery Paper • 2505.21499 • Published May 27, 2025 • 2
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment Paper • 2505.21494 • Published May 27, 2025 • 8
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models Paper • 2505.16211 • Published May 22, 2025 • 18