SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction
Paper • 2606.02540 • Published • 10
Natural language processing, language models, language agents
AgentCL: Toward Rigorous Evaluation of Continual Learning in Language Agents
SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction