Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published Feb 15 • 1
Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published Feb 15 • 1
Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges Paper • 2602.13576 • Published Feb 14 • 2
Rubrics as an Attack Surface (RIPD) Collection This collection releases the official artifacts accompanying “Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges.” • 10 items • Updated 18 days ago • 1
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt Text Generation • Updated 26 days ago • 6
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt Text Generation • Updated 26 days ago • 6
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-seed-bt Text Generation • Updated 26 days ago • 8
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-seed-bt Text Generation • Updated 26 days ago • 8
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated 27 days ago • 8
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated 27 days ago • 8
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated 27 days ago • 13
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated 27 days ago • 13