VGraf/repeat_response_flip_tulu_5maxturns_big_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated Oct 28 • 21.5k • 19
VGraf/paraphrase_train_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated Oct 27 • 5.28k • 33
VGraf/general_responses_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated Oct 27 • 5.19k • 30
VGraf/self-talk_gpt3.5_gpt4o_prefpairs_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27 • 10k • 31
VGraf/repeat_tulu_5maxturns_big_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27 • 4k • 29
VGraf/paraphrase_train_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27 • 4k • 23
VGraf/general_responses_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27 • 4k • 22
VGraf/olmo-3-preference-mix-deltas_reasoning-yolo_scottmix-chosen_qwen32b_rejected_qwen4b-DECON Viewer • Updated Sep 25 • 294k • 59
VGraf/olmo-3-preference-mix-deltas_reasoning-yolo_scottmix-chosen_qwen32b_rejected_qwen8b-DECON Viewer • Updated Sep 25 • 165k • 24
VGraf/context_switch_alpacaeval_4maxLeadingTurns_filtered_truncated3500_olmoTok3500 Viewer • Updated Sep 23 • 758 • 26
VGraf/context_switch_alpacaeval_4maxLeadingTurns_filtered_truncated3500 Viewer • Updated Sep 19 • 763 • 25