DCAgent/rl__24GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__qwen3base-GLM-4_7-sw Updated about 2 hours ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h2_language_proportional__qwen3base-GLM-4_7-sw Updated about 2 hours ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h2_language_balanced__qwen3base-GLM-4_7-sw Updated about 3 hours ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h1_struggle_zone__qwen3base-GLM-4_7-sw Updated about 3 hours ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_baseline_uniform__qwen3base-GLM-4_7-sw Updated about 4 hours ago