hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_24with_question_embedding-1-0-20260522-235359 Updated 4 days ago • 10
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_24without_question_embedding-1-0-20260523-000400 Updated 4 days ago • 16
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_16with_question_embedding-1-0-20260522-231635 Updated 4 days ago • 17
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_16without_question_embedding-1-0-20260522-231635 Updated 4 days ago • 15 • 1
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_8with_question_embedding-1-0-20260522-231636 Updated 4 days ago • 16
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_8without_question_embedding-1-0-20260522-231635 Updated 4 days ago • 16
hanspeterlyngsoeraaschoujensen/Reasoning_Data_25K_Qwen3_0_6B Viewer • Updated 4 days ago • 25.2k • 35
hanspeterlyngsoeraaschoujensen/qwen3-0.6b-openr1-traces-sft-equivalent Viewer • Updated 4 days ago • 8.5k • 105
hanspeterlyngsoeraaschoujensen/natural-plan-trip-progress-qwen-format Viewer • Updated Apr 24 • 3.2k • 159
hanspeterlyngsoeraaschoujensen/agentic-progressbar-eval-trajectories Viewer • Updated Mar 29 • 92 • 6
hanspeterlyngsoeraaschoujensen/Reasoning_Data_25K_Qwen3_4B_Thinking_2507 Viewer • Updated Mar 12 • 25.2k • 91