Naholav commited on
Commit
3bcec37
·
verified ·
1 Parent(s): f3a074f

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -31,9 +31,13 @@ This is the best performing checkpoint from the **deep_think** training configur
31
 
32
  - **Prompt Style:** Think (uses `<think>` tags for reasoning)
33
  - **System Prompt:** "You are an expert programmer. Use <think> tags for reasoning before writing code."
34
- - **LoRA Rank:** 64
35
- - **LoRA Alpha:** 128
36
- - **Learning Rate:** 2e-4
 
 
 
 
37
 
38
  ## All Models Performance Comparison
39
 
 
31
 
32
  - **Prompt Style:** Think (uses `<think>` tags for reasoning)
33
  - **System Prompt:** "You are an expert programmer. Use <think> tags for reasoning before writing code."
34
+ - **LoRA Rank:** 32
35
+ - **LoRA Alpha:** 64
36
+ - **LoRA Dropout:** 0.05
37
+ - **Learning Rate:** 5e-5
38
+
39
+
40
+ **Note:** All 4 models were trained with identical hyperparameters for fair comparison. Better configurations may be discovered through hyperparameter search methods (e.g., grid search, random search).
41
 
42
  ## All Models Performance Comparison
43