chenghao/idefics2-edgar

Document Question Answering

chenghao committed on Jun 21, 2024

Commit d271c8e · verified · 1 Parent(s): 98829b2

Update README.md

Files changed (1)

README.md +23 -2

@@ -74,7 +74,29 @@ Prediction: {preds or 'N/A'}
 ### Training Procedure
-10 epochs with QLoRA.
+10 epochs with QLoRA. Trained with A100-80GB for about 10 hours.
+```
+MAX_LENGTH = 1024
+USE_LORA = False
+USE_QLORA = True
+MAX_PAGE = 5
+config = {
+    "max_epochs": 10,
+    # "val_check_interval": 0.2,
+    "check_val_every_n_epoch": 1,
+    "gradient_clip_val": 1.0,
+    "accumulate_grad_batches": 12,
+    "lr": 1e-4,
+    "batch_size": 2,
+    "precision": "16-mixed",
+    "seed": 42,
+    "warmup_steps": 50,
+    "result_path": "./result",
+    "verbose": True,
+}
+```
 #### Preprocessing [optional]
@@ -88,7 +110,6 @@ processor = AutoProcessor.from_pretrained(
 )
 ```
 #### Training Hyperparameters
 ```python
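
Reading note on the configuration added in this commit: `batch_size` and `accumulate_grad_batches` together determine how many samples feed each optimizer step. The sketch below works through that arithmetic. It assumes a single device (the README mentions one A100-80GB) and that gradient accumulation behaves as it does in common trainers, where a partial accumulation window at the end of an epoch still triggers a step. The training script itself is not shown in this diff, so the helper names and the dataset size are hypothetical.

```python
import math

# Values copied from the config dict in this commit.
config = {
    "max_epochs": 10,
    "accumulate_grad_batches": 12,
    "batch_size": 2,
    "warmup_steps": 50,
}


def effective_batch_size(cfg, num_devices=1):
    """Samples contributing to one optimizer step (num_devices is an assumption)."""
    return cfg["batch_size"] * cfg["accumulate_grad_batches"] * num_devices


def optimizer_steps(cfg, num_samples, num_devices=1):
    """Total optimizer steps over all epochs, assuming the last partial batch
    and the last partial accumulation window are both kept."""
    batches_per_epoch = math.ceil(num_samples / (cfg["batch_size"] * num_devices))
    steps_per_epoch = math.ceil(batches_per_epoch / cfg["accumulate_grad_batches"])
    return steps_per_epoch * cfg["max_epochs"]


print(effective_batch_size(config))  # 2 * 12 = 24

# Hypothetical dataset size; the real training set size is not stated here.
total = optimizer_steps(config, num_samples=1200)
print(total)                           # 50 steps per epoch * 10 epochs = 500
print(config["warmup_steps"] / total)  # warmup fraction under this assumption
```

With a hypothetical 1,200 training samples, the 50 warmup steps would span exactly the first of the 10 epochs. The actual fraction depends on the real dataset size.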