End of training f735c9d verified
- attn_loss_fn=ce, attn_weight=2.0 End of training
- attn_loss_fn=cos, attn_weight=2.0 Training in progress, step 12375
- attn_loss_fn=jsd, attn_weight=2.0 Training in progress, step 12375
- attn_loss_fn=kl, attn_weight=2.0 Training in progress, step 12375
- attn_loss_fn=mse, attn_weight=2.0 Training in progress, step 12375
- attn_loss_fn=mse_sum, attn_weight=2.0 Training in progress, step 12375
- attn_loss_fn=reverse_kl, attn_weight=2.0 Training in progress, step 12375
- 5.91 kB Training in progress, step 12375
- 5.91 kB Training in progress, step 12375
- 196 kB Training in progress, step 12375
- 5.91 kB Training in progress, step 12375
- 3.35 MB Training in progress, step 12375
- 520 Bytes Training in progress, step 12375