checkpoint

#10
by moonpie - opened

thanks for your work, I'd like to ask what's the difference between the following three checkpoints.
pytorch_model-00001-of-00003.bin
pytorch_model-00002-of-00003.bin
pytorch_model-00003-of-00003.bin

I've noticed some variability in performance when using tool calls across different prompts for Llama-2-7b-hf. Have others observed similar inconsistencies? Specifically, does warm vs cold cache have a significant impact on tool call accuracy and stability? Any insights or configurations recommended to mitigate this?

Sign up or log in to comment