More info?

by totally-not-an-llm - opened Jul 22, 2023

Discussion

totally-not-an-llm

Jul 22, 2023

This is very cool, could we get some info on how this was created, plus any scripts used?

vgoklani

Jul 22, 2023

yes please

chargoddard

Owner Jul 23, 2023

Hey, thanks for the interest! I've added the script I used to generate the base model to the repo (frankenllama_22.py).
This actually came out of some experiments I was doing with attention head pruning. I decided to try going the other direction instead, and it's looking pretty promising so far.

For the fine tuning, I used axolotl: https://github.com/OpenAccess-AI-Collective/axolotl

7erminalVelociraptor

Aug 21, 2023

@chargoddard Thanks for posting the script, I'm going to experiment with it. Do you know if it's possible to transplant heads from l2-70b instead of l1-33b like in the original script? And does the script need any changing other than pointing to the right donor?

Vezora

Aug 24, 2023

I can't find this github repo, could you link it?

7erminalVelociraptor

Aug 26, 2023

I can't find this github repo, could you link it?

@Vezora Do you mean the merge script? It's the .py file in the files section of this model.

Vezora

Aug 29, 2023

I can't find this github repo, could you link it?

@Vezora Do you mean the merge script? It's the .py file in the files section of this model.

That's embarrassing, thank you!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment