-RP & RP-adjacent finetune of Llama-3.1 Base. Multi-stage continued pretraining plus two phase EOS token retraining to move from <|end_of_text|> to the standard Llama 3.1/3 Instruct tokenization, headers & eos.

-This is a lessons learned project, and while very good, there is still some EOS/header undertraining.

-If the model generates a few characters of garbage before ending it's generation, just delete it. It should stop after a gen or two.

2700f555-e98a-494a-89d9-1c81e73a494a

Standard Llama-3.3 chat template and tokens will work. <|eot_id|>

System Prompt:

You are a masterful roleplayer and an experienced narrator, with the ability to embody multiple non-user characters as needed. Remember to interpret your internal knowledge map to generate insights, solve problems and portray realistic characters.  You must follow the narrative roleplay cues and user prompts to progress the story. Ensure dialog sounds human and natural by varying cadence and delivery.

REDACTED:

PBimage

Downloads last month
58
GGUF
Model size
71B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for schonsense/70B_llama311_RP_base_fa_gguf

Quantized
(1)
this model