purplesquirrelnetworks/multichain-day-training
Viewer • Updated • 128 • 34
How to use purplesquirrelnetworks/purple-squirrel-r1-multichain with MLX:
# Make sure mlx-lm is installed
# pip install --upgrade mlx-lm
# Generate text with mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("purplesquirrelnetworks/purple-squirrel-r1-multichain")
prompt = "Write a story about Einstein"
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
messages, add_generation_prompt=True
)
text = generate(model, tokenizer, prompt=prompt, verbose=True)How to use purplesquirrelnetworks/purple-squirrel-r1-multichain with MLX LM:
# Install MLX LM uv tool install mlx-lm # Interactive chat REPL mlx_lm.chat --model "purplesquirrelnetworks/purple-squirrel-r1-multichain"
# Install MLX LM
uv tool install mlx-lm
# Start the server
mlx_lm.server --model "purplesquirrelnetworks/purple-squirrel-r1-multichain"
# Calling the OpenAI-compatible server with curl
curl -X POST "http://localhost:8000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "purplesquirrelnetworks/purple-squirrel-r1-multichain",
"messages": [
{"role": "user", "content": "Hello"}
]
}'A fine-tuned multichain ecosystem expert model, trained on 58 conference sessions from Wrapped Events covering cross-chain protocols, DeFi infrastructure, and Web3 technology.
| Resource | Link |
|---|---|
| Base Model | purple-squirrel-r1 |
| GGUF Version | purple-squirrel-r1-gguf |
| Research Paper | AIDP Neural Cloud (live) |
| Research Paper | AIDP Video Forge (live) |
| Training Data | multichain-day-training |
| LoRA Adapters | purple-squirrel-r1-multichain-lora |
| General Training | purple-squirrel-training |
| Coldstar Whitepaper | coldstar-whitepaper |
| Property | Value |
|---|---|
| Base Model | DeepSeek-R1-Distill-Llama-8B (4-bit quantized) |
| Fine-tuning | MLX LoRA on Apple Silicon |
| Trainable Params | 2.621M / 8,030M (0.033%) |
| Training Data | 58 videos, 237,566 words, 1,133 training pairs |
| Final Val Loss | 3.091 (from 3.799, -18.6%) |
| Format | MLX safetensors (4-bit, group_size=64) |
| Size | ~4.2 GB |
| Developer | Purple Squirrel Media |
Conference sessions from @wrappedxyz:
Auto-generated YouTube subtitles extracted via yt-dlp, parsed into Q&A training pairs covering summarization, topic analysis, and protocol explanations.
from mlx_lm import load, generate
model, tokenizer = load("purplesquirrelnetworks/purple-squirrel-r1-multichain")
messages = [
{"role": "system", "content": "You are a multichain ecosystem expert. Answer factually about cross-chain protocols and Web3 infrastructure."},
{"role": "user", "content": "What is Wormhole and how does it enable cross-chain communication?"}
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
response = generate(model, tokenizer, prompt=prompt, max_tokens=500)
print(response)
mlx_lm.server --model purplesquirrelnetworks/purple-squirrel-r1-multichain --port 8800
@techreport{karsten2026neuralcloud,
title={AIDP Neural Cloud: Distributed LLM Inference on Decentralized GPU Networks},
author={Karsten, Matthew},
institution={Purple Squirrel Networks},
year={2026},
month={February},
url={https://huggingface.co/purplesquirrelnetworks/aidp-neural-cloud-paper}
}
@techreport{karsten2026videoforge,
title={AIDP Video Forge: GPU-Accelerated Video Processing on Decentralized Compute Networks},
author={Karsten, Matthew},
institution={Purple Squirrel Networks},
year={2026},
month={February},
url={https://huggingface.co/purplesquirrelnetworks/aidp-video-forge-paper}
}
Built by Purple Squirrel Networks
4-bit
Base model
deepseek-ai/DeepSeek-R1-Distill-Llama-8B