Update README.md
Browse files
README.md
CHANGED
|
@@ -47,13 +47,21 @@ tags:
|
|
| 47 |
- Run as LlamaEdge service
|
| 48 |
|
| 49 |
```bash
|
| 50 |
-
wasmedge --dir .:. --nn-preload default:GGML:AUTO:openchat-3.5-0106-Q5_K_M.gguf
|
|
|
|
|
|
|
|
|
|
|
|
|
| 51 |
```
|
| 52 |
|
| 53 |
- Run as LlamaEdge command app
|
| 54 |
|
| 55 |
```bash
|
| 56 |
-
wasmedge --dir .:. --nn-preload default:GGML:AUTO:openchat-3.5-0106-Q5_K_M.gguf
|
|
|
|
|
|
|
|
|
|
|
|
|
| 57 |
```
|
| 58 |
|
| 59 |
## Quantized GGUF Models
|
|
|
|
| 47 |
- Run as LlamaEdge service
|
| 48 |
|
| 49 |
```bash
|
| 50 |
+
wasmedge --dir .:. --nn-preload default:GGML:AUTO:openchat-3.5-0106-Q5_K_M.gguf \
|
| 51 |
+
llama-api-server.wasm \
|
| 52 |
+
--prompt-template openchat \
|
| 53 |
+
--reverse-prompt '<|end_of_turn|>'
|
| 54 |
+
--ctx-size 4096
|
| 55 |
```
|
| 56 |
|
| 57 |
- Run as LlamaEdge command app
|
| 58 |
|
| 59 |
```bash
|
| 60 |
+
wasmedge --dir .:. --nn-preload default:GGML:AUTO:openchat-3.5-0106-Q5_K_M.gguf \
|
| 61 |
+
llama-chat.wasm \
|
| 62 |
+
--prompt-template openchat \
|
| 63 |
+
--reverse-prompt '<|end_of_turn|>'
|
| 64 |
+
--ctx-size 4096
|
| 65 |
```
|
| 66 |
|
| 67 |
## Quantized GGUF Models
|