Inference Providers
Active filters: sparse
tensorblock/Llama-2-7b-pruned50-retrained-GGUF
Text Generation
• 7B • Updated • 10
mradermacher/phi-2-pruned50-GGUF
3B • Updated • 183
mradermacher/llama2.c-stories110M-pruned50-GGUF
0.1B • Updated • 71
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
7B • Updated • 16
• 1
mradermacher/MiniChat-2-3B-pruned2.4-GGUF
3B • Updated • 11
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF
7B • Updated • 49
mradermacher/llama2.c-stories110M-pruned50-i1-GGUF
0.1B • Updated • 62
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
7B • Updated • 34
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF
7B • Updated • 38
tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
tensorblock/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_70-GGUF
7B • Updated • 7
mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_50-GGUF
7B • Updated • 81
mradermacher/Nous-Hermes-2-Yi-34B-pruned2.4-GGUF
34B • Updated • 12
mradermacher/Nous-Hermes-2-Yi-34B-pruned50-GGUF
34B • Updated • 35
ibm-granite/granite-embedding-30m-sparse
Feature Extraction
• 30.3M • Updated • 61.8k
• • 25
opensearch-project/opensearch-neural-sparse-encoding-multilingual-v1
Feature Extraction
• 0.2B • Updated • 7.51k
• • 17
mradermacher/opensearch-neural-sparse-encoding-doc-v2-mini-GGUF
22.6M • Updated • 72
mradermacher/SparseLlama-3-8B-pruned_50.2of4-GGUF
8B • Updated • 31
• 1
opensearch-project/opensearch-neural-sparse-encoding-doc-v3-distill
Feature Extraction
• 67M • Updated • 6.69k
• • 10
tjingrant/sparsellm-1b-40p
1B • Updated • 3
tjingrant/sparsellm-1b-60p-small-dense
0.7B • Updated tjingrant/sparsellm-1b-80p
1B • Updated • 1
tjingrant/sparsellm-1b-60p
1B • Updated tjingrant/sparsellm-1b-20p
1B • Updated tjingrant/sparsellm-1b-80p-small-dense
0.5B • Updated tjingrant/sparsellm-1b-40p-small-dense
0.9B • Updated • 12
tjingrant/sparsellm-1b-20p-small-dense
1B • Updated • 12
tensorblock/RedHatAI_llama2.c-stories110M-pruned50-GGUF
0.1B • Updated • 3
sparse-encoder-testing/splade-bert-tiny-nq
Feature Extraction
• 4.42M • Updated • 28.2k