Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OpenVINO
's Collections
Visual Language Models
Image Generation
Speculative Decoding Draft Models
Speech-to-Text
LLM
LLMs optimized for NPU
LLM
updated
Jul 9, 2025
Collection of OpenVINO optimized LLMs
Upvote
53
+43
OpenVINO/phi-2-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
73
•
1
OpenVINO/phi-2-int8-ov
Text Generation
•
Updated
Oct 29, 2024
•
29
OpenVINO/mistral-7b-instruct-v0.1-int8-ov
Text Generation
•
Updated
Dec 4, 2024
•
46
•
1
OpenVINO/mistral-7b-instruct-v0.1-fp16-ov
Text Generation
•
Updated
Nov 11, 2024
•
14
OpenVINO/mistral-7b-instruct-v0.1-int4-ov
Text Generation
•
Updated
Oct 29, 2024
•
23
OpenVINO/codegen25-7b-multi-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
9
OpenVINO/mixtral-8x7b-instruct-v0.1-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
35
•
4
OpenVINO/Mixtral-8x7B-Instruct-v0.1-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
29
•
4
OpenVINO/notus-7b-v1-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/notus-7b-v1-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
7
OpenVINO/neural-chat-7b-v3-3-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
10
OpenVINO/neural-chat-7b-v3-3-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
16
•
1
OpenVINO/zephyr-7b-beta-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
14
OpenVINO/zephyr-7b-beta-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
24
OpenVINO/dolly-v2-3b-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
9
OpenVINO/dolly-v2-3b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
11
OpenVINO/dolly-v2-3b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
8
OpenVINO/codegen2-3_7B_P-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/codegen2-3_7B_P-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
11
OpenVINO/zephyr-7b-beta-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
17
OpenVINO/codegen2-3_7B_P-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
9
•
1
OpenVINO/TinyLlama-1.1B-Chat-v1.0-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
22
•
2
OpenVINO/TinyLlama-1.1B-Chat-v1.0-int4-ov
Text Generation
•
Updated
24 days ago
•
1.01k
•
1
OpenVINO/TinyLlama-1.1B-Chat-v1.0-int8-ov
Text Generation
•
Updated
24 days ago
•
865
•
1
OpenVINO/gpt-neox-20b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
8
OpenVINO/gpt-neox-20b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/gpt-j-6b-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
14
OpenVINO/gpt-j-6b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
23
OpenVINO/gpt-j-6b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
18
OpenVINO/falcon-7b-instruct-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/falcon-7b-instruct-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
13
OpenVINO/falcon-7b-instruct-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
14
OpenVINO/open_llama_7b_v2-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
42
OpenVINO/open_llama_7b_v2-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
18
OpenVINO/open_llama_7b_v2-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
27
OpenVINO/open_llama_3b_v2-int8-ov
Text Generation
•
Updated
Oct 29, 2024
•
11
•
1
OpenVINO/open_llama_3b_v2-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
16
OpenVINO/phi-2-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
21
•
1
OpenVINO/neural-chat-7b-v3-3-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
9
•
1
OpenVINO/notus-7b-v1-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
6
OpenVINO/RedPajama-INCITE-Chat-3B-v1-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
6
OpenVINO/RedPajama-INCITE-7B-Instruct-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
18
OpenVINO/RedPajama-INCITE-7B-Instruct-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
11
OpenVINO/RedPajama-INCITE-7B-Chat-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
14
OpenVINO/RedPajama-INCITE-7B-Instruct-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/RedPajama-INCITE-7B-Chat-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
10
OpenVINO/RedPajama-INCITE-Chat-3B-v1-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/RedPajama-INCITE-7B-Chat-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
9
OpenVINO/RedPajama-INCITE-Chat-3B-v1-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
8
OpenVINO/dolly-v2-7b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/Mistral-7B-Instruct-v0.2-int8-ov
Text Generation
•
Updated
Oct 29, 2024
•
14
•
1
OpenVINO/dolly-v2-12b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
14
OpenVINO/Mistral-7B-Instruct-v0.2-int4-ov
Text Generation
•
Updated
Oct 31, 2024
•
1.63k
•
1
OpenVINO/Mistral-7B-Instruct-v0.2-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
43
OpenVINO/persimmon-8b-chat-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
8
OpenVINO/persimmon-8b-chat-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
8
OpenVINO/persimmon-8b-chat-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
13
OpenVINO/pythia-12b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
5
OpenVINO/pythia-2.8b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
10
OpenVINO/pythia-2.8b-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
9
OpenVINO/pythia-12b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
6
OpenVINO/pythia-6.9b-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
6
OpenVINO/pythia-6.9b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
11
OpenVINO/pythia-2.8b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
15
OpenVINO/pythia-6.9b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
12
OpenVINO/pythia-1b-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
23
OpenVINO/neural-chat-7b-v1-1-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
11
OpenVINO/neural-chat-7b-v1-1-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
9
OpenVINO/neural-chat-7b-v1-1-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
13
OpenVINO/Phi-3-medium-4k-instruct-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
11
OpenVINO/Phi-3-medium-4k-instruct-int4-ov
Text Generation
•
Updated
Oct 29, 2024
•
19
•
3
OpenVINO/Phi-3-medium-4k-instruct-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
17
OpenVINO/mpt-7b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
13
OpenVINO/mpt-7b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
13
OpenVINO/starcoder2-15b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
14
OpenVINO/starcoder2-15b-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
18
OpenVINO/starcoder2-15b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
21
OpenVINO/starcoder2-7b-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
17
OpenVINO/starcoder2-7b-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
19
OpenVINO/starcoder2-7b-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
16
OpenVINO/Phi-3-mini-4k-instruct-fp16-ov
Text Generation
•
Updated
Nov 25, 2024
•
16
•
3
OpenVINO/Phi-3-mini-128k-instruct-fp16-ov
Text Generation
•
Updated
Nov 5, 2024
•
15
OpenVINO/Phi-3-mini-128k-instruct-int4-ov
Text Generation
•
Updated
Oct 31, 2024
•
41
•
2
OpenVINO/Phi-3-mini-4k-instruct-int4-ov
Text Generation
•
Updated
Nov 25, 2024
•
941
•
2
OpenVINO/Phi-3-mini-4k-instruct-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
14
OpenVINO/Phi-3-mini-128k-instruct-int8-ov
Text Generation
•
Updated
Nov 5, 2024
•
16
•
5
OpenVINO/open_llama_3b_v2-int4-ov
Text Generation
•
Updated
Nov 5, 2024
•
19
OpenVINO/RedPajama-INCITE-Instruct-3B-v1-fp16-ov
Updated
Jul 29, 2025
•
5
OpenVINO/RedPajama-INCITE-Instruct-3B-v1-int4-ov
Updated
Jul 29, 2025
•
7
OpenVINO/RedPajama-INCITE-Instruct-3B-v1-int8-ov
Updated
Jul 29, 2025
•
7
OpenVINO/gemma-2b-it-fp16-ov
Updated
Nov 20, 2024
•
8
OpenVINO/gemma-2b-it-int8-ov
Updated
Nov 25, 2024
•
22
OpenVINO/gemma-2b-it-int4-ov
Updated
Nov 25, 2024
•
70
OpenVINO/gemma-7b-fp16-ov
Updated
Nov 5, 2024
•
9
OpenVINO/gemma-7b-int4-ov
Updated
Nov 5, 2024
•
6
OpenVINO/gemma-7b-int8-ov
Updated
Nov 5, 2024
•
3
OpenVINO/gemma-7b-it-int8-ov
Updated
Nov 5, 2024
•
16
OpenVINO/gemma-7b-it-fp16-ov
Updated
Nov 5, 2024
•
9
OpenVINO/gemma-7b-it-int4-ov
Updated
Nov 5, 2024
•
18
OpenVINO/bloomz-3b-int8-ov
Updated
Nov 5, 2024
•
7
OpenVINO/bloomz-3b-int4-ov
Updated
Nov 5, 2024
•
1
OpenVINO/bloomz-3b-fp16-ov
Updated
Nov 5, 2024
•
8
OpenVINO/codegen-6B-multi-int4-ov
Updated
Nov 5, 2024
•
4
OpenVINO/codegen-6B-multi-fp16-ov
Updated
Nov 5, 2024
•
6
OpenVINO/codegen-6B-multi-int8-ov
Updated
Nov 5, 2024
•
4
OpenVINO/Phi-3.5-mini-instruct-fp16-ov
Updated
Nov 25, 2024
•
5
OpenVINO/Phi-3.5-mini-instruct-int8-ov
Updated
Nov 25, 2024
•
7
OpenVINO/Phi-3.5-mini-instruct-int4-ov
Updated
Nov 25, 2024
•
672
•
6
OpenVINO/gemma-2-9b-it-int4-ov
Updated
Nov 25, 2024
•
168
OpenVINO/gemma-2-9b-it-int8-ov
Updated
Nov 25, 2024
•
12
OpenVINO/gemma-2-9b-it-fp16-ov
Updated
Nov 25, 2024
•
5
OpenVINO/DeepSeek-R1-Distill-Qwen-1.5B-fp16-ov
Updated
Mar 20, 2025
•
12
OpenVINO/DeepSeek-R1-Distill-Qwen-1.5B-int8-ov
Updated
Mar 20, 2025
•
14
OpenVINO/DeepSeek-R1-Distill-Qwen-1.5B-int4-ov
Updated
Mar 20, 2025
•
1.1k
OpenVINO/DeepSeek-R1-Distill-Qwen-7B-int4-ov
Updated
Mar 24, 2025
•
710
OpenVINO/DeepSeek-R1-Distill-Qwen-7B-int8-ov
Updated
Mar 24, 2025
•
24
OpenVINO/DeepSeek-R1-Distill-Qwen-7B-fp16-ov
Updated
Mar 24, 2025
•
19
OpenVINO/DeepSeek-R1-Distill-Qwen-14B-fp16-ov
Updated
Mar 31, 2025
•
7
OpenVINO/DeepSeek-R1-Distill-Qwen-14B-int8-ov
Updated
Mar 31, 2025
•
6
•
1
OpenVINO/DeepSeek-R1-Distill-Qwen-14B-int4-ov
Updated
Mar 31, 2025
•
19
OpenVINO/Qwen2.5-1.5B-Instruct-fp16-ov
Updated
Apr 28, 2025
•
66
OpenVINO/Qwen2.5-1.5B-Instruct-int4-ov
Updated
Apr 28, 2025
•
1.38k
OpenVINO/Qwen2.5-1.5B-Instruct-int8-ov
Updated
Apr 28, 2025
•
55
OpenVINO/Qwen2.5-14B-Instruct-int4-ov
Updated
Apr 28, 2025
•
31
OpenVINO/Qwen2.5-14B-Instruct-int8-ov
Updated
Apr 28, 2025
•
17
OpenVINO/Qwen2.5-14B-Instruct-fp16-ov
Updated
Apr 28, 2025
•
6
•
1
OpenVINO/Qwen2.5-7B-Instruct-fp16-ov
Updated
Apr 28, 2025
•
13
OpenVINO/Qwen2.5-7B-Instruct-int8-ov
Updated
Apr 28, 2025
•
28
OpenVINO/Qwen2.5-7B-Instruct-int4-ov
Updated
Apr 28, 2025
•
116
•
2
OpenVINO/Phi-4-mini-instruct-int4-ov
Text Generation
•
Updated
Apr 16, 2025
•
139
•
1
OpenVINO/Phi-4-mini-instruct-int8-ov
Text Generation
•
Updated
Apr 16, 2025
•
17
OpenVINO/Phi-4-mini-instruct-fp16-ov
Text Generation
•
Updated
Apr 16, 2025
•
44
OpenVINO/Qwen2-0.5B-Instruct-fp16-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-0.5B-Instruct-int8-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-0.5B-Instruct-int4-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-0.5B-int8-ov
Updated
Apr 24, 2025
•
6
OpenVINO/Qwen2-0.5B-fp16-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-0.5B-int4-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-1.5B-fp16-ov
Updated
Apr 24, 2025
•
3
OpenVINO/Qwen2-1.5B-int8-ov
Updated
Apr 24, 2025
•
10
OpenVINO/Qwen2-1.5B-int4-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-1.5B-Instruct-fp16-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-1.5B-Instruct-int8-ov
Updated
Apr 24, 2025
•
4
OpenVINO/Qwen2-7B-Instruct-fp16-ov
Updated
Apr 24, 2025
•
5
OpenVINO/Qwen2-7B-Instruct-int4-ov
Updated
Apr 24, 2025
•
6
OpenVINO/Qwen2-1.5B-Instruct-int4-ov
Updated
Apr 24, 2025
•
4
OpenVINO/Qwen2-7B-Instruct-int8-ov
Updated
Apr 24, 2025
•
2
OpenVINO/Qwen3-0.6B-int4-ov
Updated
Apr 30, 2025
•
732
•
2
OpenVINO/Qwen3-0.6B-fp16-ov
Updated
Apr 30, 2025
•
39
OpenVINO/Qwen3-0.6B-int8-ov
Updated
Apr 30, 2025
•
10
OpenVINO/Qwen3-1.7B-fp16-ov
Updated
Apr 30, 2025
•
114
OpenVINO/Qwen3-1.7B-int8-ov
Updated
Apr 30, 2025
•
157
OpenVINO/Qwen3-1.7B-int4-ov
Updated
Apr 30, 2025
•
179
OpenVINO/Qwen3-4B-fp16-ov
Updated
Apr 30, 2025
•
15
OpenVINO/Qwen3-4B-int8-ov
Updated
Apr 30, 2025
•
65
OpenVINO/Qwen3-4B-int4-ov
Updated
May 30, 2025
•
604
OpenVINO/Qwen3-8B-fp16-ov
Updated
Apr 30, 2025
•
28
•
1
OpenVINO/Qwen3-8B-int8-ov
Updated
Apr 30, 2025
•
41
OpenVINO/Qwen3-8B-int4-ov
Updated
Jun 18, 2025
•
525
•
1
OpenVINO/Qwen3-14B-int8-ov
Updated
Apr 30, 2025
•
22
OpenVINO/Qwen3-14B-fp16-ov
Updated
Apr 30, 2025
•
8
OpenVINO/Qwen3-14B-int4-ov
Updated
Apr 30, 2025
•
27
OpenVINO/phi-4-fp16-ov
Text Generation
•
Updated
May 23, 2025
•
6
OpenVINO/phi-4-int8-ov
Text Generation
•
Updated
May 23, 2025
•
6
OpenVINO/phi-4-int4-ov
Text Generation
•
Updated
May 26, 2025
•
19
LLMs optimized for NPU
Collection
10 items
•
Updated
20 days ago
•
13
Upvote
53
+49
Share collection
View history
Collection guide
Browse collections