tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0012500
8B
•
Updated
•
40
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0010000
8B
•
Updated
•
12
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0007500
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0005000
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0002500
8B
•
Updated
•
6
tokyotech-llm/Llama-3-Swallow-70B-v0.1
Text Generation
•
Updated
•
49
•
•
6
tokyotech-llm/Llama-3-Swallow-8B-v0.1
Text Generation
•
Updated
•
252
•
•
12
tokyotech-llm/Llama-3-Swallow-70B-Instruct-v0.1
Text Generation
•
71B
•
Updated
•
42
•
•
7
tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1
Text Generation
•
Updated
•
8.88k
•
•
21
tokyotech-llm/Swallow-70b-instruct-v0.1
Text Generation
•
69B
•
Updated
•
29
tokyotech-llm/Swallow-13b-instruct-v0.1
Text Generation
•
13B
•
Updated
•
315
•
1
tokyotech-llm/Swallow-7b-instruct-v0.1
Text Generation
•
7B
•
Updated
•
485
•
3
tokyotech-llm/Swallow-70b-NVE-instruct-hf
Text Generation
•
69B
•
Updated
•
6
•
2
tokyotech-llm/Swallow-70b-instruct-hf
Text Generation
•
69B
•
Updated
•
1.11k
•
37
tokyotech-llm/Swallow-13b-instruct-hf
Text Generation
•
13B
•
Updated
•
171
•
18
tokyotech-llm/Swallow-7b-NVE-instruct-hf
Text Generation
•
7B
•
Updated
•
96
•
3
tokyotech-llm/Swallow-7b-instruct-hf
Text Generation
•
7B
•
Updated
•
1.25k
•
44
tokyotech-llm/Swallow-70b-NVE-hf
Text Generation
•
Updated
•
31
•
1
tokyotech-llm/Swallow-70b-hf
Text Generation
•
Updated
•
70
•
10
tokyotech-llm/Swallow-13b-NVE-hf
Text Generation
•
Updated
•
101
tokyotech-llm/Swallow-13b-hf
Text Generation
•
Updated
•
222
•
12
tokyotech-llm/Swallow-7b-plus-hf
Text Generation
•
Updated
•
89
•
8
tokyotech-llm/Swallow-7b-hf
Text Generation
•
7B
•
Updated
•
914
•
17
tokyotech-llm/Swallow-MX-8x7b-NVE-v0.1
Text Generation
•
47B
•
Updated
•
19
•
29
tokyotech-llm/Swallow-MS-7b-v0.1
Text Generation
•
7B
•
Updated
•
34
•
28
tokyotech-llm/Swallow-MS-7b-instruct-v0.1
Text Generation
•
7B
•
Updated
•
168
•
14