tokyotech-llm

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

sh1gechan published a model 17 minutes ago

tokyotech-llm/Medical-Qwen3-Swallow-30B-A3B

sh1gechan published a model 17 minutes ago

tokyotech-llm/Medical-Qwen3-Swallow-32B

sh1gechan published a model 17 minutes ago

tokyotech-llm/Medical-Qwen3-Swallow-8B

View all activity

Organization Card

Community About org cards

Swallow LLM

Research and development of large language models conducted by the members mainly in Okazaki Laboratory and Yokota Laboratory at Institute of Science Tokyo (formerly known as Tokyo Institute of Technology)

From Okazaki Laboratory, Institute of Science Tokyo, the following members:
- Naoaki Okazaki
- Sakae Mizuki
- Youmi Ma
- Koki Maeda
- Masanari Ohi
- Koshiro Saito
- Tatsuya Ichinose
- Naoya Matsushita
- Sora Miyamoto
- Nguyen Tien Dung
- Yuta Katayama
- Takaya Hiratsuka
From YOKOTA Laboratory, Institute of Science Tokyo, the following members:
- Rio Yokota
- Kazuki Fujii
- Taishi Nakamura
- Shigeki Ishida
- Masaki Kawamura
- Yukito Tajima
- Daisuke Nohara
From Artificial Intelligence Research Center, AIST, Japan, the following members:
- Hiroya Takamura

Collections 16

View 16 collections

models 138

tokyotech-llm/Medical-GPT-OSS-Swallow-120B

Text Generation • 117B • Updated 3 days ago

tokyotech-llm/Medical-Qwen3-Swallow-8B

Text Generation • 8B • Updated 3 days ago

tokyotech-llm/Medical-Qwen3-Swallow-30B-A3B

Text Generation • 31B • Updated 3 days ago

tokyotech-llm/Medical-Qwen3-Swallow-32B

Text Generation • 33B • Updated 3 days ago

tokyotech-llm/GPT-OSS-Swallow-20B-RL-v0.1-MXFP4

Text Generation • 22B • Updated 21 days ago • 179

tokyotech-llm/GPT-OSS-Swallow-120B-RL-v0.1-MXFP4

Text Generation • 120B • Updated 21 days ago • 410 • 1

tokyotech-llm/Qwen3-Swallow-8B-SFT-v0.2

Text Generation • 8B • Updated Feb 23 • 1.69k • • 5

tokyotech-llm/Qwen3-Swallow-32B-CPT-v0.2

Text Generation • 33B • Updated Feb 23 • 165 • 2

tokyotech-llm/Qwen3-Swallow-30B-A3B-CPT-v0.2

Text Generation • 31B • Updated Feb 23 • 157

tokyotech-llm/Qwen3-Swallow-8B-CPT-v0.2

Text Generation • 8B • Updated Feb 23 • 146 • • 1

View 138 models

datasets 19

tokyotech-llm/swallow-math

Viewer • Updated Mar 1 • 4.33M • 965 • 48

tokyotech-llm/swallow-code

Viewer • Updated Mar 1 • 129M • 959 • 66

tokyotech-llm/Swallow-Nemotron-Post-Training-Dataset-v1

Viewer • Updated Feb 21 • 8.84M • 499 • 6

tokyotech-llm/lmsys-chat-1m-synth

Updated Feb 20 • 519 • 21

tokyotech-llm/s1-test-time-scaling-synth-public

Viewer • Updated Feb 19 • 59k • 51

tokyotech-llm/swallow-code-v2

Viewer • Updated Nov 8, 2025 • 147M • 47k • 38

tokyotech-llm/swallow-math-v2

Viewer • Updated Nov 6, 2025 • 17.4M • 15.1k • 31

tokyotech-llm/swallow_english_mt_bench

Viewer • Updated Aug 18, 2025 • 80 • 76

tokyotech-llm/MMLU-ProX-English

Updated Aug 18, 2025 • 162

tokyotech-llm/MMLU-Pro-English

Updated Aug 18, 2025 • 296

View 19 datasets