Jack Min Ong
Yea sure, here it is: https://github.com/huggingface/blog/pull/3309
Nice article! Was really informative to know how all the frameworks are thinking about the problem.
On the MoE LoRA part, prime-rl actually supports MoE LoRA as well: https://github.com/PrimeIntellect-ai/prime-rl/blob/main/src/prime_rl/trainer/models/layers/lora/multi_moe.py.
vLLM releases have supported per-expert LoRA loading and serving for MoEs for a while: https://github.com/vllm-project/vllm/blob/2488a82f89b15ad2ebed12160dcc423d44210db2/vllm/lora/ops/triton_ops/fused_moe_lora_op.py#L158.
SGL has an unmerged PR to support MoE LoRAs: https://github.com/sgl-project/sglang/pull/14105.
Support for expert-parallel inference with MoE LoRAs is currently in the works for both vLLM and SGL, as far as I'm aware.
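For anyone unfamiliar, the core idea behind per-expert MoE LoRA is just giving each routed expert its own low-rank delta on top of its frozen base weight. Here's a minimal NumPy sketch of that forward pass (function and variable names are mine, not from any of the codebases linked above, and this ignores the fused/Triton kernel details those implementations actually use):

```python
import numpy as np

def moe_lora_forward(x, W, A, B, topk_idx, gate, alpha=2.0):
    """Forward one token through top-k routed experts, each with its own LoRA.

    x:        (d_in,) input token activation
    W:        (E, d_out, d_in) frozen base expert weights
    A:        (E, r, d_in) per-expert LoRA down-projections
    B:        (E, d_out, r) per-expert LoRA up-projections
    topk_idx: indices of the experts the router selected for this token
    gate:     router weights for those experts (same length as topk_idx)
    alpha:    LoRA scaling factor
    """
    out = np.zeros(W.shape[1])
    for e, g in zip(topk_idx, gate):
        base = W[e] @ x                     # frozen expert output
        delta = B[e] @ (A[e] @ x)           # low-rank LoRA update for expert e
        out += g * (base + alpha * delta)   # gate-weighted combination
    return out
```

With `A` initialized to zeros (the usual LoRA init), the delta vanishes and this reduces exactly to the base MoE forward, which is what makes per-expert adapters cheap to bolt on.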
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

