Suraj

ghishadow

AI & ML interests

None yet

Recent Activity

liked a model 13 days ago

LiquidAI/LFM2-2.6B-Exp

Text Generation • 3B • Updated 4 days ago • 15.2k • 318

liked a model 27 days ago

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20, 2025 • 32.7k • 98

liked a model about 1 month ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated 23 days ago • 53.2k • 519

upvoted a collection about 1 month ago

Ministral 3

Collection

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 16 days ago • 26

liked a model about 2 months ago

litert-community/Gemma3-1B-IT

Text Generation • Updated about 6 hours ago • 18.3k • 459

liked a model 2 months ago

maya-research/maya1

Text-to-Speech • 3B • Updated Nov 12, 2025 • 59.4k • 842

upvoted a paper 3 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 49

liked 2 models 3 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 543k • 1.18k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 6.54M • 4.18k

upvoted an article 4 months ago

Article

The Hacker's Guide to Building an AI Supercluster

Aug 31, 2025

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌

3.63k

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 5 months ago

Gemma 3-270m

Collection

Collection of models for Gemma 3-270m • 4 items • Updated 24 days ago • 21

liked a Space 5 months ago

Wllama

🦙

Run GGUF directly on your browser!

liked a model 5 months ago

google/gemma-3-270m

Text Generation • 0.3B • Updated Aug 14, 2025 • 47.5k • 949

liked a Space 5 months ago

chat-ui

🔥

1.21k

Redirect to HuggingChat for conversations

liked a model 5 months ago

microsoft/Phi-3.5-mini-instruct

Text Generation • 4B • Updated 29 days ago • 274k • 942

upvoted a paper 5 months ago

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Paper • 2507.14111 • Published Jul 18, 2025 • 23

liked a model 5 months ago

tencent/HunyuanWorld-1

Image-to-3D • Updated Oct 20, 2025 • 4.2k • 477

liked 2 models 6 months ago

HuggingFaceTB/SmolLM3-3B

Text Generation • 3B • Updated Sep 10, 2025 • 69.4k • 867

apple/DiffuCoder-7B-cpGRPO

8B • Updated Dec 8, 2025 • 773 • 316
