Zhang David

Zhang-David

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Plan-X: Instruct Video Generation via Semantic Planning

liked a model 5 months ago

dashtoon/hunyuan-video-keyframe-control-lora

upvoted a paper 5 months ago

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Plan-X: Instruct Video Generation via Semantic Planning

Paper • 2511.17986 • Published Nov 22 • 16

liked a model 5 months ago

dashtoon/hunyuan-video-keyframe-control-lora

Updated Mar 7 • 76

upvoted a paper 5 months ago

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

Paper • 2507.23682 • Published Jul 31 • 23

liked 2 datasets 9 months ago

timm/mini-imagenet

Viewer • Updated Nov 20, 2024 • 65k • 7k • 19

LanguageBind/Open-Sora-Plan-v1.0.0

Viewer • Updated Apr 9, 2024 • 1.6k • 1.38k • 65

liked a model 9 months ago

naver/DUSt3R_ViTLarge_BaseDecoder_512_dpt

Image-to-3D • 0.6B • Updated Jul 12, 2024 • 29.1k • 16

upvoted a paper 9 months ago

Long-Video Audio Synthesis with Multi-Agent Collaboration

Paper • 2503.10719 • Published Mar 13 • 9

liked a model 9 months ago

ragavsachdeva/magi

Feature Extraction • 0.5B • Updated Apr 9 • 1.64k • 45

liked a model 10 months ago

Junyi42/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt

Image-to-3D • 0.6B • Updated Oct 30, 2024 • 657k • 20

liked a dataset 10 months ago

KlingTeam/SynCamVideo-Dataset

Updated Apr 15 • 198 • 30

liked a Space 12 months ago

TransPixar

😻

246

https://huggingface.co/papers/2501.03006

liked a Space about 1 year ago

CogVideoX-5B

🎥

1.02k

Text-to-Video

liked a model almost 2 years ago

Salesforce/instructblip-vicuna-7b

Image-Text-to-Text • 8B • Updated Feb 3 • 31k • 98

liked a Space almost 2 years ago

InstructBLIP

⚡

VQA