yl-1993

https://yanglei.me

AI & ML interests

None yet

Recent Activity

liked a model 14 days ago

sensenova/SenseNova-SI-1.3-InternVL3-8B

published a model 14 days ago

sensenova/SenseNova-SI-1.3-InternVL3-8B

upvoted a collection 2 days ago

NEO1_0

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Oct 17, 2025 • 7

upvoted 5 collections about 1 month ago

Encoders-Lightx2v

2 items • Updated about 1 month ago • 2

Wan2.1-Lightx2v

4 items • Updated about 1 month ago • 2

Wan2.2-Lightx2v

4 items • Updated about 1 month ago • 8

Qwen-Image-Lightx2v

3 items • Updated 23 days ago • 7

NVFP4-Lightx2v

1 item • Updated about 1 month ago • 8

upvoted 2 papers about 1 month ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 64

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published Dec 15, 2025 • 74

upvoted a collection about 1 month ago

SenseNova-SI

Scaling Spatial Intelligence with Multimodal Foundation Models • 10 items • Updated 11 days ago • 15

upvoted a paper about 2 months ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 93

upvoted 2 papers 2 months ago

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published Nov 17, 2025 • 47

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

Paper • 2511.13648 • Published Nov 17, 2025 • 53

upvoted 3 papers 3 months ago

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Paper • 2510.27684 • Published Oct 31, 2025 • 23

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30, 2025 • 27

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16, 2025 • 67

upvoted a paper 4 months ago

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published Sep 29, 2025 • 37

upvoted 4 papers 5 months ago

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Paper • 2508.21496 • Published Aug 29, 2025 • 55

EgoTwin: Dreaming Body and View in First Person

Paper • 2508.13013 • Published Aug 18, 2025 • 21

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Paper • 2508.13142 • Published Aug 18, 2025 • 34

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Paper • 2508.13154 • Published Aug 18, 2025 • 62