Baifeng Shi's picture

4 29 6

Baifeng Shi

bfshi

·

https://bfshi.github.io

AI & ML interests

computer vision

Recent Activity

upvoted a collection about 2 months ago

NVILA (HuggingFace)

liked a dataset about 2 months ago

echo-bench/echo2025

upvoted a paper about 2 months ago

Learning to Grasp Anything by Playing with Random Toys

View all activity

Organizations

authored a paper 4 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 159

authored a paper 9 months ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 41

authored a paper 12 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 59

authored a paper over 1 year ago

When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19, 2024 • 26

authored 2 papers almost 2 years ago

Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 28

Rethinking Patch Dependence for Masked Autoencoders

Paper • 2401.14391 • Published Jan 25, 2024 • 26

authored a paper over 2 years ago

Robot Learning with Sensorimotor Pre-training

Paper • 2306.10007 • Published Jun 16, 2023 • 13