Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
29
6
Baifeng Shi
bfshi
Follow
21world's profile picture
mlfu7's profile picture
yehors-cv's profile picture
7 followers
·
5 following
https://bfshi.github.io
baifeng_shi
bfshi
AI & ML interests
computer vision
Recent Activity
upvoted
a
collection
about 2 months ago
NVILA (HuggingFace)
liked
a dataset
about 2 months ago
echo-bench/echo2025
upvoted
a
paper
about 2 months ago
Learning to Grasp Anything by Playing with Random Toys
View all activity
Organizations
bfshi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
authored
a paper
4 months ago
Scaling RL to Long Videos
Paper
•
2507.07966
•
Published
Jul 10
•
159
authored
a paper
9 months ago
Scaling Vision Pre-Training to 4K Resolution
Paper
•
2503.19903
•
Published
Mar 25
•
41
authored
a paper
12 months ago
NVILA: Efficient Frontier Visual Language Models
Paper
•
2412.04468
•
Published
Dec 5, 2024
•
59
authored
a paper
over 1 year ago
When Do We Not Need Larger Vision Models?
Paper
•
2403.13043
•
Published
Mar 19, 2024
•
26
authored
2 papers
almost 2 years ago
Humanoid Locomotion as Next Token Prediction
Paper
•
2402.19469
•
Published
Feb 29, 2024
•
28
Rethinking Patch Dependence for Masked Autoencoders
Paper
•
2401.14391
•
Published
Jan 25, 2024
•
26
authored
a paper
over 2 years ago
Robot Learning with Sensorimotor Pre-training
Paper
•
2306.10007
•
Published
Jun 16, 2023
•
13