arxiv:2601.16208
Jihan Yang
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
liked a dataset 4 days ago: allenai/Molmo2-VideoCapQA
liked a dataset 5 days ago: jasonzhango/SPAR-7M
authored a paper 22 days ago: Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders