PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval Paper • 2603.01493 • Published Mar 2 • 21
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published Feb 25 • 45
Training a Student Expert via Semi-Supervised Foundation Model Distillation Paper • 2604.03841 • Published Apr 4 • 11
MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control Paper • 2604.06156 • Published Apr 7 • 11
CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era Paper • 2602.23452 • Published Feb 26 • 18
How to Take a Memorable Picture? Empowering Users with Actionable Feedback Paper • 2602.21877 • Published Feb 25 • 17
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images Paper • 2603.02210 • Published Mar 2 • 30
HoneyBee: Data Recipes for Vision-Language Reasoners Paper • 2510.12225 • Published Oct 14, 2025 • 12
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published Jan 8 • 37
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures Paper • 2601.11514 • Published Jan 16 • 25
Realiz3D: 3D Generation Made Photorealistic via Domain-Aware Learning Paper • 2605.13852 • Published Mar 25 • 26