PeppePasti's Collections: Computer Vision
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper
• 2409.02095
• Published • 37
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Paper
• 2409.01704
• Published • 83
CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Paper
• 2409.03643
• Published • 19
UniDet3D: Multi-dataset Indoor 3D Object Detection
Paper
• 2409.04234
• Published • 9
Evaluating Multiview Object Consistency in Humans and Image Models
Paper
• 2409.05862
• Published • 11
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation
Paper
• 2409.06703
• Published • 3
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
Paper
• 2409.07452
• Published • 21
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering
Paper
• 2409.07441
• Published • 12
InstantDrag: Improving Interactivity in Drag-based Image Editing
Paper
• 2409.08857
• Published • 34
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Paper
• 2409.16160
• Published • 34
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper
• 2409.18124
• Published • 33