🗃️ multimodal_dataset
BoyaWu10/Bunny-v1_0-data • Preview • Updated Jun 11, 2024 • 327 • 17
MelosY/TextMonkey_Data • Viewer • Updated Apr 18, 2024 • 15.7k • 110 • 4
nyu-visionx/Cambrian-10M • Preview • Updated Jul 8, 2024 • 8.06k • 128
OpenGVLab/ShareGPT-4o • Viewer • Updated Aug 17, 2024 • 59.4k • 4.54k • 198
📄 du_dataset
vidore/colpali_train_set • Viewer • Updated Jun 20, 2025 • 119k • 9.5k • 91
U4R/DocGenome • Updated Dec 18, 2024 • 761 • 17
✅ multimodal_eval
Ocrbench Leaderboard 🏆 • Running • 223 • Show OCRBench leaderboard rankings for OCR models
awesome_vlm_models
stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 158k • 1.54k
🗃️ multimodal_vi_dataset
Vi-VLM/Vista • Viewer • Updated Jun 25, 2024 • 707k • 260 • 44
uitnlp/OpenViVQA-dataset • Viewer • Updated Dec 13, 2023 • 11.2k • 443 • 10
LR-AI-Labs/vi-OCR_VQA • Viewer • Updated Apr 11, 2024 • 33.5k • 52 • 7
5CD-AI/Vietnamese-openbmb-RLAIF-V-Dataset-gg-translated • Viewer • Updated May 30, 2024 • 83.1k • 85 • 2
📝 ocr_dataset
pixparse/pdfa-eng-wds • Viewer • Updated Mar 29, 2024 • 7.1k • 6.11k • 159
pixparse/idl-wds • Viewer • Updated Mar 29, 2024 • 3.41M • 3.49k • 193
wanderkid/UniMER_Dataset • Preview • Updated Mar 25, 2025 • 203 • 26
lightonai/fc-amf-ocr • Viewer • Updated Sep 23, 2024 • 58.6k • 662 • 23
⚙️ function_calling
NousResearch/Hermes-2-Pro-Llama-3-8B • Text Generation • 8B • Updated Sep 14, 2024 • 26.5k • 448