Hugging Face
Datasets: 2,083
Sort: Trending
Active filters: visual-question-answering
nvidia/Nemotron-Image-Training-v3 • Viewer • Updated 8 days ago • 6.92M • 2.87k • 49
nvidia/PhysicalAI-Traffic-Anomaly-Reasoning • Viewer • Updated 1 day ago • 10 • 146 • 6
ibm-research/WikiVQABench • Viewer • Updated about 3 hours ago • 344 • 8 • 5
sensenova/SenseNova-SI-800K • Viewer • Updated 9 days ago • 1k • 5.22k • 18
derek-thomas/ScienceQA • Viewer • Updated Feb 25, 2023 • 21.2k • 15.9k • 226
AI4Math/MathVista • Viewer • Updated Feb 11, 2024 • 6.14k • 21.4k • 215
ibrahimhamamci/CT-RATE • Preview • Updated Mar 16 • 124k • 239
RogerFerrod/GroundSet • Preview • Updated Mar 17 • 1.85k • 5
ibm-granite/ChartNet • Viewer • Updated 6 days ago • 4.94M • 11.3k • 30
3dlg-hcvc/ReVSI • Viewer • Updated 5 days ago • 24.2k • 891 • 8
liuhaotian/LLaVA-Instruct-150K • Preview • Updated Jan 3, 2024 • 7.02k • 596
MathLLMs/MathVision • Viewer • Updated 12 days ago • 3.34k • 19.4k • 141
VLMEval/OpenVLMRecords • Updated Apr 8, 2025 • 1.26k • 15
Xkev/LLaVA-CoT-100k • Viewer • Updated Dec 20, 2025 • 98.6k • 1.64k • 105
letxbe/BoundingDocs • Viewer • Updated Jun 20, 2025 • 48.2k • 681 • 19
nvidia/Llama-Nemotron-VLM-Dataset-v1 • Viewer • Updated Oct 22, 2025 • 2.86M • 1.55k • 163
UII-AI/MedVidBench • Viewer • Updated 6 days ago • 6.25k • 448 • 12
VLR-CVC/DocVQA-2026 • Viewer • Updated Mar 12 • 73 • 1.68k • 72
Forithmus/MR-RATE • Updated 13 days ago • 56.8k • 40
Charlie019/CourtSI-1M • Preview • Updated Mar 11 • 73 • 5
mlcglab/synwts • Preview • Updated 5 days ago • 64 • 2
twinkle-ai/tw-drug-labels-vision • Viewer • Updated 3 days ago • 72.3k • 57 • 2
facebook/textvqa • Updated Jan 18, 2024 • 1.53k • 38
Lin-Chen/MMStar • Viewer • Updated Apr 7, 2024 • 1.5k • 31.7k • 51
deepvk/GQA-ru • Viewer • Updated Aug 14, 2024 • 80.1k • 187 • 7
VQA-Illusion/FashionMnist_train • Viewer • Updated Apr 2, 2025 • 6.3k • 50 • 1
openbmb/RLAIF-V-Dataset • Preview • Updated Oct 14, 2025 • 1.66k • 214
nyu-visionx/Cambrian-10M • Preview • Updated Jul 8, 2024 • 5.7k • 128
nyu-visionx/CV-Bench • Viewer • Updated Jul 20, 2025 • 5.28k • 5.36k • 43
Hothan/OlympiadBench • Viewer • Updated Jun 8, 2025 • 8.48k • 5.61k • 45