Active filters: ar
Cognitive-Lab/NayanaOCR_Corpus_2025
Viewer • Updated • 1.01M • 3.22k • 10
Viewer • Updated • 10.4B • 764k • 584
Viewer • Updated • 88.8k • 24.3k • 1.52k
google-research-datasets/tydiqa
Viewer • Updated • 241k • 4.34k • 38
openlanguagedata/flores_plus
Viewer • Updated • 883k • 14.5k • 141
Helsinki-NLP/OpenSubtitles2024
Viewer • Updated • 570M • 832 • 12
eaddario/imatrix-calibration
Viewer • Updated • 299 • 37.1k • 44
omarkamali/wikipedia-monthly
Viewer • Updated • 195M • 11.7k • 69
Atum09/agent-training-dataset
Viewer • Updated • 64.8k • 222 • 2
AuthenticIlm/Shamela4_Full_DB
Updated • 12.9k • 2
TYDTYDYT/arabic-coding-claude-sft-combined
Viewer • Updated • 58.5k • 2
Viewer • Updated • 61.6M • 244k • 1.23k
Viewer • Updated • 1.4k • 1.3k • 12
Helsinki-NLP/news_commentary
Viewer • Updated • 4.23M • 4.44k • 39
Viewer • Updated • 55.1M • 30.4k • 236
ontonotes/conll2012_ontonotesv5
Updated • 1.37k • 45
Viewer • Updated • 108k • 3.45k • 68
Viewer • Updated • 1.23M • 15.9k • 102
Updated • 2.84k • 72
Viewer • Updated • 434M • 293k • 94
Updated • 1.12k • 80
Preview • Updated • 235 • 37
sentence-transformers/parallel-sentences-jw300
Viewer • Updated • 91.7M • 1.96k • 9
mohres/The_Arabic_E-Book_Corpus
Viewer • Updated • 1.75k • 92 • 3
Viewer • Updated • 290k • 361 • 43
Viewer • Updated • 893M • 6.45k • 36
Viewer • Updated • 63.8k • 408 • 2
Viewer • Updated • 9.03B • 56.6k • 43
romrawinjp/multilingual-coco
Viewer • Updated • 123k • 300 • 2
Viewer • Updated • 11k • 768 • 27