Multilingual Web datasets
AI & ML interests
Open Source Language Models for Europe
Organization Card
Occiglot is an ongoing open research project for multilingual language models.
If you want to train a model for your own language or are working on evaluations, please contact us or join our Discord server. We are actively seeking collaborations!
First release of 7B LLMs for the 5 biggest European languages. All models are initialised from mistral-7b-v0.1.
occiglot/occiglot-7b-eu5
Text Generation • 7B • Updated • 12 • 27
occiglot/occiglot-7b-eu5-instruct
Text Generation • 7B • Updated • 29 • 10
occiglot/occiglot-7b-es-en
Text Generation • 7B • Updated • 5 • 4
occiglot/occiglot-7b-es-en-instruct
Text Generation • 7B • Updated • 7 • 2
Models (10)
occiglot/occiglot-7b-es-en-instruct
Text Generation • 7B • Updated • 7 • 2
occiglot/occiglot-7b-eu5
Text Generation • 7B • Updated • 12 • 27
occiglot/occiglot-7b-de-en-instruct
Text Generation • 7B • Updated • 1k • 24
occiglot/occiglot-7b-eu5-instruct
Text Generation • 7B • Updated • 29 • 10
occiglot/occiglot-7b-it-en-instruct
Text Generation • 7B • Updated • 1.14k • 5
occiglot/occiglot-7b-fr-en-instruct
Text Generation • 7B • Updated • 9 • 3
occiglot/occiglot-7b-it-en
Text Generation • 7B • Updated • 6 • 5
occiglot/occiglot-7b-fr-en
Text Generation • 7B • Updated • 56 • 3
occiglot/occiglot-7b-de-en
Text Generation • 7B • Updated • 10 • 7
occiglot/occiglot-7b-es-en
Text Generation • 7B • Updated • 5 • 4
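The model names above appear to follow a pattern: `occiglot-7b-`, then either `eu5` or a pair of language codes, then an optional `-instruct` suffix. The Python sketch below parses that pattern so a script can pick a checkpoint by language. It is an illustration only, not an official Occiglot tool: the helper names (`parse_model_id`, `load_model`, `EU5_LANGUAGES`) are hypothetical, and reading `eu5` as English, Spanish, French, German and Italian is inferred from the bilingual model names listed here. The loader assumes the `transformers` library is installed and has not been checked against these specific repositories.

```python
from typing import List, NamedTuple

# Assumption: "eu5" covers the five languages that appear in the bilingual
# model names on this page (es, fr, de, it) plus English.
EU5_LANGUAGES = ["en", "es", "fr", "de", "it"]

PREFIX = "occiglot/occiglot-7b-"
INSTRUCT_SUFFIX = "-instruct"


class OcciglotModelId(NamedTuple):
    repo_id: str
    languages: List[str]
    instruct: bool


def parse_model_id(repo_id: str) -> OcciglotModelId:
    """Split a repo ID such as 'occiglot/occiglot-7b-de-en-instruct' into parts."""
    if not repo_id.startswith(PREFIX):
        raise ValueError(f"not an Occiglot 7B model ID: {repo_id!r}")
    rest = repo_id[len(PREFIX):]
    instruct = rest.endswith(INSTRUCT_SUFFIX)
    if instruct:
        rest = rest[: -len(INSTRUCT_SUFFIX)]
    # "eu5" is the multilingual variant; anything else is read as language codes.
    languages = list(EU5_LANGUAGES) if rest == "eu5" else rest.split("-")
    return OcciglotModelId(repo_id, languages, instruct)


def load_model(repo_id: str):
    """Load tokenizer and weights with transformers (downloads the full 7B checkpoint)."""
    # Imported lazily so the parsing helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)
    return tokenizer, model


if __name__ == "__main__":
    for rid in ("occiglot/occiglot-7b-eu5", "occiglot/occiglot-7b-de-en-instruct"):
        print(parse_model_id(rid))
```

Calling `load_model` is left out of the demo on purpose, since it fetches a 7B-parameter checkpoint; filter the IDs with `parse_model_id` first and load only the one you need.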
Datasets (6)
occiglot/arcX
Viewer • Updated • 26.4k • 562
occiglot/hellaswagX
Viewer • Updated • 240k • 229
occiglot/euro-llm-leaderboard-requests
Updated • 63 • 2
occiglot/occiglot-fineweb-v1.0
Updated • 556 • 3
occiglot/occiglot-fineweb-v0.5
Viewer • Updated • 226M • 6 • 15
occiglot/tokenizer-wiki-bench
Viewer • Updated • 84.4M • 17.5k • 6