interstellarninja/tool-use-multiturn-reasoning Viewer β’ Updated Jul 27, 2025 β’ 14.6k β’ 323 β’ 31
Running 147 The ultimate guide to RL environments: building and scaling them in the LLM era π 147 Building and scaling RL environments for LLM training
GestaltLabs/Ornstein-Hermes-3.6-27b-SABER-GGUF Text Generation β’ 27B β’ Updated 15 days ago β’ 3.54k β’ 17