Article • Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers • Jan 19, 2024 • 46
Article • How to make NeuTTS-air generate over 200 seconds of audio in a single second. • Nov 21, 2025 • 24
Article • Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers • Nov 3, 2022 • 363
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 268
Article • DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge • Feb 7, 2025 • 278
Article • 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It? • Mar 17, 2025 • 354
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025 • 273
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published Feb 11, 2025 • 57
Phi-3 Collection • Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 25 items • Updated 10 days ago • 575