Le Huy Hoang's picture

Le Huy Hoang

splendor1811

·

huyhoang18112k2

AI & ML interests

Computer Vision

Recent Activity

upvoted an article 24 days ago

Voice Cloning with Consent

upvoted an article 24 days ago

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

liked a dataset 24 days ago

openslr/librispeech_asr

View all activity

Organizations

None yet

upvoted 2 articles 24 days ago

Article

Voice Cloning with Consent

Oct 28, 2025

•

38

Article

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

Jan 19, 2024

•

46

upvoted 2 articles about 1 month ago

Article

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

Nov 21, 2025

•

24

Article

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Nov 3, 2022

•

363

upvoted an article 4 months ago

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

343

upvoted a paper 7 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 268

upvoted a collection 8 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.71k

upvoted 3 articles 8 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

278

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Mar 17, 2025

•

354

Article

I trained a Language Model to schedule events with GRPO!

Apr 29, 2025

•

94

upvoted a paper 9 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

upvoted a collection 9 months ago

Qwen3-Embedding

6 items • Updated Dec 31, 2025 • 149

upvoted an article 10 months ago

Article

Vision Language Models (Better, faster, stronger)

+3

May 12, 2025

•

600

upvoted a paper about 1 year ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11, 2025 • 57

upvoted 2 articles about 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4, 2025

•

1.32k

Article

SmolVLM - small yet mighty Vision Language Model

+3

Nov 26, 2024

•

416

upvoted a paper about 1 year ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 90

upvoted a collection about 1 year ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 25 items • Updated 10 days ago • 575

upvoted a collection over 1 year ago

MIT Talk 31/10 Papers

14 items • Updated Oct 28, 2024 • 32

upvoted a paper almost 2 years ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 91