Riya
tai-tai-sama
AI & ML interests
Large Language Models, Applied ML, AI Agents, ReAct, Reflexion, Function Calling Models, Model Fine-Tuning, LoRA, QLoRA, RAG Systems, Semantic Search, Code Understanding Models, AST-Based Chunking, Model Evaluation, Generative AI, Production ML, ML Infrastructure, Cost Optimization, Token Efficiency, MLOps, Transformers, PyTorch, Llama Models
Recent Activity
upvoted an article 11 days ago
🪆 Introduction to Matryoshka Embedding Models
liked a model 11 days ago
jinaai/jina-embeddings-v5-text-small
reacted to sergiopaniego's post with 🔥 3 months ago
TRL now includes agent training support for GRPO‼️
Train 🕵️ agents with 🔧 tools, enabling interaction with external functions and APIs.
And of course, a new notebook and scripts to get you up to speed.
notebook tutorial: https://github.com/huggingface/trl/blob/main/examples/notebooks/grpo_agent.ipynb
script examples: https://github.com/huggingface/trl/blob/main/examples/scripts/grpo_agent.py
📦 TRL v0.26.0 release: https://github.com/huggingface/trl/releases/tag/v0.26.0