arxiv:2512.16969
Jiaqi Wei
VitaCoco
AI & ML interests
None yet
Recent Activity
upvoted a collection 21 days ago
AgentDoG upvoted a paper about 2 months ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs upvoted a paper about 2 months ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning