Examples of tasks we designed in https://arxiv.org/abs/2504.15266
Chen Wu PRO
ChenWu98
AI & ML interests
Generative models
Recent Activity
updated a model 3 days ago
ChenWu98/grpo_generator_both_Qwen-Qwen3-8B_lr1e-6 published a model 3 days ago
ChenWu98/grpo_generator_both_Qwen-Qwen3-8B_lr1e-6 updated a model 3 days ago
ChenWu98/grpo_generator_easy_Qwen-Qwen3-8B_lr1e-6Organizations
None yet