14 36 4

Peng Xia

richardxp888

https://richard-peng-xia.github.io

AI & ML interests

None yet

Recent Activity

new activity 5 days ago

Jianwen/Search-7B-RL:Create README.md

new activity 5 days ago

Jianwen/Search-7B-SFT:Create README.md

new activity 5 days ago

Jianwen/Webshop-7B-RL:Create README.md

View all activity

Organizations

New activity in Jianwen/Search-7B-RL 5 days ago

Create README.md

#1 opened 5 days ago by

richardxp888

New activity in Jianwen/Search-7B-SFT 5 days ago

Create README.md

#1 opened 5 days ago by

richardxp888

New activity in Jianwen/Webshop-7B-RL 5 days ago

Create README.md

#1 opened 5 days ago by

richardxp888

New activity in Jianwen/Webshop-7B-SFT 5 days ago

Create README.md

#1 opened 5 days ago by

richardxp888

New activity in Jianwen/Alfworld-7B-RL 5 days ago

Create README.md

#1 opened 5 days ago by

richardxp888

New activity in Jianwen/Alfworld-7B-SFT 5 days ago

Update README.md

#2 opened 5 days ago by

richardxp888

Create README.md

#1 opened 5 days ago by

richardxp888

upvoted a paper 19 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 25 days ago • 341

authored 3 papers 19 days ago

Reliable and Responsible Foundation Models: A Comprehensive Survey

Paper • 2602.08145 • Published 25 days ago • 8

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 21 days ago • 67

MedVerse: Efficient and Reliable Medical Reasoning via DAG-Structured Parallel Execution

Paper • 2602.07529 • Published 23 days ago

upvoted a paper 19 days ago

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 21 days ago • 67

submitted a paper to Daily Papers 19 days ago

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 21 days ago • 67

upvoted a paper 20 days ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published 28 days ago • 137

upvoted a paper 23 days ago

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

Paper • 2602.05258 • Published 25 days ago • 7

authored a paper about 2 months ago

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 37

upvoted 2 papers about 2 months ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 56

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 37

updated a model 2 months ago

aimingpppyb/qwen25vl_7b_n_9

8B • Updated Dec 30, 2025

published a model 2 months ago

aimingpppyb/qwen25vl_7b_n_9

8B • Updated Dec 30, 2025

Peng Xia

AI & ML interests

Recent Activity

Organizations

richardxp888's activity

Create README.md

Create README.md

Create README.md

Create README.md

Create README.md

Update README.md

Create README.md