HanXiao's picture

HanXiao

HanXiao1999

·

Euphoria16

AI & ML interests

None yet

Recent Activity

authored a paper 9 days ago

UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

upvoted a paper 9 days ago

UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

authored a paper 10 days ago

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

View all activity

Organizations

upvoted a paper 9 days ago

UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

Paper • 2605.29534 • Published 10 days ago • 15

upvoted a paper 11 days ago

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

Paper • 2605.26114 • Published 13 days ago • 64

upvoted 2 papers 3 months ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 49

PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents

Paper • 2603.08013 • Published Mar 9 • 16

upvoted 4 papers 4 months ago

CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion

Paper • 2602.10999 • Published Feb 11 • 11

FeatureBench: Benchmarking Agentic Coding for Complex Feature Development

Paper • 2602.10975 • Published Feb 11 • 18

MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments

Paper • 2602.06075 • Published Feb 3 • 14

FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation

Paper • 2602.03798 • Published Feb 3 • 10

upvoted 2 papers 5 months ago

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

Paper • 2601.00393 • Published Jan 1 • 132

Web World Models

Paper • 2512.23676 • Published Dec 29, 2025 • 27

upvoted 4 papers 8 months ago

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Paper • 2510.14958 • Published Oct 16, 2025 • 23

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

Paper • 2510.12796 • Published Oct 14, 2025 • 13

WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning

Paper • 2509.22644 • Published Sep 26, 2025 • 21

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Paper • 2509.22651 • Published Sep 26, 2025 • 23

upvoted a paper 10 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28, 2025 • 85

upvoted a paper 12 months ago

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation

Paper • 2506.09350 • Published Jun 11, 2025 • 48

upvoted 4 papers about 1 year ago

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

Paper • 2505.21496 • Published May 27, 2025 • 38

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Paper • 2505.10557 • Published May 15, 2025 • 50

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Paper • 2505.03733 • Published May 6, 2025 • 17

LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

Paper • 2504.19838 • Published Apr 28, 2025 • 23