Jiaheng Liu

CheeryLJH

AI & ML interests

None yet

Recent Activity

commented on a paper 3 days ago

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

upvoted a paper 3 days ago

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published 4 days ago • 21

upvoted a paper 4 days ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 7 days ago • 50

upvoted a paper 7 days ago

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Paper • 2604.14683 • Published 8 days ago • 35

upvoted a paper 8 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 9 days ago • 151

upvoted 2 papers 10 days ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 11 days ago • 13

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published 11 days ago • 38

upvoted a paper 23 days ago

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Paper • 2603.29664 • Published 24 days ago • 48

upvoted a paper 30 days ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published about 1 month ago • 35

upvoted 3 papers about 1 month ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 308

MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants

Paper • 2603.09652 • Published Mar 10 • 15

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 48

upvoted 4 papers about 2 months ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Paper • 2602.23866 • Published Feb 27 • 88

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Paper • 2602.22675 • Published Feb 26 • 23

OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published Feb 26 • 53

upvoted 5 papers 2 months ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 60

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 145

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Paper • 2602.14234 • Published Feb 15 • 27

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

Paper • 2602.09514 • Published Feb 10 • 11

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 196