Article ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM ibm-research • 4 days ago • 12
📝 Research & Long-Form Blog Posts Collection In-depth technical articles and research pieces published by Hugging Face • 18 items • Updated 3 days ago • 25
Reply: Epic launch Niels! I've been loving it! Really missed paperswithcode when it was gone. Thank you for all the efforts bringing it back even better.
Article Eight Days in China: What I Learned from the AI Labs, Robotics Startups and Academia matthew-d-white • 9 days ago • 3
Post 9891 Harness, Scaffold, Context Engineering, Agent... do you actually know what they mean? We wrote an AI agent glossary and tried to make sense of it all with simple definitions and real examples. ↓ Go read it ↓ https://huggingface.co/blog/agent-glossary 1 reply · 👍 8 · 👀 3 · 🔥 1
Article Harness, Scaffold, and the AI Agent Terms Worth Getting Right sergiopaniego, ariG23498 • 7 days ago • 82
CohereLabs/command-a-plus-05-2026-w4a4 Image-Text-to-Text • 126B • Updated 4 days ago • 15.6k • 214