AI agents can write code.
AI agents can browse the web.
AI agents can pass the bar exam.
Can they edit a video?
We gave the 7 best frontier models 100 real-world post-production tasks, scored by 20 industry experts.
Best agent: barely crosses 30%
Human experts: 89%
Introducing AgenticVBench.