
Meka Achieves State-of-the-Art Performance for Computer Use
Introduction Meka establishes a new state-of-the-art in browser use benchmarks. Our computer agent achieved a 72.7% success rate across 651 diverse web tasks on WebArena, reflecting tasks involving shopping, store administration, Wikipedia, Reddit, and GitLab. This post outlines detailed information on the WebArena evaluation, including results, environment, prompting, test