
Introducing Meka: An Open-Source Framework for Building Autonomous Computer Agents
Architecture behind Meka's state of the art webArena performance.
Architecture behind Meka's state of the art webArena performance.
Introduction Meka establishes a new state-of-the-art in browser use benchmarks. Our computer agent achieved a 72.7% success rate across 651 diverse web tasks on WebArena, reflecting tasks involving shopping, store administration, Wikipedia, Reddit, and GitLab. This post outlines detailed information on the WebArena evaluation, including results, environment, prompting, test