browser-use/browser-use
Browser Use is the leading open-source Python framework that enables large language models to autonomously control web browsers. With nearly 80,000 GitHub stars, it has become the de facto standard for AI-powered browser automation, allowing developers to build agents that navigate websites, fill forms, extract data, and complete multi-step web tasks from natural language instructions.
The framework takes a hybrid approach to web understanding: working through Playwright, it combines DOM parsing with vision-based analysis. When an agent receives a task, Browser Use loads the target page, processes the DOM to identify interactive elements such as buttons, links, and input fields, and optionally engages a vision model for layout-dependent decisions. The LLM then reasons about which actions to take, and the framework executes them in sequence.
Browser Use is LLM-agnostic, supporting OpenAI GPT-4, Anthropic Claude, Google Gemini, and local models via Ollama, alongside its own optimized ChatBrowserUse model. The framework includes a CLI for interactive browser control, a template system for quick project scaffolding, and extensible custom actions defined with Python decorators. For production deployments, Browser Use offers cloud browser infrastructure with stealth capabilities, CAPTCHA handling, and proxy rotation.
Browser Use is backed by a $17 million seed round led by Felicis, with participation from Paul Graham and Y Combinator. It has achieved an 89.1% success rate on the WebVoyager benchmark across 586 diverse web tasks. Its MIT license and active development, with 8,739 commits and 118 releases, make it a robust foundation for AI browser automation projects.
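The cycle described above (index the interactive elements, let a model pick an action, execute it, repeat) can be sketched in plain Python. This is a toy illustration under stated assumptions, not Browser Use's actual API: `index_page`, `action`, `stub_model`, and `run_agent` are hypothetical names invented for this sketch, the "browser" is a static HTML string parsed with the standard library, and the "LLM" is a scripted stand-in. The decorator-based registry only mirrors the idea of custom actions defined with Python decorators.

```python
from html.parser import HTMLParser

# Tags treated as "interactive" in this sketch (an assumption, not the library's list).
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}


class InteractiveElementIndexer(HTMLParser):
    """Collects interactive elements and assigns each a numeric index."""

    def __init__(self):
        super().__init__()
        self.elements = []

    def handle_starttag(self, tag, attrs):
        if tag in INTERACTIVE_TAGS:
            self.elements.append(
                {"index": len(self.elements), "tag": tag, "attrs": dict(attrs)}
            )


def index_page(html):
    """Return the interactive elements of a page in document order."""
    parser = InteractiveElementIndexer()
    parser.feed(html)
    return parser.elements


# Hypothetical action registry: maps an action name to its description and function.
ACTIONS = {}


def action(description):
    """Decorator that registers a function as an action the model may choose."""
    def register(func):
        ACTIONS[func.__name__] = {"description": description, "run": func}
        return func
    return register


@action("Type text into the element with the given index")
def type_text(elements, index, text):
    elements[index]["attrs"]["value"] = text
    return f"typed {text!r} into element {index}"


@action("Click the element with the given index")
def click(elements, index):
    return f"clicked element {index}"


def stub_model(task, elements, history):
    """Scripted stand-in for the LLM: returns the next action, or None when done."""
    if len(history) == 0:
        target = next(e for e in elements if e["tag"] == "input")
        return {"name": "type_text", "args": {"index": target["index"], "text": task}}
    if len(history) == 1:
        target = next(e for e in elements if e["tag"] == "button")
        return {"name": "click", "args": {"index": target["index"]}}
    return None


def run_agent(task, html, model=stub_model, max_steps=5):
    """Observe the page, ask the model for an action, execute it, and repeat."""
    elements = index_page(html)
    history = []
    for _ in range(max_steps):
        decision = model(task, elements, history)
        if decision is None:
            break
        result = ACTIONS[decision["name"]]["run"](elements, **decision["args"])
        history.append(result)
    return history


PAGE = """
<form>
  <input name="q" type="text">
  <button type="submit">Search</button>
  <a href="/help">Help</a>
</form>
"""

if __name__ == "__main__":
    for step in run_agent("browser automation", PAGE):
        print(step)
```

In a real agent the scripted `stub_model` would be replaced by an LLM call that receives the task, the indexed element list, and the action descriptions, and the actions would drive a live browser rather than mutate a dictionary.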
Why It Matters
Browser Use addresses one of the most practical challenges in AI agent development: giving LLMs the ability to interact with the web the way humans do. Traditional browser automation tools like Selenium and Puppeteer rely on brittle CSS selectors and XPath expressions that break when UIs change, whereas Browser Use lets AI agents adapt to layout changes automatically through visual understanding. With over 2,400 dependent projects and 15,000+ developer users, Browser Use has established itself as critical infrastructure for the AI agent ecosystem, and adoption across 20+ Y Combinator companies attests to its production readiness. For developers building AI agents that need web access, whether for research automation, data extraction, e-commerce workflows, or form completion, Browser Use provides the most battle-tested, community-supported foundation available.