🌐 Browser Use：93k Stars，让你的 AI 直接操控真实浏览器，pip install 就搞定

老实说，每次看到 AI Agent 说自己"能操控浏览器"，点进去一看——要么只是个 screenshot 截图工具，要么必须用人家搭好的云环境。你想让它填个表单、买点东西？先配三天 API。

Browser Use 不一样。93,828 Stars / 10,605 Forks，纯 Python 写，pip install 就完事。你的 AI 直接操控你的真实浏览器——本地 Chrome，你每天用的那个。

GitHub：https://github.com/browser-use/browser-use

语言：Python | 协议：MIT | 创建：2024-10

它能干什么？

🔥 填表单、投简历 — "帮我填这份工作申请，用我的简历。" Browser Use 自己打开网页、定位输入框、填内容、点提交。完整代码见 apply_to_job.py

🛒 购物 — "帮我把这些加购到 Instacart。" 它能理解自然语言描述的购物清单，逐个搜索、比对、加入购物车。

💻 个人助理 — "帮我找一套 PC 组装件。" 它会自己逛电商网站，比对配置和价格，给你列出来。

最骚的是，所有这些你不需要改一行浏览器配置，不需要什么 remote debugging 端口。

上手

# 用 uv（推荐，Python >= 3.11）
uv init && uv add browser-use && uv sync
# 如果没有 Chromium，运行这个
uvx browser-use install

配好 API key：

# .env 文件
BROWSER_USE_API_KEY=***
# 或者用其他模型
# ANTHROPIC_API_KEY=***
# GOOGLE_API_KEY=***

写你的第一个 agent：

from browser_use import Agent, Browser, ChatBrowserUse
import asyncio

async def main():
    browser = Browser()
    agent = Agent(
        task="帮我查一下 browser-use 这个 repo 有多少颗星",
        llm=ChatBrowserUse(),
        browser=browser,
    )
    await agent.run()

if __name__ == "__main__":
    asyncio.run(main())

跑起来就完事了。Agent 会打开浏览器、跳转 GitHub、读取 star 数、然后告诉你结果。

# Using uv (recommended, Python >= 3.11)
uv init && uv add browser-use && uv sync
# Install Chromium if needed
uvx browser-use install

Set up your API key:

# .env file
BROWSER_USE_API_KEY=***
# Or use other models
# ANTHROPIC_API_KEY=***
# GOOGLE_API_KEY=***

Write your first agent:

from browser_use import Agent, Browser, ChatBrowserUse
import asyncio

async def main():
    browser = Browser()
    agent = Agent(
        task="Find the number of stars of the browser-use repo",
        llm=ChatBrowserUse(),
        browser=browser,
    )
    await agent.run()

if __name__ == "__main__":
    asyncio.run(main())

Run it and watch it go. The agent opens the browser, navigates to GitHub, reads the star count, and reports back.

CLI 模式也不错

不想写代码？它内置了 CLI：

# 浏览一个页面
browser-use open https://example.com
# 看看哪些元素可以点
browser-use state
# 点击第 5 个可交互元素
browser-use click 5
# 输入文字
browser-use type "Hello"
# 截图
browser-use screenshot page.png
# 关闭浏览器
browser-use close

CLI 模式保持浏览器一直开着，你可以一步步指挥它，适合调试和快速任务。

Don't feel like writing code? It has a built-in CLI:

# Navigate to a URL
browser-use open https://example.com
# See clickable elements
browser-use state
# Click element by index
browser-use click 5
# Type text
browser-use type "Hello"
# Take screenshot
browser-use screenshot page.png
# Close browser
browser-use close

The CLI keeps the browser running between commands — great for debugging and quick tasks.

几个要点

自定义工具很简单 — 用 @tools.action(description=...) 装饰器就能给 Agent 加自定能力

2. Cloud 版更猛 — 如果你需要抗检测指纹、代理轮换、验证码绕过，他们有付费云服务

3. Claude Code Skill — 一键安装 mkdir -p ~/.claude/skills/browser-use && curl -o ... 就能在 Claude Code 里直接用

4. ChatBrowserUse 模型最优 — 他们自己训练的浏览器操控模型，比通用模型快 3-5x，输入 $0.20/百万 token，输出 $2.00/百万 token

菜单

分享

🌐 Browser Use：93k Stars，让你的 AI 直接操控真实浏览器，pip install 就搞定

🌐 Browser Use：93k Stars，让你的 AI 直接操控真实浏览器，pip install 就搞定

它能干什么？

上手

CLI 模式也不错

几个要点

评论

🧠 Mem0：55k Stars 的开源 AI 记忆层，pip install 让你的 Agent 不再"转头就忘" / Mem0: 55k Stars Open-Source Memory Layer for AI Agents

🐺 OpenFang：17.5k Stars 的开源 Agent 操作系统，装了它你的 Agent 就自己干活了

🤖 AionUi：25k Stars 的开源 AI 协作桌面，一个 App 管理所有 Coding Agent / AionUi: Free Open-Source Multi-Agent Cowork Desktop

🍒 Cherry Studio：45k Stars 的跨平台 AI 桌面客户端，一个 App 装下所有大模型

⚡ Mastra：23.9k Stars 的 TypeScript AI Agent 框架，Gatsby 团队出品，一行命令搭好生产级 Agent

🎨 Taste Skill：17k Stars 的 Anti-Slop 前端框架，一句命令让 AI 不再生成丑界面

⚡ Agno：40k Stars 的一站式 Agent 平台 SDK，20 行代码搭出生产级 AI 应用

🔥 GenericAgent：11.4k Stars 的自我进化 Agent，3K 行代码长出专属技能树

🎯 Page Agent：17.8k Stars，阿里开源的 JavaScript 页面 GUI Agent，一行代码给你的网页装上 AI

🦌 DeerFlow：ByteDance's 67k Stars SuperAgent Harness，三行命令跑起一个 Agent 团队