860 Stars 🍴 83 Forks 👀 860 Watchers TypeScript mit
GitHub 链接https://github.com/browserwing/browserwing
项目简介BrowserWing turns your browser actions into MCP commands Or Claude Skill, allowing AI agents to control browsers efficiently and reliably. Say goodbye to slow, token-heavy LLM interactions — let agents call commands directly for faster automation. Perfect for AI-driven tasks, browser automation, and boosting productivity.
创建时间2025-12-11
更新时间2026-02-26
📖 README English
<p align="center"> <img width="600" alt="BrowserWing" src="https://raw.githubusercontent.com/browserwing/browserwing/main/docs/assets/banner.svg"> </p> <p align="center"> <img alt="Go" src="https://img.shields.io/badge/Go-1.21%2B-00ADD8?logo=go&logoColor=white" /> <img alt="React" src="https://img.shields.io/badge/React-18-61DAFB?logo=react&logoColor=white" /> <img alt="TypeScript" src="https://img.shields.io/badge/TypeScript-5-3178C6?logo=typescript&logoColor=white" /> <img alt="Vite" src="https://img.shields.io/badge/Vite-5-646CFF?logo=vite&logoColor=white" /> <img alt="pnpm" src="https://img.shields.io/badge/pnpm-9-F69220?logo=pnpm&logoColor=white" /> <img alt="MCP" src="https://img.shields.io/badge/MCP-Model%20Context%20Protocol-7B61FF" /> </p> <p align="center"> English · <a href="./README.zh-CN.md">简体中文</a> · <a href="./README.ja.md">日本語</a> · <a href="./README.es.md">Español</a> · <a href="./README.pt.md">Português</a> </p> <p align="center"><a href="https://browserwing.com">browserwing.com</a></p> https://github.com/user-attachments/assets/e5377892-4b88-433a-8620-43b38a2fb28f ## Highlights **Native Browser Automation Platform with AI Integration** - **Complete Browser Control**: 26+ HTTP API endpoints for full-featured browser automation - **Built-in AI Agent**: Direct conversational interface for browser automation tasks - **Universal AI Tool Integration**: Native MCP & Skills protocol support - compatible with any AI tool that supports these standards - **Visual Script Recording**: Record browser actions, edit visually, and replay with precision - **Flexible Export Options**: Convert recorded scripts to MCP commands or Skills files for AI tool integration - **Intelligent Data Extraction**: LLM-powered semantic extraction supporting OpenAI, Claude, DeepSeek, and more - **Session Management**: Robust cookie and storage handling for stable, authenticated browsing sessions ## Requirements - Google Chrome or Chromium installed and accessible in your environment. ## Screenshots <img width="600" alt="BrowserWing Homepage" src="https://raw.githubusercontent.com/browserwing/browserwing/main/docs/assets/screenshot_homepage.png"> ### Turn Scripts Into Claude Skill You can now combine any scripts into a SKILL.md. <img width="600" alt="BrowserWing Skill" src="https://raw.githubusercontent.com/browserwing/browserwing/main/docs/assets/screenshot_skill.png"> ## Quick Start ### Option A — Install via Package Manager (recommended) **Using npm:** ```bash npm install -g browserwing browserwing --port 8080 ``` **Using pnpm:** ```bash pnpm add -g browserwing browserwing --port 8080 ``` The npm package automatically tests GitHub and Gitee mirrors during installation and selects the fastest one. **⚠️ macOS Users:** If you encounter a "killed" error when running, fix it with: ```bash xattr -d com.apple.quarantine $(which browserwing) ``` See the [macOS Installation Fix Guide](docs/MACOS_INSTALLATION_FIX.md) for details. **Using Homebrew (macOS/Linux):** ```bash # Coming soon brew install browserwing ``` ### Option B — One-Line Install Script **Linux / macOS:** ```bash curl -fsSL https://raw.githubusercontent.com/browserwing/browserwing/main/install.sh | bash ``` **Windows (PowerShell):** ```powershell iwr -useb https://raw.githubusercontent.com/browserwing/browserwing/main/install.ps1 | iex ``` The script automatically: - Detects your OS/architecture - Tests GitHub and Gitee mirrors, selects the fastest one - Downloads and extracts the binary - Adds to PATH **Then start BrowserWing:** ```bash browserwing --port 8080 # Open http://localhost:8080 in your browser ``` **Note for users in China:** The installation script automatically uses Gitee mirror if GitHub is slow. ### Option C — Manual Download Download the prebuilt binary for your OS from [Releases](https://github.com/browserwing/browserwing/releases): ```bash # Linux/macOS chmod +x ./browserwing ./browserwing --port 8080 # Windows (PowerShell) ./browserwing.exe --port 8080 ``` ### Option D — Build from Source ```bash # Install deps (Go + pnpm required) make install # Build integrated binary (frontend embedded) make build-embedded ./build/browserwing --port 8080 # Or build all targets and packages make build-all make package ``` ## Quick Integration with AI Tools **Three Ways to Use BrowserWing:** ### 1. MCP Server Integration Configure BrowserWing as an MCP server in any MCP-compatible AI tool: ```json { "mcpServers": { "browserwing": { "type": "http", "url": "http://localhost:8080/api/v1/mcp/message" } } } ``` Paste this configuration into your AI tool's MCP settings to enable browser automation capabilities. ### 2. Skills File Integration Download and import the Skills file into any AI tool that supports the Skills protocol: 1. Start BrowserWing 2. Download [SKILL.md](https://raw.githubusercontent.com/browserwing/browserwing/refs/heads/main/SKILL.md) from the repository 3. Import into your AI tool's Skills settings 4. Start automating with natural language commands **Example:** ``` "Navigate to example.com, search for 'AI tools', and extract the top 5 results" ``` ### 3. Direct AI Agent Interface Use BrowserWing's built-in AI Agent for immediate browser automation: 1. Open BrowserWing web interface at `http://localhost:8080` 2. Navigate to "AI Agent" section 3. Configure your LLM (OpenAI, Claude, DeepSeek, etc.) 4. Start conversational browser automation **Export Custom Scripts:** ```bash # Export your recorded scripts as Skills or MCP commands curl -X POST 'http://localhost:8080/api/v1/scripts/export/skill' \ -H 'Content-Type: application/json' \ -d '{"script_ids": []}' \ -o MY_CUSTOM_SCRIPTS.md ``` ## Why BrowserWing **Professional Browser Automation with AI Integration** - **Universal Protocol Support**: Native MCP & Skills implementation works with any compatible AI tool - **Complete Automation API**: 26+ HTTP endpoints providing comprehensive browser control capabilities - **Flexible Integration Options**: Use as MCP server, Skills file, or standalone AI Agent - **Visual Workflow Builder**: Record, edit, and replay browser actions without writing code - **Token-Efficient Design**: Optimized for LLM usage with fast performance and minimal token consumption - **Production-Ready**: Stable session management, cookie handling, and error recovery - **Extensible Architecture**: Convert recorded scripts to reusable MCP commands or Skills files - **Multi-LLM Support**: Works with OpenAI, Anthropic, DeepSeek, and other providers - **Enterprise Use Cases**: Data extraction, RPA, testing, monitoring, and agent-driven automation ## Usage Guide ### Getting Started in Three Steps 1. **Choose Integration Method** - Copy MCP server configuration for AI tool integration - Download Skills file for Skills-compatible AI tools - Or use built-in AI Agent for immediate access 2. **Configure Your AI Tool** - Import MCP configuration or Skills file into your preferred AI tool - Configure LLM settings (API keys, model selection) - Verify connection to BrowserWing 3. **Start Automating** - Control browser through natural language commands - Record custom scripts for repeated tasks - Export scripts as MCP commands or Skills for reuse ### Advanced Workflows **For Browser Automation:** - Launch and manage multiple browser instances - Configure profiles, proxies, and browser settings - Handle cookies and authentication sessions - Execute complex interaction sequences **For Script Recording:** - Capture clicks, inputs, navigation, and waits - Edit actions visually in the script editor - Test and debug with step-by-step replay - Add variables and conditional logic **For AI Integration:** - Convert scripts to MCP commands or Skills files - Integrate with multiple LLM providers - Use semantic extraction for data parsing - Build agent-driven automation workflows ### HTTP API Reference BrowserWing exposes 26+ RESTful endpoints for programmatic browser control: **Navigation & Control** - Navigate to URLs, go back/forward, refresh pages - Manage browser windows and tabs - Handle page loading and timeouts **Element Interaction** - Click, type, select, and hover actions - File uploads and form submissions - Keyboard shortcuts and key presses **Data Extraction** - Extract text, HTML, and attributes - Semantic content analysis with LLM - Screenshot capture (full page or element) **Advanced Operations** - Execute custom JavaScript - Manage cookies and local storage - Batch operations for efficiency - Wait conditions and element visibility **Complete Documentation**: See `docs/EXECUTOR_HTTP_API.md` for detailed endpoint specifications ## Contributing - Issues and PRs are welcome. Please include clear steps to reproduce or a concise rationale. - For feature ideas, open a discussion with use cases and expected outcomes. ## Community Discord: [https://discord.gg/BkqcApRj](https://discord.gg/BkqcApRj) twitter: [https://x.com/chg80333](https://x.com/chg80333) ## Acknowledgements - Inspired by modern browser automation, agentic workflows, and MCP. ## License - MIT License. See `LICENSE`. ## Disclaimer - Do not use for illegal purposes or to violate site terms. - Intended for personal learning and legitimate automation only.
🛠 部署服务

如果你在部署 browserwing/browserwing 项目时碰到困难,请联系我们。

我们有技术团队提供专业化的项目安装、运维服务。

联系我们 →
微信客服

添加注明:会员账号

微信客服二维码