sandraschi/windows-computer-use-mcp

GitHub: sandraschi/windows-computer-use-mcp

面向 AI Agent 的 Windows 桌面自动化框架，提供 22 个 MCP 工具与自主任务引擎，让 LLM 能够操控真实 Windows 应用。

Stars: 23 | Forks: 6

# windows-computer-use-mcp

**一个为 agent 打造的工具，同时它本身也是一个 agent。** | 你 | 它 | |-----|----| | 作为 **MCP server** 使用 | Claude, Cursor, DeepSeek 调用 `automation_click`, `automation_screenshot`, `automation_ocr` — 共 22 个工具 | | 作为 **自主 agent** 使用 | 给它设定一个目标：`automation_mission(run="install app, verify UI, screenshot result")` — 它会自行规划、执行、重试并报告 | | 作为 **Web 应用** 使用 | `start.ps1` 将在 http://127.0.0.1:10788 开启一个 React 控制台，支持 HITL、爬虫和日志记录 | | 作为 **桌面应用** 使用 | NSIS 安装程序将所有内容打包成一个二进制文件 — 无需 Python, uv 或 git | 基于 [pywinauto](https://github.com/pywinauto/pywinauto) 构建。在生产环境使用前，请务必阅读 **[docs/SAFETY.md](docs/SAFETY.md)**。 - [快速开始](#quick-start) - [功能](#features) - [文档](#documentation) - [端口](#ports) - [许可证](#license) ## 快速开始 | 方法 | 命令 / 配置 | |--------|-----------------| | **MCP stdio** (Cursor, Claude Desktop) | `{ "mcpServers": { "windows-computer-use": { "command": "uv", "args": ["--directory", "", "run", "windows-computer-use-mcp"] } } }` | | **HTTP streamable** (任何 MCP HTTP 客户端) | `{ "mcpServers": { "windows-computer-use": { "url": "http://127.0.0.1:10789/mcp" } } }` | | **Web 操作 UI** | `.\start.ps1` → http://127.0.0.1:10788 | | **桌面应用** (NSIS 安装程序) | 从 [Releases](https://github.com/sandraschi/windows-computer-use-mcp/releases) 下载 — 零依赖 | 有关详细的安装说明，请参阅 **[INSTALL.md](INSTALL.md)**。运行 `just demo` 查看示例。 ## 功能 - **窗口管理** — 查找、激活、最大化、最小化、定位、关闭 - **鼠标与键盘** — 点击、拖拽、输入、快捷键、应用快捷方式 - **UI 元素** — 通过 UIA / Win32 检查、点击、读取文本、验证状态 - **视觉智能** — 截图、OCR、模板匹配 - **自主任务** — 给定目标，自主规划并执行，包含重试与验证 - **宏录制** — 录制任意 UI 操作序列，回放并验证结果 - **多应用工作流** — 在记事本、计算器、画图或任何 Windows 应用间串联操作 - **遥测** — 每一项操作均记录至 SQLite；可按工具查询失败模式 - **自适应定位** — 自动依次通过 title/auto_id/control_id/class/OCR 寻找元素 - **人脸识别** — 可选功能，默认关闭 ## 文档 | 文档 | 内容 | |-----|---------| | [INSTALL.md](INSTALL.md) | 设置：桌面应用、uv、MCP 配置 | | [docs/README.md](docs/README.md) | 完整文档中心 | | [docs/py-stack.md](docs/py-stack.md) | Python 依赖项深度解析 | | [docs/composing-with-playwright.md](docs/composing-with-playwright.md) | 使用 Playwright MCP 进行浏览器自动化 | | [docs/ocr.md](docs/ocr.md) | OCR 系统 — Tesseract 设置、限制与竞品对比 | | [docs/cua-nsis-certification.md](docs/cua-nsis-certification.md) | 吃自己的狗粮：使用该工具测试其自身的 NSIS 安装程序 | | [docs/ROADMAP.md](docs/ROADMAP.md) | 短期/中期/长期改进路线图 | | [docs/SAFETY.md](docs/SAFETY.md) | HITL、紧急停止开关、选择性开启的功能 | | [docs/TOOLS.md](docs/TOOLS.md) | 合成工具参考 | | [tests/README.md](tests/README.md) | 测试套件指南与 e2e 设置 | | [examples/README.md](examples/README.md) | 可运行的演示 | | [mcpb/README.md](mcpb/README.md) | MCPB bundle 打包 | | [web_sota/README.md](web_sota/README.md) | Operator UI 构建/开发指南 | | [CHANGELOG.md](CHANGELOG.md) | 发布历史 | ## 端口 | 端口 | 服务 | |------|---------| | **10788** | 前端 — Vite Operator UI | | **10789** | 后端 — FastAPI + FastMCP HTTP | | stdio | MCP transport (无端口) | ## 相关项目 | 仓库 | 功能描述 | |------|-------------| | **[autohotkey-mcp](https://github.com/sandraschi/autohotkey-mcp)** | 通过 AHK 进行原始输入录制/回放 | | **[browser-mcp](https://github.com/sandraschi/browser-mcp)** | Playwright 浏览器控制 — 专为 **Web 应用、HTML DOM、网站** 设计 | | **[virtualization-mcp](https://github.com/sandraschi/virtualization-mcp)** | 沙箱 / VM 隔离 | | **[windows-operations-mcp](https://github.com/sandraschi/windows-operations-mcp)** | 注册表、服务、账户管理 | Fleet 标准：[mcp-central-docs](https://github.com/sandraschi/mcp-central-docs)。 ## 许可证 MIT — Copyright (c) 2026 Sandra Schipal.

标签：逆向工具