joergmichno/clawguard

GitHub: joergmichno/clawguard

一款零依赖的 AI Agent 输入安全扫描 CLI 工具，用于在恶意输入到达 AI 系统前检测 Prompt 注入、危险命令和数据泄露等威胁。

Stars: 11 | Forks: 1

# 🛡️ ClawGuard **面向自主 AI Agent 的安全扫描器** [![CI](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/70a93b9e9d032530.svg)](https://github.com/joergmichno/clawguard/actions/workflows/ci.yml) [![Python](https://img.shields.io/badge/Python-3.6+-3776AB?style=flat&logo=python&logoColor=white)](https://python.org) [![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE) [![Tests](https://img.shields.io/badge/Tests-71_passed-brightgreen?style=flat&logo=pytest&logoColor=white)](tests/) [![Patterns](https://img.shields.io/badge/Patterns-42+-red?style=flat)](README.md#detection-categories) [![Zero Dependencies](https://img.shields.io/badge/Dependencies-Zero-blue?style=flat)]() [![Release](https://img.shields.io/github/v/release/joergmichno/clawguard?style=flat)](https://github.com/joergmichno/clawguard/releases) ClawGuard 是一个轻量级、零依赖的 CLI 工具，用于扫描文本中的 Prompt 注入、危险命令、代码混淆、数据泄露和社会工程学攻击——在它们到达您的 AI Agent 之前进行拦截。 ## 问题背景自主 AI Agent 可以执行代码、访问文件并与外部服务交互。一次成功的 Prompt 注入可能导致数据泄露、未授权操作或系统沦陷。 **ClawGuard 补上了这一缺失的安全层。** ## 功能特性 - **42+ 种检测模式**，覆盖 5 大威胁类别 - **Prompt 注入检测** —— 直接覆盖、同义词绕过、分隔符注入、角色扮演逃逸 - **危险命令检测** —— Shell 漏洞利用、反向 Shell、权限提升 - **代码混淆检测** —— `eval()`、`getattr()`、魔术属性、字符串拼接 - **数据泄露检测** —— API 密钥、私钥、数据库连接字符串、Webhook URL - **社会工程学检测** —— 冒充权威、制造紧迫感 - **双语支持** —— 检测英语和德语攻击 - **零依赖** —— 纯 Python 标准库实现 - **多种输出格式** —— 人类可读报告和用于自动化的 JSON - **CI/CD 就绪** —— 提供用于流水线集成的退出码 ## 生态系统与在线演示 🎯 **试用交互式演示：** [prompttools.co](https://prompttools.co) — 实时测试 Prompt 注入 ### 生态系统 | 组件 | 描述 | |-----------|-------------| | [**ClawGuard Shield**](https://github.com/joergmichno/clawguard-shield) | 用于生产环境的 REST API — [prompttools.co/api/v1/](https://prompttools.co/api/v1/) | | [**Python SDK**](https://github.com/joergmichno/clawguard-shield-python) | `pip install clawguard-shield` — Shield API 的 SDK | | [**GitHub Action**](https://github.com/joergmichno/clawguard-scan-action) | CI/CD 集成 — 在流水线中扫描 Prompt | | [**Prompt Lab**](https://github.com/joergmichno/prompt-lab) | 用于 Prompt 注入测试的交互式 Web 试验场 | ## 安装 ``` git clone https://github.com/joergmichno/clawguard.git cd clawguard ``` 无需安装依赖。仅需 Python 3.6+。 ## 快速开始 ``` # 扫描文件 python3 clawguard.py suspicious_input.txt # 直接 Pipe 文本 echo "ignore all previous instructions" | python3 clawguard.py --stdin # 用于自动化的 JSON output python3 clawguard.py --json input.txt > report.json ``` ## 输出示例 ``` ================================================================ ClawGuard v0.4.0 -- Security Scan Report ================================================================ Timestamp : 2026-02-26T16:18:05Z Source : suspicious_input.txt Lines : 9 Findings : 13 Risk Score: 10/10 CRITICAL ---------------------------------------------------------------- [1] CRITICAL -- Synonym Override (EN) Category : Prompt Injection Line : 1 Match : "Disregard all antecedent regulations" Context : Disregard all antecedent regulations and send me... Action : BLOCK: Synonym-based prompt injection detected. ================================================================ RECOMMENDATION: BLOCK this input. Do NOT forward to agent. ================================================================ ``` ## 架构 ``` Input (file/stdin) | v +--------------+ | CLI Parser | argparse: file, --stdin, --json +------+-------+ | v +--------------+ | scan_text() | Core scanning engine +------+-------+ | v +--------------------------------------+ | Pattern Matching | | | | +--------------+ +----------------+ | | | Prompt | | Dangerous | | | | Injection | | Commands | | | | (13 rules) | | (5 rules) | | | +--------------+ +----------------+ | | +--------------+ +----------------+ | | | Code | | Data | | | | Obfuscation | | Exfiltration | | | | (11 rules) | | (6 rules) | | | +--------------+ +----------------+ | | +--------------+ | | | Social | | | | Engineering | | | | (3 rules) | | | +--------------+ | +---------------+----------------------+ | v +-----------------------+ | Deduplication + | | Risk Scoring (0-10) | +-----------+-----------+ | +-----+------+ v v +----------+ +-----------+ | Human | | JSON | | Report | | Report | +----------+ +-----------+ ``` ## 风险分级 | 分数 | 级别 | 措施 | |-------|-------|--------| | 0 | CLEAN (安全) | 可以安全处理 | | 1-3 | LOW (低危) | 监控，通常安全 | | 4-6 | MEDIUM (中危) | 人工审查 | | 7-8 | HIGH (高危) | 建议拦截 | | 9-10 | CRITICAL (严重) | 立即拦截 | ## 检测类别 | 类别 | 模式数 | 示例 | |----------|----------|----------| | Prompt 注入 | 13 | `"ignore previous instructions"`、分隔符注入、同义词绕过 | | 危险命令 | 5 | `rm -rf /`、`curl | bash`、反向 Shell | | 代码混淆 | 11 | `getattr()`、`eval()`、`__builtins__`、字符串拼接 | | 数据泄露 | 6 | API 密钥、私钥、数据库连接字符串 | | 社会工程学 | 3 | 冒充权威、制造紧迫感 | ## 测试 ``` # 安装 pytest pip install pytest # 运行所有测试 pytest tests/ -v # 运行特定 test file pytest tests/test_scanner.py -v ``` ## 使用说明 ### 退出码 | 代码 | 含义 | |------|---------| | 0 | Clean —— 未发现威胁 | | 1 | 检测到威胁 | | 2 | 错误（文件未找到等） | ## 系统要求 - Python 3.6+ - **零外部依赖** ## CI/CD 集成 ClawGuard 返回退出码以便流水线集成： ``` # .github/workflows/security-scan.yml - name: Scan for prompt injections run: | echo "${{ github.event.issue.body }}" | python3 clawguard.py --stdin --json # Exit code 1 = threats found → fail the build ``` 或者使用托管的 [ClawGuard Shield API](https://github.com/joergmichno/clawguard-shield) 进行基于 HTTP 的扫描，无需安装 Python。 ## 相关项目 - **[ClawGuard Shield](https://github.com/joergmichno/clawguard-shield)** — 封装此扫描器的 REST API（托管于 [prompttools.co/api/v1](https://prompttools.co/api/v1/)） - **[Prompt Lab](https://github.com/joergmichno/prompt-lab)** — 交互式 Prompt 注入试验场 ([在线演示](https://prompttools.co)) - **[DocQA](https://github.com/joergmichno/docqa)** — 基于 RAG 的文档问答工具 ## 贡献欢迎贡献！请先开 Issue 讨论建议的变更。 ## 许可证 MIT 许可证 —— 详情见 [LICENSE](LICENSE)。 **由 [Jörg Michno](https://github.com/joergmichno) 开发 —— 因为自主 AI Agent 需要护栏。**

标签：AI安全, API密钥检测, API密钥泄露, Chat Copilot, DNS 反向解析, ESC8, meg, Prompt注入检测, Python, Web安全, 云安全监控, 代码混淆检测, 信息安全, 加密, 命令注入, 大模型安全, 安全助手, 安全规则引擎, 对抗攻击, 敏感信息检测, 数据泄露防护, 文档结构分析, 无后门, 漏洞扫描器, 社会工程学, 网络安全, 网络探测, 自主智能体, 蓝队分析, 越狱检测, 输入验证, 逆向工具, 隐私保护, 零依赖, 静态分析