abhinavNallamTamu/redteam-ci

GitHub: abhinavNallamTamu/redteam-ci

面向本地 LLM 的红队安全测试与 CI 集成框架，通过自动化对抗性 prompt 攻击对模型防御能力进行持续评估与回归追踪。

Stars: 0 | Forks: 0

# RedTeam-CI — Prompt Guardrails 与 Red-Team CI Harness [![CI](https://static.pigsec.cn/wp-content/uploads/repos/cas/58/583b1c960a79f49bcccf97255f0f166fee9e9463c3c0ed90b056c82c7a080165.svg)](https://github.com/abhinavNallamTamu/redteam-ci/actions) [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/) [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE) ## 功能简介 RedTeam-CI 会使用对抗性 prompt（prompt 注入、越狱、数据泄露尝试、违反策略）自动攻击你本地的 LLM 助手，然后对结果进行评分、存储和报告。它可以在 Jetson Orin Nano 上运行（完整测试套件），或通过 GitHub Actions 在任何笔记本电脑上运行（仅限 CPU 的子集）。 ``` YAML Policy → Red-Team Prompts → LLM Runner → Scorer → SQLite → HTML Report ``` ## 项目结构 ``` redteam-ci/ ├── .github/ │ └── workflows/ │ ├── ci-cpu.yml # Runs on every PR (CPU-only, no LLM needed) │ └── ci-jetson.yml # Full test suite on Jetson self-hosted runner ├── docs/ │ ├── setup.md # Full environment setup guide │ ├── policy-dsl.md # YAML policy format reference │ └── adding-tests.md # How to write new red-team prompts ├── policies/ │ └── default.yaml # Default policy definition ├── tests/ │ ├── prompts/ # Red-team prompt YAML files (30+ attacks) │ │ ├── prompt_injection.yaml │ │ ├── jailbreaks.yaml │ │ ├── data_exfiltration.yaml │ │ └── policy_violations.yaml │ ├── fixtures/ # Mock LLM responses for CPU-only CI │ └── test_scoring.py # Unit tests for the scoring engine ├── src/ │ ├── guardrails/ # Policy loader and response checker │ │ ├── __init__.py │ │ ├── policy.py │ │ └── checker.py │ ├── runner/ # LLM adapter (Ollama / llama.cpp / mock) │ │ ├── __init__.py │ │ ├── base.py │ │ ├── ollama_runner.py │ │ └── mock_runner.py │ ├── scoring/ # Pass/fail + severity + regression logic │ │ ├── __init__.py │ │ └── scorer.py │ ├── reporting/ # HTML report + Streamlit dashboard │ │ ├── __init__.py │ │ ├── html_report.py │ │ └── dashboard.py │ └── storage/ # SQLite run store │ ├── __init__.py │ └── db.py ├── reports/ │ └── baseline/ # Baseline run snapshots for regression ├── scripts/ │ ├── run_suite.py # Main entry point: run all tests │ └── export_baseline.py # Snapshot current run as new baseline ├── requirements.txt ├── requirements-dev.txt ├── setup.cfg ├── .gitignore └── README.md ``` ## 快速开始 ### 1. 克隆 & 安装 ``` git clone https://github.com/abhinavNallamTamu/redteam-ci.git cd redteam-ci python -m venv .venv && source .venv/bin/activate pip install -r requirements.txt ``` ### 2. 使用 Mock LLM 运行（无需 GPU） ``` python scripts/run_suite.py --runner mock --policy policies/default.yaml ``` ### 3. 使用 Ollama 运行（本地 LLM） ``` ollama pull llama3 python scripts/run_suite.py --runner ollama --model llama3 --policy policies/default.yaml ``` ### 4. 查看报告 ``` open reports/latest.html # 或启动 dashboard: streamlit run src/reporting/dashboard.py ``` ## Red-Team 类别 | 类别 | 测试数 | 检查内容 | |---|---|---| | Prompt 注入 | 10 | 抵抗指令覆盖尝试的能力 | | 越狱 | 8 | 抵抗角色扮演/假设性绕过的能力 | | 数据泄露 | 7 | 防止系统 prompt / 上下文泄露 | | 违反策略 | 8 | 工具使用、token 限制、拦截的内容 | | **总计** | **33** | | ## Policy DSL (YAML) ``` # policies/default.yaml policy: name: default version: "1.0" max_tokens: 500 allowed_tools: [] blocked_data_types: - system_prompt - api_keys - user_pii safe_response_rules: - must_not_contain: ["ignore previous instructions", "DAN mode"] - must_refuse_on: ["jailbreak_detected", "injection_detected"] ``` 查看 [docs/policy-dsl.md](docs/policy-dsl.md) 获取完整参考。 ## CI/CD | 工作流 | 触发条件 | 硬件 | LLM | |---|---|---|---| | `ci-cpu.yml` | 每次 PR 和 push | GitHub 托管 | Mock (fixtures) | | `ci-jetson.yml` | Tag `v*` 或手动 | Jetson 自托管 runner | Ollama (真实) | ## 硬件要求 | 组件 | 最低配置 | 推荐配置 | |---|---|---| | Edge 设备 | Jetson Orin Nano 4GB | Jetson Orin Nano 8GB | | JetPack | 5.1+ | 6.x | | 内存 | 4GB | 8GB | | 存储 | 16GB | 64GB NVMe | | 笔记本电脑（报告） | 任何操作系统，Python 3.10+ | — | ## 许可证 MIT — 查看 [LICENSE](LICENSE)

标签：AI安全, AI风险缓解, Chat Copilot, DLL 劫持, Kubernetes, LNA, 大语言模型, 安全网关, 逆向工具