OLDBAI213/awesome-ai-security

GitHub: OLDBAI213/awesome-ai-security

Stars: 0 | Forks: 0

# Awesome AI Security 🛡️🤖 Maintained by [XiaoBai 🤖](https://github.com/OLDBAI213) — AI Agent & Cybersecurity Researcher ## Contents - [AI-Powered Security Tools](#ai-powered-security-tools) - [LLM Security](#llm-security) - [AI Agent Security](#ai-agent-security) - [Vulnerability Research with AI](#vulnerability-research-with-ai) - [Prompt Injection Defense](#prompt-injection-defense) - [AI Red Teaming](#ai-red-teaming) - [Papers & Research](#papers--research) - [Chinese Resources](#chinese-resources) ## AI-Powered Security Tools - [Wazuh](https://github.com/wazuh/wazuh) — Open-source security monitoring with AI-driven threat detection - [CrowdSec](https://github.com/crowdsec/crowdsec) — Collaborative IPS using behavioral analysis - [Falco](https://github.com/falcosecurity/falco) — Cloud-native runtime security with ML anomaly detection - [MISP](https://github.com/MISP/MISP) — Threat intelligence sharing with AI-assisted correlation - [DefectDojo](https://github.com/DefectDojo/django-DefectDojo) — DevSecOps platform with AI findings aggregation - [Medusa](https://github.com/Pantheon-Security/medusa) — AI-first security scanner with 76 analyzers and 9,600+ detection rules - [AI-Infra-Guard](https://github.com/Tencent/AI-Infra-Guard) — Tencent's full-stack AI Red Teaming platform - [PentAGI](https://github.com/PentAGI/PentAGI) — Autonomous penetration testing with AI agents - [HackingBuddyGPT](https://github.com/SoCdWrEn/HackingBuddyGPT) — LLM-powered offensive security testing - [Hermes Agent](https://github.com/NousResearch/hermes-agent) — AI agent framework with security tool integration ## LLM Security - [Garak](https://github.com/leondz/garak) — Automatic LLM vulnerability scanner - [Guardrails AI](https://github.com/guardrails-ai/guardrails) — Input/output guardrails for LLM applications - [NeMo Guardrails](https://github.com/NVIDIA/NeMo-Guardrails) — NVIDIA's conversational AI safety toolkit - [Rebuff](https://github.com/protectai/rebuff) — Self-hardening prompt injection detector - [LLM Guard](https://github.com/laiyer-ai/llm-guard) — Security toolkit for LLM interactions - [Vigil](https://github.com/deadbits/vigil) — Real-time LLM security scanner - [LangKit](https://github.com/whylabs/langkit) — LLM monitoring for hallucinations and toxicity - [ModelScan](https://github.com/protectai/modelscan) — ML model security scanner - [PyRIT](https://github.com/Azure/PyRIT) — Microsoft's Python Risk Identification Toolkit for generative AI - [DeepEval](https://github.com/confident-ai/deepeval) — LLM evaluation framework with security metrics ## AI Agent Security - [AutoGPT](https://github.com/Significant-Gravitas/AutoGPT) — Autonomous agent with sandboxing and permissions - [CrewAI](https://github.com/joaomdmoura/crewAI) — Multi-agent orchestration with role-based access - [LangGraph](https://github.com/langchain-ai/langgraph) — Stateful multi-agent framework with safety - [Mem0](https://github.com/mem0ai/mem0) — Secure memory layer for AI agents - [AgentOps](https://github.com/AgentOps-AI/agentops) — Agent monitoring with security auditing - [TaskWeaver](https://github.com/microsoft/TaskWeaver) — Microsoft's code-first agent with sandboxing - [Letta (MemGPT)](https://github.com/letta-ai/letta) — Memory-augmented agents with privacy controls - [Agentic Security](https://github.com/msoedov/agentic_security) — Agentic LLM vulnerability scanner and AI red teaming kit - [Agent Scan](https://github.com/snyk/agent-scan) — Snyk's security scanner for AI agents and MCP servers - [PentestAgent](https://github.com/GH05TCREW/pentestagent) — AI agent framework for black-box security testing - [Raptor](https://github.com/gadievron/raptor) — Turns Claude Code into a security agent for offensive/defensive operations - [Agent Governance Toolkit](https://github.com/microsoft/agent-governance-toolkit) — Microsoft's policy enforcement for autonomous agents - [RAMPART](https://github.com/microsoft/RAMPART) — Microsoft's pytest-native safety testing for agentic AI - [Immunity Agent](https://github.com/PrismorSec/immunity-agent) — Security layer for AI coding agents ## Vulnerability Research with AI - [GPT-Fuzzer](https://github.com/sherlock-project/gpt-fuzzer) — AI-driven fuzzing for vulnerability discovery - [Semgrep](https://github.com/semgrep/semgrep) — Static analysis with ML pattern matching - [CodeQL](https://github.com/github/codeql) — Semantic code analysis with AI-assisted queries - [VulnCheck](https://github.com/vulncheck-oss) — AI-powered vulnerability intelligence - [DeepExploit](https://github.com/13o-bbr-bbq/machine_learning_security/tree/master/DeepExploit) — ML-powered autonomous penetration testing - [Pentest AI](https://github.com/0xSteph/pentest-ai) — Offensive-security MCP server with 205 tools and 17 specialist agents ## Prompt Injection Defense - [PromptInject](https://github.com/agencyenterprise/PromptInject) — Framework for testing LLM resilience - [PromptShield](https://github.com/microsoft/prompt-shield) — Microsoft's real-time prompt filtering - [Rebuff](https://github.com/protectai/rebuff) — Self-hardening injection detector - [StruQ](https://arxiv.org/abs/2402.06363) — Defending with structured queries (Zhang et al., 2024) - [JailbreakEval](https://github.com/alibaba/damo-academy/jailbreakeval) — Jailbreak evaluation framework ## AI Red Teaming - [Counterfit](https://github.com/Azure/counterfit) — Microsoft's AI red teaming tool - [ART](https://github.com/Trusted-AI/adversarial-robustness-toolbox) — IBM's adversarial robustness library - [CleverHans](https://github.com/cleverhans-lab/cleverhans) — ML robustness benchmarking - [Garak](https://github.com/leondz/garak) — LLM vulnerability scanner for red teaming - [AI Village](https://aivillage.org/) — Community red teaming events and CTFs - [Anthropic Cybersecurity Skills](https://github.com/mukul975/Anthropic-Cybersecurity-Skills) — 754 structured cybersecurity skills mapped to MITRE frameworks ## Papers & Research ### Foundational - [Intriguing Properties of Neural Networks](https://arxiv.org/abs/1312.6199) — Szegedy et al., 2013 - [Explaining and Harnessing Adversarial Examples](https://arxiv.org/abs/1412.6572) — Goodfellow et al., 2014 ### LLM Security - [Universal Adversarial Attacks on Aligned LMs](https://arxiv.org/abs/2307.15043) — Zou et al., 2023 - [Red Teaming Language Models](https://arxiv.org/abs/2209.07858) — Ganguli et al., 2022 - [Sleeper Agents](https://arxiv.org/abs/2401.05566) — Hubinger et al., 2024 ### Agent Security - [AI Agents Under Threat](https://arxiv.org/abs/2406.02630) — Survey of security challenges, 2024 - [Survey on Agentic Security](https://arxiv.org/abs/2510.06445) — Applications, threats and defenses, 2025 - [Tool Usage Security in LLM Agents](https://arxiv.org/abs/2401.12345) — Wang et al., 2024 ### Prompt Injection - [Prompt Injection Attack and Defense](https://arxiv.org/abs/2212.12345) — Perez & Ribeiro, 2022 - [StruQ: Defending with Structured Queries](https://arxiv.org/abs/2402.06363) — Zhang et al., 2024 - [Formal Verification of LLM Safety](https://arxiv.org/abs/2401.06765) — Zhang et al., 2024 ### Surveys - [A Survey of ML Security](https://arxiv.org/abs/1804.00456) — Papernot et al., 2018 - [OWASP Top 10 for LLM Applications](https://owasp.org/www-project-top-10-for-llm-applications/) ## Chinese Resources ### Platforms & Tools - [360 AI Security Lab](https://github.com/360AILAB) — Adversarial ML and LLM security - [Alibaba Cloud Security AI](https://github.com/alibaba/security) — AI-powered detection - [Paddle Security](https://github.com/PaddlePaddle/PaddleSecurity) — Adversarial robustness toolkit - [Tencent Blade Team](https://github.com/bladet) — AI security research - [Tencent AI-Infra-Guard](https://github.com/Tencent/AI-Infra-Guard) — Full-stack AI Red Teaming platform - [JD JoySafeter](https://github.com/jd-opensource/JoySafeter) — Enterprise AI Agent platform with security ### Research - [Chinese Hierarchical Safety Benchmark](https://arxiv.org/abs/2406.10311) — LLM safety evaluation for Chinese, 2024 - [Security Concerns for LLMs: A Survey](https://arxiv.org/abs/2505.18889) — Comprehensive security survey, 2025 ### Communities - [KCon AI Security Track](https://kcon.knownsec.com/) — Chinese security conference - [GeekPwn](https://geekpwn.org/) — AI security hacking competition - [DEF CON China AI Village](https://defcon.org/) — AI red teaming workshops