Carlos-Projects/agenthive
GitHub: Carlos-Projects/agenthive
Stars: 0 | Forks: 0
# AgentHive
Multi-agent attack simulation framework for AI systems.
[](https://pypi.org/project/agenthive-sim/)
[](https://pypi.org/project/agenthive-sim/)
[](LICENSE)
[](https://github.com/Carlos-Projects/agenthive/actions/workflows/ci.yml)
AgentHive extends [mcpwn](https://github.com/Carlos-Projects/mcpwn) from the single-agent domain to the **multi-agent domain**, modeling attacks that are only possible when multiple AI agents interact with each other.
## Installation
pip install agenthive-sim
For development:
pip install -e ".[dev,lab]"
## Quick Start
# Generate a scenario template
agenthive scenario my-scenario -o scenario.yaml
# Run a simulation
agenthive simulate scenario.yaml -v
# List available attack categories
agenthive list-scenarios
# Start the vulnerable lab server
agenthive lab
## Scenario YAML Format
Scenarios define the agent topology and attacks to simulate:
name: my-scenario
description: "Multi-agent SSRF through agent chain"
max_steps: 50 # max simulation steps before timeout
timeout_seconds: 300
agents:
- role: attacker # attacker | victim | observer | coordinator
name: red-agent-1
capabilities:
- prompt_injection
- tool_manipulation
- role: victim
name: blue-agent-1
capabilities:
- data_processing
- tool_usage
- role: victim
name: blue-agent-2
capabilities:
- collaboration
- file_operations
- role: observer
name: observer-1
capabilities: [monitoring]
attacks:
- category: tool_drift # must match AttackCategory
name: "Tool Drift"
description: "Drift tool definitions via shared memory"
severity: high # critical | high | medium | low | info
parameters: {} # attack-specific parameters
mitre_atlas: ["ATLAS-001"]
metadata:
environment: lab
difficulty: medium
## Ecosystem Integration
AgentHive integrates with the MCP security ecosystem:
- **mcp-taxonomy**: Findings are convertible via `agenthive_finding_to_taxonomy()` for unified correlation
- **MCPscop**: SARIF and JSON reports are consumable by MCPscop dashboards
- **mcpwn**: Extends mcpwn patterns from single-agent to multi-agent domain
## Attack Scenarios
| Category | Description | Severity |
|---|---|---|
| `tool_drift` | Exploit tool derivation caused by shared memory between agents | High |
| `long_horizon` | RL-based attacks spanning multiple agents in sequence | Critical |
| `collaboration_attack` | Manipulation of agent-to-agent collaboration | High |
| `authority_hijack` | Hijacking the authority chain between agents | Critical |
| `cross_agent_injection` | Prompt injection that propagates across agents | Critical |
| `multi_agent_ssrf` | Coordinated SSRF through multiple agents | High |
| `swarm_poisoning` | Poison one agent that propagates to the swarm | Critical |
| `identity_spoofing` | Identity spoofing between agents | High |
## Academic References
- [Evo-Attacker: Memory-Augmented RL for Long-Horizon Tool Attacks](https://arxiv.org/abs/2605.25389) (ACL 2026)
- [Memory-Induced Tool-Drift in LLM Agents](https://arxiv.org/abs/2605.24941)
- [Behind EvoMap: Agent-to-Agent Collaboration Network](https://arxiv.org/abs/2605.25815)
- [Authority Frontier Framework for Runtime Actuarial Control](https://arxiv.org/abs/2605.25632)
- [Deep-Research Agents Can Be Poisoned](https://arxiv.org/abs/2605.24294) (Shmatikov et al.)
- [MITRE ATLAS — Multi-Agent System Attack Patterns](https://atlas.mitre.org/techniques/)
## Ecosystem
- Extends [mcpwn](https://github.com/Carlos-Projects/mcpwn) — same stack, multi-agent domain
- Uses [mcp-taxonomy](https://github.com/Carlos-Projects/mcp-taxonomy)
- Reports consumable by [MCPscop](https://github.com/Carlos-Projects/mcpscope)
## License
MIT — see [LICENSE](LICENSE)