Andrew0x7/incidentiq
GitHub: Andrew0x7/incidentiq
Stars: 0 | Forks: 0
# IncidentIQ — Multi-Agent Incident Response
## 🚩 Pain Point
Production incidents cost enterprises **$5,600/minute** on average (Gartner). Engineers spend 60% of incident time manually correlating logs across services, identifying root cause, and coordinating response. A single misdiagnosis can extend MTTR by hours, costing millions.
## 🧠 Multi-Agent Architecture
IncidentIQ deploys **5 specialized agents** orchestrated by MiMo V2.5:
| Agent | Focus | Techniques |
|-------|-------|-----------|
| 📋 Log Parser | Multi-source log ingestion & correlation | Pattern matching, temporal correlation |
| 🔭 Trace Analyzer | Distributed tracing & latency profiling | Span analysis, cascade detection |
| 🧠 Root Cause | Causal inference & dependency mapping | Bayesian networks, symptom filtering |
| 🔧 Fix Generator | Auto-remediation & rollback plans | Template matching, risk scoring |
| 🚨 Escalation | Severity routing & stakeholder comms | Blast radius, SLA tracking |
### Response Pipeline
1. **Ingestion** — Normalize logs from Docker, K8s, CloudWatch, application traces
2. **Analysis** — All 5 agents analyze in parallel → correlate findings
3. **Resolution** — Generate fixes → route escalation → produce report
## 🔧 Tech Stack
- **Engine**: MiMo V2.5 (orchestration & natural language analysis)
- **Log Parsing**: Multi-format parser (JSON, syslog, structured, unstructured)
- **Tracing**: OpenTelemetry-compatible span analysis
- **Frontend**: Vanilla JS (zero dependencies, instant load)
- **Deployment**: GitHub Pages (static)
## 📊 Metrics
- **14+ incident patterns** detected across 5 categories
- **Severity scoring**: P0 (Critical) → P3 (Low)
- **Timeline reconstruction** from raw logs
- **Auto-fix suggestions** with risk assessment
## 🚀 Usage
1. Paste error logs, stack traces, or incident reports
2. Click **⚡ Analyze**
3. Review findings across Analyze, Timeline, Agents, and Pipeline views
4. Follow recommended fixes per finding
## 📄 License
标签:后端开发