azdabat/Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules

GitHub: azdabat/Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules

这是一个基于KQL的威胁检测框架，通过最小真相和组合评分优化大规模安全运营。

Stars: 0 | Forks: 0

# Minimum Truth Detection Framework # ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE 4.0 INTERNATIONAL (CC BY-NC-SA 4.0) Copyright (c) 2026 Ala Dabat. All Rights Reserved. This work (including all KQL queries, detection logic, documentation, and the "Minimum Truth" Framework architecture) is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. ## You are free to: * **Share** — copy and redistribute the material in any medium or format. * **Adapt** — remix, transform, and build upon the material. ## Under the following terms: * **Attribution** — You must give appropriate credit to **Ala Dabat**, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. * **NonCommercial** — You may **NOT** use the material for commercial purposes (e.g., selling these rules, including them in a paid product, or putting them behind a paywall). * **ShareAlike** — If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original. ╔══════════════════════════════════════════════════════════════════════════════╗ ║ MINIMUM TRUTH DETECTION FRAMEWORK ║ ║ ║ ║ Minimum Truth ──▶ Reinforcement ──▶ Scoring ──▶ Hunter Directive ║ ║ ║ ║ Truth Anchor = Sensor Reinforcement = Evidence ║ ║ Cousins = Adjacent Sensors Incident = Story Stitching ║ ║ ║ ║ The rule is the sensor. The incident is the narrative. ║ ╚══════════════════════════════════════════════════════════════════════════════╝ ## Table of Contents - [Operational Calibration & Testing](#operational-calibration--testing) - [Engineering Notes](#engineering-notes) - [Detection Engineering Lifecycle](#detection-engineering-lifecycle) - [ATT&CK Substrate Adjacency](#attck-substrate-adjacency) - [Attack Ecosystem Intelligence](#attack-ecosystem-intelligence--defeating-temporal-deception) - [Why This Repository Exists](#why-this-repository-exists) - [Detection Maturity Model](#detection-maturity-model) - [Core Doctrine — The Minimum Truth Funnel](#core-doctrine--the-minimum-truth-funnel) - [Substrate-First vs Intent-First](#substrate-first-vs-intent-first-minimum-truth) - [MITRE Ecosystem Coverage — Minimum Truth Anchors & Cousins](#mitre-ecosystem-coverage--minimum-truth-anchors--cousins) - [OAuth Consent Abuse — Applying Both Anchoring Strategies](#oauth-consent-abuse--applying-both-anchoring-strategies) - [Noise Model & Suppression Strategy](#noise-model--suppression-strategy) - [Rarity & Organisational Prevalence](#rarity--organisational-prevalence) - [Correlation vs Ghost Chains](#correlation-vs-ghost-chains) - [Primitive Stitching & Incident Narrative Architecture](#primitive-stitching--incident-narrative-architecture) - [Composite Threat Hunt Portfolio](#composite-threat-hunt-portfolio) - [Architecture Doctrine — At Scale](#architecture-doctrine--at-scale) - [Composite Rule Template](#composite-rule-template--registry-persistence-taskcache) - [Hunter Directives](#hunter-directives) - [The Rule Factory Checklist](#the-rule-factory-checklist) - [Architectural Strategy — Split vs Composite](#architectural-strategy--split-vs-composite) - [Cousin Rules & Attack Ecosystem Coverage](#cousin-rules--attack-ecosystem-coverage) - [Router Rules](#router-rules--rules-that-sit-outside-ecosystems) - [Production Deployment](#production-deployment) - [The ATLAS](#the-attack-ecosystem-atlas) ## Operational Calibration & Testing ## Engineering Notes During validation of the Minimum Truth Detection Framework composite rule set, several recurring implementation pitfalls were identified while stress-testing multiple KQL detections. These issues do **not affect the detection doctrine itself** (*Minimum Truth → Reinforcement → Scoring → Hunter Directive*), but arise from common **KQL engineering edge cases** including: - Prevalence window overlap - Incorrect `leftouter` join handling - SHA256 rarity edge cases - Non-deterministic `any()` summarization - Negative composite score behaviour **[KQL Detection Engineering — Common Implementation Errors](https://github.com/azdabat/Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules/blob/main/KQL%20Detection%20Engineering%20%E2%80%94%20Common%20Implementation%20Errors.md)** This document acts as an engineering reference and lint guide for KQL detection development, capturing the bug classes discovered during composite rule validation. Its purpose is to ensure the framework remains **deterministic, reliable, and production-safe** as additional detection logic is developed. ## Detection Engineering Lifecycle ## ATT&CK Substrate Adjacency **[ATT&CK Substrate Adjacency — Full Document](https://github.com/azdabat/Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules/blob/main/ATT%26CK_Substrate_Adjacency.md)** MITRE ATT&CK models techniques as independent units with vertical depth (technique → sub-technique). What it does not model is **substrate adjacency** — the reality that many techniques represent the same adversary intent executed across different operating system substrates. Lateral movement via **SMB (T1021.002)**, **DCOM (T1021.003)**, and **WinRM (T1021.006)** are operationally interchangeable. An adversary pivots dynamically between them based on firewall restrictions, privileges, and endpoint controls. Treating these as independent creates a **false sense of detection coverage**. The Minimum Truth Detection Framework introduces a **Cousin Technique Doctrine** — modelling adjacent techniques as part of a shared attack ecosystem. This layer sits on top of ATT&CK and enables detection strategies that target **adversary intent** rather than isolated technique identifiers. ## Attack Ecosystem Intelligence — Defeating Temporal Deception Modern adversary tradecraft relies heavily on **Temporal Deception** — staggered C2 jitter, delayed BYOVD kernel exploitation, and automated script loops that pivot dynamically across parallel execution boundaries (Cousin Techniques). When defensive units rely on monolithic, join-dependent kill chains, they face catastrophic query timeouts or miss intrusions entirely due to sequence fracturing over time. This framework formalises a **hybrid architecture**: deploying optimised, single-surface Behavioural Composites to deliver immediate high-confidence alerts (*Hunter Directives*), while concurrently running a silent Incident-Layer Stitching Engine mapped to common entity keys (`DeviceName`, `AccountName`). By separating the sensor architecture from chronological storytelling, this framework achieves true scale-safe efficiency — forcing immediate narrative convergence the exact second an attacker touches an un-bypassable telemetric substrate choke point. ## Why This Repository Exists Most SOCs struggle with threat hunting not because they lack tools, but because: - Detections are **over-engineered** — monolithic queries that collapse under production load - Behavioural chains are **forced where they are not required** — ghost chains producing false certainty - Analysts are overwhelmed by **noise disguised as intelligence** - Rules are written without regard for **SOC operating reality** This repository documents a **deliberate, operationally grounded methodology** for threat hunting that: - Scales to real SOC teams under real enterprise load - Preserves signal fidelity without brittle allowlists - Reduces analyst fatigue through contextual scoring - Applies behavioural correlation **only when the attack structurally requires it** **Focus:** Practical, adversary-informed threat hunting for real SOC environments **Audience:** L2 / L2.5 Threat Hunters, Detection Engineers, Security Leads ## Detection Maturity Model ### Reductive Baseline — Truth First Every attack technique has a **minimum condition that must be true**. If that condition is not met, the detection should not exist. This prevents speculative or assumption-driven hunting. ### Composite L2 / L2.5 Hunts Most attacks do **not** require full behavioural chains. This repository focuses on Composite Hunts that group related high-signal indicators, prefer single telemetry sources, and use minimal joins only when unavoidable. This is where **most effective threat hunting lives**. ### Reinforcement — Confidence, Not Dependency Once baseline truth is met, confidence is increased using parent/child execution context, suspicious paths, network proximity, and rarity/prevalence. Reinforcement improves fidelity. Reinforcement reduces noise. **Reinforcement never defines the attack.** ### Behavioural Chains — Used Sparingly Correlation is used **only when the attack cannot exist without multiple linked events**. DLL sideloading is the canonical example — neither the DLL drop nor the binary load alone proves the technique. Only together, correlated within a time window, do they constitute truth. flowchart LR subgraph Noise["Each event alone — benign in isolation"] A[DLL written to writable path] B[Signed binary loads DLL] end subgraph Truth["Composite Truth — correlation is mandatory here"] C[DLL written to Temp / Public] D[Signed binary loads DLL\nfrom same path within window] C --> E{Time Window\nCorrelation} D --> E E --> F[High-Confidence\nSideload Attack] F --> G[Score · Reinforce · Direct] end ## Core Doctrine — The Minimum Truth Funnel ### The Problem at Enterprise Scale In environments of 100,000+ endpoints, traditional detection fails at the database layer. Standard SIEM rules rely on monolithic queries — massive, multi-table `join` operations executed across raw telemetry — producing query timeouts, extreme compute costs, and what this framework defines as **Bleak Outcomes**: high-confidence intrusions missed entirely because the query never completed. Traditional monolithic rule: Stage A AND Stage B within 15 minutes AND Stage C on same host → 100k endpoints → DeviceProcessEvents 500M+ rows/day → Cross-table join on raw data → TIMEOUT → Attacker delays Stage B by 72 hours → TIME WINDOW MISSED → Attacker pivots from SMB to WMI → JOIN CONDITION BROKEN → Detection: NULL ### The Three Pillars **Filter Before You Join.** Never join two raw tables. Reduce the primary table to its most critical subset — the truth — before asking for context. A query filtering `DeviceRegistryEvents` to three specific key paths before a pre-summarised prevalence join runs in seconds on a 100k estate. The same query joining raw `DeviceProcessEvents` times out. **Native Enrichment Over Joins.** Modern EDR schemas carry implicit context. `DeviceRegistryEvents` already contains `InitiatingProcessFileName`, `InitiatingProcessSHA256`, `InitiatingProcessSigner`, and `InitiatingProcessVersionInfoCompanyName`. Mapping these native fields eliminates the `DeviceProcessEvents` join entirely — zero memory pressure, full process context. **Contextual Scoring, Not Binary Alerts.** Once truth is established, route surviving data through a convergence matrix. A cumulative risk score prevents dangerous truths from being suppressed when a safe signal is present, and prevents noise from being elevated when a dangerous signal is absent. BaseScore (Truth Anchor) = 55 + TaskCache Artefact = +25 + Dangerous Primitive = +25 + Base64 Payload = +20 + User-Writable Path = +15 + Untrusted Writer = +10 + Rare Writer (Prevalence) = +10 ────────────────────────────────────── FinalScore = 160 → CRITICAL ## Substrate-First vs Intent-First Minimum Truth ### The Architectural Decision Every detection begins with a choice: anchor on the **execution substrate itself**, or anchor on a **malicious primitive** performed by that substrate. Choosing the wrong strategy is a primary cause of detection failure at scale. ┌─────────────────────────────────────────┬───────────────────────────────────────────┐ │ SUBSTRATE-FIRST │ INTENT-FIRST │ │ │ │ │ "Did this execution surface exist?" │ "Did this substrate perform an action │ │ │ that implies attacker capability?" │ ├─────────────────────────────────────────┼───────────────────────────────────────────┤ │ Anchor: execution surface │ Anchor: malicious primitive │ │ Use when: no visible intent │ Use when: substrate is common but │ │ (WMI fileless, BYOVD, injection) │ primitive implies capability │ │ Noise: higher, reinforcement required │ Noise: lower, base confidence raised │ │ Tier: L1 sensor / atomic │ Tier: L2 composite │ └─────────────────────────────────────────┴───────────────────────────────────────────┘ graph TD Q{Does the execution\nsubstrate carry\nvisible intent?} Q -- No --> SF[Substrate-First\nThe surface IS the anchor\nWMI · BYOVD · Pipe Patterns] Q -- Yes --> IF[Intent-First\nThe primitive IS the anchor\nPowerShell · LOLBins · OAuth scopes] SF --> R1[Reinforcement required\nto gain confidence] IF --> R2[Reinforcement amplifies\nalready elevated base score] R1 & R2 --> SC[Convergence Scoring\nHunter Directive] ### Substrate-First — The Canonical Case: WMI Fileless Execution In a WMI Permanent Event Subscription attack, an adversary registers a malicious `ActiveScriptEventConsumer`. When triggered, Windows Script Host (`scrcons.exe`) loads a script engine DLL — `vbscript.dll`, `jscript.dll`, or `scrobj.dll` — directly into process memory. **There is no child process. No command-line argument. No file written to disk.** The payload executes as a DLL loaded into a trusted Windows process. Attempting to anchor on intent fails here — there is no attacker-controlled command-line visible at the DLL load layer. The substrate IS the signal. // Minimum Truth — WMI Fileless Execution (T1546.003) DeviceImageLoadEvents | where InitiatingProcessFileName =~ "scrcons.exe" | where FileName in~ ("vbscript.dll", "jscript.dll", "scrobj.dll") // This is the irreducible minimum. You cannot go further left in the kill chain. Reinforcement adds confidence after truth is confirmed: // R1: Near-time network egress from scrcons.exe → C2 beacon signal DeviceNetworkEvents | where InitiatingProcessFileName =~ "scrcons.exe" | where RemoteIPType == "Public" and RemotePort in (80, 443) // R2: DLL loaded from non-system path → staging artefact // R3: First-time behaviour on this device in 30 days → prevalence anomaly ### Substrate-First — BYOVD Driver Staging (Temporal Case) BYOVD attacks exploit time. A vulnerable `.sys` driver is dropped quietly on Day 0. A service is created days later. The driver drop is the only observable primitive at the time it occurs — no execution, no intent visible. // Minimum Truth — BYOVD Driver Staging (T1543.003 / T1068) DeviceFileEvents | where FileName endswith ".sys" | where FolderPath matches regex @"(?i)\\(AppData|Temp|Public|ProgramData|Users)\\" | where InitiatingProcessSignatureStatus != "Signed" or InitiatingProcessFolderPath matches regex @"(?i)\\(AppData|Temp|Public)\\" // A .sys file dropped to a writable path by an anomalously located binary // is the substrate truth. The driver is passive. The substrate IS the signal. The 30-day atomic primitive index connects the Day 0 staging to the Day 3 activation — defeating temporal deception that no time-windowed composite rule can address alone. ### Intent-First — The Primitive Implies Capability When the execution substrate is ubiquitous, the only reliable anchor is a specific action on that substrate that structurally implies attacker capability. The primitive is the anchor, not the binary. // Minimum Truth — PowerShell Intent (T1059.001) DeviceProcessEvents | where FileName in~ ("powershell.exe", "pwsh.exe") | where ProcessCommandLine has_any ( "Invoke-WebRequest", "DownloadString", // Remote retrieval "FromBase64String", "IEX", // Payload decoding + in-memory exec "Add-Type", "-EncodedCommand", // .NET loading + obfuscated exec "VirtualAlloc", "OpenProcess" // Memory allocation / injection primitives ) PowerShell running is common. PowerShell performing remote retrieval, payload decoding, or memory allocation is not common in legitimate enterprise workflows. The primitive raises base confidence before any reinforcement is applied. ## MITRE Ecosystem Coverage — Minimum Truth Anchors & Cousins This table maps the core MITRE attack ecosystems to their minimum truth anchors, the anchoring strategy that applies, and the cousin techniques that must be covered to eliminate false coverage gaps. graph LR subgraph MITRE_Ecosystems["Core MITRE Attack Ecosystems"] P[Persistence\nT1053 · T1547 · T1543 · T1546] LM[Lateral Movement\nT1021 · T1570] CA[Credential Access\nT1003 · T1558 · T1621] EX[Execution\nT1059 · T1218 · T1546] DE[Defense Evasion\nT1562 · T1055 · T1036] C2[Command & Control\nT1071 · T1095] EF[Exfiltration\nT1041 · T1567 · T1048] end subgraph Framework["Framework Response"] CS[Cousin Sensors\nCovering adjacent substrates] MT[Minimum Truth Anchors\nSubstrate-First or Intent-First] IL[Incident Layer\nNarrative stitching] end MITRE_Ecosystems --> MT --> CS --> IL ### Persistence Ecosystem | Technique | Minimum Truth Anchor | Anchoring Strategy | MITRE | Cousin Techniques | Cousin MITRE | |-----------|---------------------|--------------------|-------|-------------------|--------------| | Silent TaskCache Persistence | `RegistryValueSet` under `Schedule\TaskCache` | Substrate-First | T1053.005 | CLI `schtasks.exe /create` | T1053.005 | | Registry Run Key Persistence | `RegistryValueSet` under `\Run` or `\RunOnce` | Intent-First (writable + payload) | T1547.001 | ActiveSetup · AppInit · Winlogon | T1547.014 · T1546.010 | | Service Persistence (ImagePath) | `RegistryValueSet` on `ImagePath` in `Services` | Intent-First (path + signer) | T1543.003 | BYOVD Driver Service | T1543.003 · T1068 | | WMI Permanent Subscription | `scrcons.exe` loads script engine DLL | Substrate-First | T1546.003 | COM Hijacking · IFEO | T1546.015 · T1546.012 | ### Lateral Movement Ecosystem | Technique | Minimum Truth Anchor | Anchoring Strategy | MITRE | Cousin Techniques | Cousin MITRE | |-----------|---------------------|--------------------|-------|-------------------|--------------| | SMB Service Execution | `services.exe` spawning uncommon child binary | Intent-First | T1021.002 | WMI Remote Exec · WinRM · DCOM | T1021.003 · T1021.006 · T1021.003 | | WMI Remote Execution | `WmiPrvSE.exe` spawning cmd/powershell | Substrate-First | T1021.003 | PsExec · AT Scheduler · WMIC | T1021.002 · T1053.005 | | Pass-the-Hash | Network logon type 3 without interactive logon | Substrate-First | T1550.002 | Pass-the-Ticket · Overpass-the-Hash | T1550.003 · T1550.002 | ### Credential Access Ecosystem | Technique | Minimum Truth Anchor | Anchoring Strategy | MITRE | Cousin Techniques | Cousin MITRE | |-----------|---------------------|--------------------|-------|-------------------|--------------| | LSASS Memory Dump | Non-AV process opens LSASS with ReadProcessMemory rights | Substrate-First | T1003.001 | DCSync Replication · SAM Extract | T1003.006 · T1003.002 | | DCSync | Non-DC account performs `GetChanges` replication rights request | Substrate-First | T1003.006 | NTDS.dit Volume Shadow Copy extract | T1003.003 | | Kerberoasting | Unusual volume of TGS requests using RC4 encryption from non-admin | Intent-First | T1558.003 | AS-REP Roasting (no preauth) | T1558.004 | ### Execution Ecosystem | Technique | Minimum Truth Anchor | Anchoring Strategy | MITRE | Cousin Techniques | Cousin MITRE | |-----------|---------------------|--------------------|-------|-------------------|--------------| | PowerShell Abuse | `-enc` / `IEX` / `VirtualAlloc` primitives | Intent-First | T1059.001 | WScript · CScript · mshta execution | T1059.005 · T1218.005 | | mshta.exe Proxy Exec | `mshta.exe` with `http://` or `vbscript:` argument | Intent-First | T1218.005 | regsvr32 Squiblydoo · rundll32 | T1218.010 · T1218.011 | | certutil.exe Decode | `certutil.exe -decode` or `-urlcache` invocation | Intent-First | T1140 | bitsadmin · curl · PowerShell WebRequest | T1197 · T1059.001 | | DLL Sideloading | Signed binary loads DLL from user-writable path | Substrate-First (correlation required) | T1574.002 | DLL Search Order Hijack · Phantom DLL | T1574.001 · T1574.002 | ### Defense Evasion Ecosystem | Technique | Minimum Truth Anchor | Anchoring Strategy | MITRE | Cousin Techniques | Cousin MITRE | |-----------|---------------------|--------------------|-------|-------------------|--------------| | Security Product Tamper | `fltmc.exe unload` or `WinDefend` stop primitives | Intent-First | T1562.001 | Exclusion path addition (`Add-MpPreference`) | T1562.001 | | BYOVD Rootkit Activation | Service created pointing to `.sys` in writable path | Substrate-First | T1068 · T1543.003 | Kernel Callback Modification | T1014 | | Process Injection | `VirtualAlloc` in PowerShell script block or via LOLBin | Substrate-First | T1055 | Process Hollowing · Thread Hijack | T1055.012 · T1055.003 | ### Command & Control Ecosystem | Technique | Minimum Truth Anchor | Anchoring Strategy | MITRE | Cousin Techniques | Cousin MITRE | |-----------|---------------------|--------------------|-------|-------------------|--------------| | Named Pipe C2 | Pipe creation matching known implant naming pattern | Substrate-First | T1071 | HTTP/S Beaconing to rare ASN | T1071.001 | | Encrypted C2 (HTTPS Jitter) | Low-volume outbound HTTPS to first-seen domain by LOLBin | Intent-First | T1071.001 | DNS Tunnelling · ICMP C2 | T1071.004 · T1095 | ### Exfiltration Ecosystem | Technique | Minimum Truth Anchor | Anchoring Strategy | MITRE | Cousin Techniques | Cousin MITRE | |-----------|---------------------|--------------------|-------|-------------------|--------------| | LOLBin Exfiltration | `bitsadmin /transfer` or `certutil -urlcache` to external | Intent-First | T1197 · T1041 | PowerShell `Invoke-WebRequest` POST | T1059.001 | | Cloud Storage Exfil | Bulk OneDrive/SharePoint download spike vs user baseline | Substrate-First (deviation) | T1567.002 | Archive staging before exfil (`7z`/`rar`) | T1560.001 | ## OAuth Consent Abuse — Applying Both Anchoring Strategies Unlike endpoint execution, OAuth abuse is identity-driven and user-mediated. The distinction between substrate-first and intent-first becomes operationally critical here. ### OAuth Substrate-First A successful consent grant occurred. A trust boundary changed. This does not imply malicious intent — it is substrate truth appropriate for tenant visibility and baseline modelling. AuditLogs | where OperationName in~ ( "Consent to application", "Add delegated permission grant", "Add app role assignment grant to service principal" ) | where Result =~ "success" ### OAuth Intent-First Intent-first in OAuth is not "consent happened." It is: **high-risk permission capability was granted.** AuditLogs | where OperationName in~ ( "Consent to application", "Add delegated permission grant", "Add app role assignment grant to service principal" ) | where Result =~ "success" | mv-expand TargetResources[0].modifiedProperties | where tostring(TargetResources[0].modifiedProperties.newValue) has_any ( "Mail.ReadWrite", "Directory.ReadWrite.All", "AppRoleAssignment.ReadWrite.All", "RoleManagement.ReadWrite.Directory", "Files.ReadWrite.All", "Sites.FullControl.All" ) The scope grant is the primitive that implies capability — it is the intent anchor that makes the detection stable. Without it, the rule requires endless tuning against thousands of legitimate application consent events. ### OAuth Composite Integration Sensor Layer (Substrate-First) → Visibility · baseline modelling · consent velocity Intent Layer (Intent-First) → High-risk scope capability granted Reinforcement → Admin consent (OnBehalfOfAll == true) Suspicious User-Agent · FirstSeen AppId Rare AppId in tenant · Privileged user Scoring → Substrate consent = low base score High-risk permission = primary weight Admin consent = escalator Rarity/newness = anomaly boost Known-good AppId = discount (never bypass) High-risk floor prevents score burial ## Noise Model & Suppression Strategy ### Core Principle Noise is not removed through blind exclusions. It is **measured, profiled, and down-scored** through contextual weighting. // ❌ Hard exclusion — creates structural blind spots | where InitiatingProcessFileName != "ccmexec.exe" // ✅ Soft-allow scoring model let Penalty_ManagedLineage = -25; let Penalty_InternalNet = -10; let Penalty_HighBurst = -20; Management automation reduces risk. It does not eliminate telemetry visibility. ### Empirical Noise Baseline — Pre-Tuning Requirement Before suppression logic is applied, extract dominant operational patterns: DeviceProcessEvents | where FileName =~ "powershell.exe" | summarize Count = count(), Devices = dcount(DeviceId) by InitiatingProcessFileName, InitiatingProcessAccountName, bin(Timestamp, 1h) | order by Count desc ### Scoring Model for Suppression let Score_EncodedPrimitive = 40; let Score_SuspiciousParent = 30; let Score_WritablePath = 20; let Score_ExternalNetwork = 25; let Score_RareExecution = 15; let Penalty_ManagedLineage = -25; // SCCM, Intune, Tanium lineage let Penalty_InternalNet = -10; // Internal IP egress only let Penalty_HighBurst = -20; // 50+ hosts in 10 minutes Tenant-portable suppression uses configuration tables rather than hardcoded values: let TrustedAutomationParents = datatable(ProcessName:string) [ "ccmexec.exe", "intunemanagementextension.exe", "taniumclient.exe" ]; ### Burst Modelling DeviceProcessEvents | where FileName =~ "powershell.exe" | summarize BurstCount = dcount(DeviceId) by bin(Timestamp, 10m) | order by BurstCount desc // High volume simultaneous → patch deployment → down-score // Low volume isolated → targeted intrusion → escalate ### Architectural Summary | Principle | Implementation | |-----------|----------------| | No brittle allowlists | Score reduction instead of exclusion | | Measure before suppressing | Empirical baseline extraction first | | Convergence required | Multiple reinforcement layers needed for escalation | | Prevalence modifies urgency | Never suppresses alerts | | Burst modelling | Differentiates mass automation from targeted intrusion | | Config-driven tuning | Avoids hard-coded exclusions across tenants | ## Rarity & Organisational Prevalence ### Three Safe Applications **Command / Behaviour Prevalence** How many hosts exhibit this exact behaviour? - 1–2 hosts → likely targeted intrusion → escalate - 200+ hosts → likely IT automation → deprioritise (never suppress) **Actor / Parent Context Prevalence** Who normally performs this action in this environment? - `rundll32.exe` spawned by `winword.exe` → anomalous execution context - Service account accessing data outside its role → privilege anomaly **Burst / Radius Prevalence** How fast and how widely did this appear? - Single host → targeted persistence - Domain-wide in under 10 minutes → ransomware precursor ### What Prevalence Is NOT Used For | Wrong | Right | |-------|-------| | Rarity as standalone alert trigger | Rarity as reinforcement signal after truth | | Common = safe | Common = lower urgency, not lower visibility | | Rare = malicious | Rare = higher priority, not automatic alert | | Suppress LSASS access on AV hosts | Surface always — score urgency by actor context | **Example — Prevalence Applied After Truth:** // Minimum Truth already established — persistence exists: RegistryValueData has "powershell" and RegistryValueData has "\\users\\public\\" // Prevalence reinforcement applied AFTER truth confirmation: | summarize DeviceCount = dcount(DeviceId) by TaskFingerprint | extend IsRare = DeviceCount <= 2 // 1 device → likely intrusion → CRITICAL urgency // 300 devices → likely IT script → MEDIUM urgency // The detection never disappears. The response priority changes. **Minimum Truth defines the attack. Reinforcement increases confidence. Prevalence scales triage.** ## Correlation vs Ghost Chains ### What Is a Ghost Chain? A ghost chain stitches together unrelated events into fake kill-chain certainty: // ❌ Ghost chain — forces false narrative RegistryValueSet | join NetworkConnection on DeviceId | join ProcessInjection on DeviceId | where all within 10 minutes // Persistence may be set today, executed tomorrow → time window MISSED // Network traffic is unrelated → FALSE POSITIVE // Injection never occurs → NULL SCORE on real intrusion ### The Correct Architecture — Independent Sensors // Sensor 1: Persistence truth DeviceRegistryEvents | where RegistryKey has "\\Run" | where RegistryValueData has "powershell" // Truth: persistence exists. Do not join this. Do not extend this. Alert on this. // Sensor 2: In-memory execution truth DeviceEvents | where ActionType == "PowerShellScriptBlock" | where AdditionalFields has "VirtualAlloc" // Truth: in-memory execution capability is being prepared. // Sensor 3: Silent task persistence truth DeviceRegistryEvents | where RegistryKey has "\\Schedule\\TaskCache" | where RegistryValueData has "-enc" // Truth: silent scheduled task persistence exists. **Incident-Level Correlation:** The SIEM correlates same device + same user + same timeframe + multiple truths firing. This builds the attack story correctly — without ghost chains inside individual rules. ### Correlation vs Ghost Chains — Decision Correlate inside a rule ONLY when: → The technique cannot exist without both events → Telemetry sources are stable and reliable → The join reduces ambiguity, not increases complexity Split into sibling composites when: → The truth surface changes → The noise domain changes → The attacker method is optional → The timing may vary across sessions or days ## Primitive Stitching & Incident Narrative Architecture ### The Two-Layer Fusion Architecture ┌──────────────────────────────────┬────────────────────────────────────────────┐ │ LAYER 1: ATOMIC SENTINEL │ LAYER 2: BEHAVIOURAL COMPOSITE │ │ (The Net) │ (The Anchor) │ ├──────────────────────────────────┼────────────────────────────────────────────┤ │ Continuous silent logging │ High-fidelity minimum truth detection │ │ No individual alert threshold │ Fires as Instant Hit Anchor │ │ 30-day rolling entity index │ Immediate HunterDirective output │ │ Catches what composites miss │ Triggers pivot into atomic timeline │ │ Defeats temporal deception │ Localized time window (2h–48h) │ └──────────────────────────────────┴────────────────────────────────────────────┘ When a Composite fires: → Analyst receives HunterDirective + RiskScore → Atomic layer surfaces full 30-day entity timeline → Slow-rolling APT staging artefacts become visible → Day 0 BYOVD driver drop connects to Day 3 rootkit activation graph TD subgraph A["Layer 1 — Atomic Sentinel (Silent · 30-day index)"] A1["Driver dropped to AppData\nDay 0 · DeviceName: SRV-01"] A2["Outbound HTTPS rare ASN\nDay 1 · DeviceName: SRV-01"] A3["Registry RunKey write\nDay 2 · DeviceName: SRV-01"] end subgraph B["Layer 2 — Composite Anchors (Fire independently)"] B1["🔴 BYOVD Service Activation\nCRITICAL · Day 3 · SRV-01"] B2["🔴 Security Product Tamper\nCRITICAL · Day 3 · SRV-01"] end subgraph C["Incident Layer — Narrative Convergence"] C1["APT Intrusion Confirmed\nSRV-01 · Day 0 to Day 3\nDriver staged Day 0\nBeacon established Day 1\nPersistence Day 2\nBYOVD + Tamper Day 3\nBlast radius: 3 hosts\nIMMEDIATE ISOLATION REQUIRED"] end A1 & A2 & A3 -->|Entity Key: DeviceName\n30-day index| B1 B1 & B2 -->|Anchor fires → atomic\ntimeline surfaces| C1 ### Entity Keys — The Stitching Mechanism | Entity Key | Stitching Context | |------------|------------------| | `DeviceName` | Host-level — connects all events to one machine | | `AccountName` | Identity-level — connects to same actor across hosts | | `DeviceId` | Hardware-level — tamper-resistant stitching | | `SHA256` | Artefact-level — connects binary drops across time | | `RemoteIP / ASN` | Infrastructure-level — C2 attribution across hosts | ### The KQL Primitive Collector Executed automatically when a composite fires — reconstructing the full entity timeline: // ATOMIC PRIMITIVE COLLECTOR // Triggered by composite anchor — not an alert, a hunting pivot let EntityKey_Device = "SRV-01"; // Injected from composite let AnchorTime = datetime(2026-05-20T14:22:00Z); // Composite fire timestamp let LookbackWindow = 30d; let ForwardWindow = 2h; let P_Execution = DeviceProcessEvents | where Timestamp between ((AnchorTime - LookbackWindow) .. (AnchorTime + ForwardWindow)) | where DeviceName =~ EntityKey_Device | where FileName in~ ("powershell.exe","cmd.exe","mshta.exe","rundll32.exe","certutil.exe","regsvr32.exe") | project Timestamp, Layer="Execution", Event = strcat(FileName, " | ", InitiatingProcessFileName, " | ", ProcessCommandLine), MITRE = "T1059/T1218"; let P_Persistence = DeviceRegistryEvents | where Timestamp between ((AnchorTime - LookbackWindow) .. (AnchorTime + ForwardWindow)) | where DeviceName =~ EntityKey_Device | where RegistryKey has_any (@"\Run",@"\RunOnce",@"Schedule\TaskCache",@"CurrentControlSet\Services") | project Timestamp, Layer="Persistence", Event = strcat("RegWrite: ", RegistryKey, " → ", RegistryValueData), MITRE = "T1547/T1053"; let P_DriverStaging = DeviceFileEvents | where Timestamp between ((AnchorTime - LookbackWindow) .. (AnchorTime + ForwardWindow)) | where DeviceName =~ EntityKey_Device | where FileName endswith ".sys" | where FolderPath matches regex @"(?i)\\(AppData|Temp|Public|ProgramData)\\" | project Timestamp, Layer="BYOVD Driver Staging", Event = strcat("Drop: ", FolderPath, "\\", FileName, " by: ", InitiatingProcessFileName), MITRE = "T1543.003/T1068"; let P_Network = DeviceNetworkEvents | where Timestamp between ((AnchorTime - LookbackWindow) .. (AnchorTime + ForwardWindow)) | where DeviceName =~ EntityKey_Device | where RemoteIPType == "Public" | where InitiatingProcessFileName in~ ("powershell.exe","rundll32.exe","mshta.exe","svchost.exe") | project Timestamp, Layer="Network", Event = strcat(InitiatingProcessFileName, " → ", RemoteIP, ":", RemotePort), MITRE = "TA0011"; union P_Execution, P_Persistence, P_DriverStaging, P_Network | order by Timestamp asc | project Timestamp, Layer, Event, MITRE ## Composite Threat Hunt Portfolio ### Tier-1 Baseline Pack — Enterprise Mandatory Ecosystems **Live MITRE Coverage Matrix:** https://azdabat.github.io/Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules/MITRE-MATRIX.html | Ecosystem | Minimum Truth Sensor | Composite Built | Reinforcement Tuned | Atomic Validated | Maturity | |-----------|----------------------|-----------------|---------------------|------------------|----------| | **PowerShell Execution & Abuse** | Script execution + encoded/runtime intent | ✅ Yes | ⚠️ Partial | ⚠️ In Progress | MED | | **Registry Autoruns (Run/RunOnce)** | RegistryValueSet on logon trigger keys | ✅ Yes | ✅ Strong | ✅ Tested | HIGH | | **Scheduled Tasks (CLI Creation)** | `schtasks.exe /create` process truth | ✅ Yes | ✅ Strong | ✅ Tested | HIGH | | **Scheduled Tasks (Silent TaskCache)** | TaskCache persistence without schtasks.exe | ✅ Yes | ⚠️ Needs Noise Calibration | ⚠️ In Progress | MED | | **Service Persistence (ImagePath)** | Service registry ImagePath modification | ⚠️ Partial | ❌ Not Tuned | ❌ Not Yet | LOW | | **Credential Access (LSASS Surface)** | LSASS access/dump behavioural truth | ✅ Yes | ⚠️ Partial | ⚠️ In Progress | MED | | **NTDS / SAM Extraction** | Hive/NTDS interaction truth | ✅ Yes | ⚠️ Partial | ❌ Not Yet | MED | | **LOLBins Proxy Execution Core** | Signed binary misuse surface | ✅ Yes | ⚠️ Needs Baselines | ❌ Not Yet | MED | | **Cloud Identity Persistence (OAuth Consent)** | High-risk scope grant baseline truth | ✅ Yes | ✅ Strong | ⚠️ Tenant Validation Needed | HIGH | ### Tier-2 Composite Correlation Pack — Senior Threat Hunting Layer Tier-2 introduces multi-surface joins, prevalence reinforcement, kill-chain convergence, and noise suppression through context. | Ecosystem | Minimum Truth Anchor | Composite Reinforcement Layer | Status | Maturity | |-----------|----------------------|-------------------------------|--------|----------| | **Registry Hijacks (IFEO/COM/AppInit)** | Execution interception registry truth | Writable DLL + rare writer + untrusted signer | ⚠️ Partial | MED | | **WMI Persistence + Execution** | Subscription + anomalous consumer truth | Parent lineage break + script consumer scoring | ✅ Built | HIGH | | **Lateral Movement (SMB Service / PsExec)** | Remote service creation truth | File drop + inbound 445 + rare service binary | ⚠️ Partial | MED | | **Defense Evasion (Signed LOLBin Chains)** | Trusted parent → LOLBin baseline | Injection + ghost module + beacon reinforcement | ⚠️ POC → Composite | MED | | **Session / Token Misuse (Post-Consent)** | Token replay baseline truth | ASN+UA divergence + weak auth reinforcement | ✅ Built | HIGH | | **Ingress Tool Transfer** | Writable staging drop truth | Followed by execution + outbound comms | ⚠️ In Progress | MED | | **Shadow Copy Destruction (Ransomware Prep)** | vssadmin/wmic delete truth | Multi-tool convergence scoring | ❌ Missing | LOW | | **Archive Staging + Exfil Prep** | 7z/rar bulk staging truth | Large volume + outbound correlation | ❌ Missing | LOW | ### Tier-3 Research & Novel Threat Ecosystems These are not always-on detections — they are **attack research sensors** for emerging tradecraft. | Threat Ecosystem | Research Truth Anchor | Status | Notes | |-----------------|----------------------|--------|-------| | **React2Shell / IIS Exploit Chains** | Web process → CLR abuse → injection | ✅ Modelled | Requires telemetry hardening | | **EtherRAT / Blockchain C2** | RPC beaconing + low-prevalence infra | ✅ Documented | Network correlation expansion needed | | **SilverFox / ValleyRAT BYOVD** | Signed loader → sideload → driver load truth | ⚠️ Advanced Composite | Needs DriverLoadEvent validation | | **Pulsar RAT Injection + Tasks** | Trusted parent → LOLBin → memory exec | 🟡 Parked POC | Awaiting confirmed ecosystem truth | | **Kernel Driver Abuse (BYOVD)** | Driver service creation + load event | ⚠️ Partial | High impact, tuning required | | **Supply Chain Behaviour Modelling** | Signed update → anomaly divergence | ✅ Threat Modelled | Tier-2 rule ownership pending | ## Architecture Doctrine — At Scale ### Why This Matters In enterprise-scale environments (100k+ endpoints), traditional Detection Engineering fails at the database layer. The Minimum Truth framework flips the paradigm: instead of asking the database to correlate everything at once, it forces the query to establish the absolute minimum baseline of malicious truth *first*, discard the rest of the noise, and only then enrich the surviving data. ### Phase 1 — Establish Minimum Truth (The Funnel) Immediately restrict the dataset to specific high-value keys. Define Danger and Safe parameters dynamically — never hardcoded. ### Phase 2 — Zero-Join Native Enrichment Extract `InitiatingProcess*` fields natively present in the optimised EDR schema. This eliminates the heavyweight `DeviceProcessEvents` join entirely. Zero memory pressure. Full process context. ### Phase 3 — The Safe Join (Prevalence Only) The only join permitted is a **pre-summarised join**. Summarise `DeviceFileEvents` to a tiny `OrgPrevalence` table first, then `leftouter` join it to the already-filtered registry events. **Small table joined to small table.** ### Phase 4 — Convergence Scoring Score the remnant data against a contextual matrix and output with a SOC-ready Hunter Directive. Not a binary alert. A contextual story. ## Composite Rule Template — Registry Persistence TaskCache The full production-grade implementation of the framework's canonical case study: // ============================================================================ // COMPOSITE HUNT (L3): Registry_Persistence_Background_Service_TaskCache // Author: Ala Dabat // Platform: Microsoft Defender XDR / Sentinel Advanced Hunting // Truth Domain: DeviceRegistryEvents (Optimised Schema) // Minimum Truth: RegistryValueSet under Services OR Schedule TaskCache // MITRE: T1543.003, T1053.005 // Zero-join enrichment · Safe pre-summarised prevalence join · Convergence scoring // ============================================================================ let lookback = 14d; let TrustedPublishers = dynamic(["Microsoft Corporation","Microsoft Windows","Google LLC","Mozilla Corporation"]); let TrustedInitiators = dynamic(["msiexec.exe","trustedinstaller.exe","sppsvc.exe","intunemanagementextension.exe","updateinstaller.exe"]); let BackgroundKeys = dynamic([ @"system\currentcontrolset\services", @"software\microsoft\windows nt\currentversion\schedule\taskcache\tree", @"software\microsoft\windows nt\currentversion\schedule\taskcache\tasks" ]); let UserWritableRx = @"(?i)^[a-z]:\\(users|public|programdata|temp|downloads|appdata)\\"; let Base64ChunkedRx = @"(?:[A-Za-z0-9+/]{20,}={0,2})(?:\s+[A-Za-z0-9+/]{20,}={0,2})+"; let IPv4Rx = @"\b(?:(?:25[0-5]|2[0-4]\d|1?\d?\d)\.){3}(?:25[0-5]|2[0-4]\d|1?\d?\d)\b"; let DomainRx = @"\b([a-z0-9][a-z0-9-]{1,62}\.)+[a-z]{2,}\b"; let UrlRx = @"https?://[^\s'""<>]+"; let DangerTokens = dynamic([ "powershell","pwsh","cmd.exe","mshta","rundll32","regsvr32","wscript","cscript", "certutil","bitsadmin","curl","-enc","-encodedcommand","frombase64string","http:","https:" ]); let SafePathAnchors = dynamic([@"c:\program files",@"c:\program files (x86)",@"c:\windows\system32",@"c:\windows\syswow64"]); let SafeVendorKeywords = dynamic(["windows update","microsoft","google","edge","mozilla","firefox","onedrive","teams","intel","nvidia","amd","realtek","adobe","citrix"]); let PayloadSizeThreshold = 500; // PHASE 2: PRE-SUMMARISED PREVALENCE TABLE let OrgPrevalence = DeviceFileEvents | where Timestamp >= ago(30d) | summarize WriterDeviceCount = dcount(DeviceId) by SHA256; // PHASE 1: MINIMUM TRUTH let Raw = DeviceRegistryEvents | where Timestamp >= ago(lookback) | where ActionType == "RegistryValueSet" | extend RK = tolower(tostring(RegistryKey)), RVN = tolower(tostring(RegistryValueName)), RVD = tolower(tostring(RegistryValueData)) | where RK has_any (BackgroundKeys); // PHASE 2: ZERO-JOIN NATIVE ENRICHMENT + SAFE JOIN let Enriched = Raw | extend WriterFile = tostring(InitiatingProcessFileName), WriterCL = tostring(InitiatingProcessCommandLine), WriterSHA = tostring(InitiatingProcessSHA256), WriterSigner = tostring(InitiatingProcessSigner), WriterCompany = tostring(InitiatingProcessVersionInfoCompanyName), WriterUser = tostring(InitiatingProcessAccountName) | extend WriterFileL = tolower(coalesce(WriterFile,"")), WriterCLL = tolower(coalesce(WriterCL,"")), WriterTrustedPublisher = toint(WriterCompany in (TrustedPublishers) or WriterSigner in (TrustedPublishers)), WriterTrustedInitiator = toint(WriterFileL in (TrustedInitiators)) | join kind=leftouter OrgPrevalence on $left.WriterSHA == $right.SHA256 | extend WriterDeviceCount = coalesce(WriterDeviceCount, 0), WriterIsRare = toint(WriterDeviceCount <= 2); // PHASE 3: CONVERGENCE SCORING & FILTERING Enriched | extend IsService = toint(RK has "system\\currentcontrolset\\services"), IsTaskCache = toint(RK has "schedule\\taskcache"), ServiceImagePathWrite = toint(IsService==1 and (RVN == "imagepath" or RVN has "imagepath")), HasDanger = toint(RVD has_any (DangerTokens) or WriterCLL has_any (DangerTokens)), HasBase64 = toint(RVD matches regex Base64ChunkedRx or WriterCLL matches regex Base64ChunkedRx), HasNet = toint(RVD matches regex UrlRx or RVD matches regex IPv4Rx or RVD matches regex DomainRx), PointsWritable = toint(RVD matches regex UserWritableRx), IsLargeBlob = toint(strlen(RVD) > PayloadSizeThreshold), IsSafePath = toint(RVD has_any (SafePathAnchors)), IsSafeVendor = toint(RVD has_any (SafeVendorKeywords) or RVN has_any (SafeVendorKeywords)), UntrustedWriter = toint(WriterTrustedPublisher == 0) | where (IsService==1 or IsTaskCache==1) | where (IsTaskCache==1) or (ServiceImagePathWrite==1) or (HasDanger==1) or (PointsWritable==1) or (IsLargeBlob==1) | where not(IsSafePath==1 and IsSafeVendor==1 and HasDanger==0 and HasBase64==0 and HasNet==0 and PointsWritable==0 and IsLargeBlob==0) | where not(WriterTrustedInitiator==1 and (HasDanger + HasBase64 + HasNet + PointsWritable + IsLargeBlob) == 0) | extend RiskScore = 55 + (25 * IsTaskCache) + (20 * ServiceImagePathWrite) + (25 * HasDanger) + (20 * HasBase64) + (10 * HasNet) + (15 * PointsWritable) + (25 * IsLargeBlob) + (10 * UntrustedWriter) + (10 * WriterIsRare), RiskLevel = case(RiskScore >= 120, "CRITICAL", RiskScore >= 90, "HIGH", RiskScore >= 70, "MEDIUM", "LOW") | where RiskLevel in ("MEDIUM","HIGH","CRITICAL") | extend DecodedPayload = base64_decode_string(tostring(extract(@"([A-Za-z0-9+/]{40,})", 1, RegistryValueData))) | project Timestamp, DeviceName, DecodedPayload, AccountName = coalesce(WriterUser, tostring(AccountName)), RegistryKey, RegistryValueName, RegistryValueData, PersistenceClass = case(IsTaskCache==1,"TaskCache(SilentTask)", ServiceImagePathWrite==1,"Service(ImagePath)","Background(Other)"), WriterProcess = WriterFile, WriterCommandLine = WriterCL, WriterCompany, WriterSigner, WriterSHA, WriterDeviceCount, RiskScore, RiskLevel | extend HunterDirective = case( RiskLevel=="CRITICAL" and PersistenceClass startswith "TaskCache", "CRITICAL: Silent Scheduled Task persistence via TaskCache (API/COM). Pull task definition, isolate if unauthorised.", RiskLevel=="CRITICAL" and PersistenceClass startswith "Service", "CRITICAL: Service persistence set (ImagePath) with strong indicators. Validate service name + binary path.", RiskLevel=="HIGH", "HIGH: Background persistence registry artefact. Pivot to writer ancestry.", "MEDIUM: Background persistence signal. Validate if approved updater/agent; if not, escalate." ) | order by RiskScore desc, Timestamp desc ## Hunter Directives Every composite hunt produces **guidance alongside results — not after**. Each rule outputs a `HunterDirective` that answers: 1. **Why** this fired — baseline truth confirmed 2. **What** reinforces confidence — scoring context 3. **What** to do next — pivot, scope blast radius, escalate Hunter Directives are SOC-ready playbooks embedded in the detection output. Rules are not just detections — they are operational response guides. ## The Rule Factory Checklist Before publishing any composite hunt: | Requirement | Check | |-------------|-------| | Minimum Truth is 1 clear anchor | ✅ | | Reinforcement signals are optional (2–4 max) | ✅ | | Convergence window is defined | ✅ | | Noise suppression is explicit | ✅ | | Org prevalence is scoring only — never a hard filter | ✅ | | Severity is cumulative, not binary | ✅ | | Output is SOC-actionable with Hunter Directive | ✅ | ## Architectural Strategy — Split vs Composite ### The Four Rules **Rule 1 — Split when the Minimum Truth changes.** If the baseline event requires a schema change, telemetry change, or mechanism change — SPLIT. | Shift | Decision | |-------|----------| | Host process execution → Identity log transaction | ✂️ SPLIT | | SMB lateral movement → WMI lateral movement | ✂️ SPLIT | | Endpoint execution → Identity sign-in truth | ✂️ SPLIT | | Same LOLBin surface, different intent primitives | ✅ KEEP | **Critical caveat:** Reinforcement signals may cross telemetry surfaces as long as they remain optional and do not replace the baseline truth. Baseline truth = svchost(schedule) spawning suspicious child Reinforcement = TaskCache registry artefacts ← optional cross-table evidence Reinforcement = Task XML drops ← optional cross-table evidence Reinforcement = Org prevalence rarity ← optional cross-table scoring Truth anchor remains execution. Registry is supporting evidence, not the trigger. **Rule 2 — Split when the noise domain changes.** SCCM automation noise vs developer PowerShell vs DC replication are entirely different noise profiles requiring distinct suppression models. SPLIT. **Rule 3 — Split when the telemetry surface changes.** `DeviceProcessEvents ≠ DeviceRegistryEvents ≠ SigninLogs ≠ DeviceNetworkEvents` **Rule 4 — Keep composite when refining context.** Classification, scoring, enrichment, and reinforcement belong inside the rule when the Minimum Truth stays the same. ### Decision Matrix | Ecosystem | Scenario | Decision | Reason | |-----------|----------|----------|--------| | **Scheduled Tasks** | `schtasks.exe /create` vs `Register-ScheduledTask` | ✂️ SPLIT | Different truth surface: CLI vs API | | **Scheduled Tasks** | `schtasks.exe /create` vs `schtasks.exe /change` | ✅ KEEP | Same truth domain: same binary + schema | | **Lateral Movement** | SMB service exec vs WMI remote process | ✂️ SPLIT | Different mechanism, different noise domain | | **Credential Access** | LSASS dump vs DCSync vs Kerberoasting | ✂️ SPLIT | Different telemetry surfaces entirely | | **LOLBin Execution** | rundll32 vs regsvr32 vs mshta (same parent, same intent) | ✅ KEEP | Same process surface, same attacker goal | Truth Anchor = Sensor Reinforcement = Evidence Cousins = Adjacent sensors Incident = Story stitching Truth defines the rule. Reinforcement strengthens it. ## Cousin Rules & Attack Ecosystem Coverage ### Why Cousin Rules Are Non-Negotiable Sophisticated threat actors use cousin attack surfaces as **fallback layers**. Automated deployment toolkits contain cousin-surface fallback logic: graph TD A["Attacker executes LockBit deployment script"] --> B B["SMB Service Exec\nPort 445 — Segment A\nSMB_Service_Lateral FIRES 🔴"] --> C C{"Port 445 blocked\non Segment B"} --> D D["Automatic WMI fallback\nPort 135 — Segment B\nWMI_RemoteExec_Cousin FIRES 🔴"] --> E E["Shadow copy deletion\nFleet-wide\nShadow_Copy_Composite FIRES 🔴"] --> F F["Incident Layer\nAccountName stitching\n'Adversary pivoted SMB → WMI\nScope full subnet NOW'"] A monolithic rule anchored on SMB service creation scores NULL the moment the attacker pivots to WMI. The cousin composite fires independently. The incident layer stitches both surfaces to the same actor. **The pivot is not a defence. It is a data point.** ### Cousin Ecosystem Discovery This framework is not built on theoretical MITRE grouping — it is built on **empirically discovered cousin ecosystems**, validated through ADX-Docker simulation, Empire-style telemetry, and repeated convergence testing. **Full living discovery journal:** https://github.com/azdabat/Production-READY-Composite-Threat-Hunting-Rules/blob/main/Cousin_Discovery_Log.md ### Ecosystem Table — Composites + Cousins ## **Live Roadmap:** https://azdabat.github.io/Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules/MITRE-MATRIX.html ## 🗺️ God Mode Composite Roadmap (Still under construction, but existing content can still be browsed) **[→ OPEN INTERACTIVE ROADMAP](https://azdabat.github.io/Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules/MTDF-Composite-Roadmap.html)** The God Mode Composite Roadmap is the capstone artefact of the Minimum Truth Detection Framework — a complete, interactive reference mapping every major ATT&CK tactic to its **core baseline composite rule**, its **cousin technique ecosystem**, and a **full red team breakdown** of how each attack is deployed in the real world. This is not a vendor rule pack or an ATT&CK navigator heatmap. It is a living R&D document that answers three questions for every detection: - **Why is this composite shaped this way?** — the blue team rationale, minimum truth anchor, and reinforcement signal logic - **How does an attacker actually deploy this technique?** — CEH/OSCP level methodology, tool names, real malware families, full kill chain position - **What are the cousin attack surfaces?** — adjacent techniques that share substrate or intent primitives, each with their own composite and red team context ### Coverage | Tactic | Core Composites | Cousin Rules | |---|---|---| | Execution | PowerShell, WMI, LOLBins | AMSI bypass, download cradles, mshta, rundll32, regsvr32 | | Persistence | Registry run keys, WMI subscriptions | Scheduled tasks, DLL sideloading | | Privilege Escalation | Process injection, token theft | UAC bypass, fodhelper chain | | Defense Evasion | BYOVD, EDR kill chain | Byte-flip mutation, callback deregistration | | Credential Access | LSASS memory dump | DCSync, Kerberoasting | | Discovery | Velocity sweep composite | BloodHound/LDAP enumeration | | Lateral Movement | Pass-the-Hash/Ticket | WMI remote execution, Impacket chains | | Collection | Archive staging | Pre-exfil compression patterns | | C2 | HTTPS beaconing | DNS tunnelling, blockchain C2 | | Exfiltration | Volume anomaly | Cloud storage exfil (rclone, MEGAcmd) | | Impact | Ransomware in-progress | VSS deletion anchor, encryption velocity | | ★ Novel Research | SilverFox/ValleyRAT BYOVD | EtherRAT blockchain C2, Steganographic loaders | ### Novel Research Tab The roadmap includes a dedicated tab for original MTDF threat research — families with no vendor rule coverage, fully documented from kill chain through detection engineering: - **SilverFox / ValleyRAT** — 7-rule, 3-tier detection covering the complete BYOVD EDR kill chain including byte-flip hash mutation defence - **EtherRAT / React2Shell (CVE-2025-55182)** — Blockchain C2 detection via Ethereum RPC polling; the C2 channel is the public blockchain itself - **Steganographic Loader Ecosystem** — Multi-phase detection covering LSB image inspection, .NET in-memory execution, and full IR SOP flowchart TD A["🗺️ MTDF GOD MODE ROADMAP"] --> B["ATT&CK Ecosystem Tabs"] A --> C["★ Novel Research Tab"] B --> D["Core Baseline Composite\n🔵 Blue Team Rationale\nMinimum Truth Anchor"] D --> E["Cousin Rule Cards\n🔴 Red Team Breakdowns\nTool · Malware · Kill Chain"] C --> F["SilverFox / ValleyRAT\nBYOVD 7-Rule Chain"] C --> G["EtherRAT / React2Shell\nBlockchain C2 CVE-2025-55182"] C --> H["Stego Loader Ecosystem\nLSB Image Payload Delivery"] E --> I["When · Why · How\nCEH/OSCP Level Detail\nReal Malware Families"] F & G & H --> J["Original Detection Rules\nValidated · GitHub Published"] style A fill:#0d0d14,stroke:#f59e0b,color:#f59e0b style B fill:#0a1628,stroke:#00aaff,color:#00aaff style C fill:#1a1000,stroke:#f59e0b,color:#f59e0b style D fill:#0a1628,stroke:#00aaff,color:#7dd3fc style E fill:#1a0a0e,stroke:#ff2244,color:#fca5a5 style I fill:#1a0a0e,stroke:#ff2244,color:#fca5a5 style F fill:#1a1000,stroke:#f59e0b,color:#fcd34d style G fill:#1a1000,stroke:#f59e0b,color:#fcd34d style H fill:#1a1000,stroke:#f59e0b,color:#fcd34d style J fill:#0a1a0f,stroke:#00ff88,color:#00ff88 | Ecosystem | Primary Composite | MITRE | Cousin Composite | Cousin MITRE | Notes | |-----------|------------------|-------|-----------------|--------------|-------| | **Registry Persistence** | `Registry_Persistence_Background_Service_TaskCache` | T1543.003, T1053.005 | Registry Persistence Alternate Anchors | T1543, T1053 | HKEY_CLUSTER_SERVICE, COM task persistence | | | `Registry_Persistence_Hijack_Interception` | T1546.* | Registry Hijack Cousins | T1546.* | Winlogon handler, shell open interception | | | `Registry_Persistence_Userland_Autoruns` | T1547.001/014/004 | Userland Autoruns Cousin | T1547.* | Policies RunOnce, ActiveSetup deep variants | | **Scheduled Tasks** | TaskCache + Registry | T1053.005 | `ScheduledTask_Execution_TwinRule` | T1053.005 | svchost/taskeng exec without schtasks.exe | | **Service Execution** | `SMB_Service_Execution` | T1021.002 / T1543.003 | `Service_Exec_ScheduleTask_Cousin` | T1053.005 | svchost scheduler execution surface | | **Lateral Movement** | `SMB_Service_Lateral` | T1021.002 | `WMI_RemoteExec_Cousin` | T1021.003 | Remote process via WMI | | | | | `WinRM_Exec_Cousin` | T1021.006 | PowerShell/WinRM lateral | | **Execution (LOLBins)** | `TrustedParent_LOLBin_InMemoryInjection_Chain` | T1218 / T1055 | `TaskExec_LOLBin_Injection_Cousin` | T1218/T1055 | LOLBin from Scheduled Task surface | | **Credential Access** | LSASS composite | T1003.001 | `LSASS_Access_Cousin` | T1003.006 | DCSync / NTLM Harvest twin | | **Identity Abuse** | OAuth consent composite | T1621 / T1078.004 | `Identity_ConsentGrant_Cousin` | T1621 | Token replay vs lateral token misuse | | **Persistence (Driver)** | BYOVD POC/Research | T1547 / T1543 | `Driver_Persistence_Cousin` | T1543.008 | KMDF/Driver load surface | ### Framework Logic Behind Cousin Pairing **Different Noise Domain.** `services.exe` service exec — low noise, aggressive rules applicable. `svchost.exe` scheduled task exec — high noise, strict anchors required. Both cover lateral movement. Noise profiles differ fundamentally. **Separate Truth Anchors.** Service rule anchors on `services.exe` spawning an uncommon child. Scheduled Task cousin anchors on explicit task create/exec signals AND/OR TaskCache registry writes. Logically adjacent — structurally different anchors. **Composite Isolation.** Coupling them in one rule breaks noise suppression, operational fidelity, and analyst clarity. Separate sensors maintain precision. **Ecosystem Continuity.** Every primary composite answers four questions: 1. What is the attack surface? 2. What is the minimum truth anchor? 3. What adjacent surfaces share intent? 4. What cousin composites must exist to cover those surfaces? **Full design doctrine:** https://github.com/azdabat/Production-READY-Composite-Threat-Hunting-Rules/blob/main/Ecosystem_Deaign_Architecture.md ## Router Rules — Rules That Sit Outside Ecosystems Not every composite belongs inside a single attack ecosystem. In production, a second class of rules exists alongside composite sensors: graph TD subgraph Layer1["The Two Rule Types"] T1["TYPE 1 — ECOSYSTEM COMPOSITE SENSOR\nAnswers: Is this specific attack mechanism real?\nBase score: 55 · Threshold: ≥ 75\nOutput: HunterDirective\nHigh-fidelity · Production alert"] T2["TYPE 2 — ROUTER RULE\nAnswers: Is attack intent present anywhere?\nBase score: 0 · Threshold: ≥ 30\nOutput: RoutingDirective\nBroad surface · Triage signal"] end subgraph Flow["How They Interact"] R["Router fires\nRoutingDirective: PIVOT TO T1197 Composite"] --> C C["Composite executed\nMinimum truth confirmed\nRiskScore + HunterDirective"] --> I I["Incident layer stitches both\nvia DeviceId entity key"] end T1 --> Flow T2 --> Flow ### When Router Rules Are Valid A router rule is **valid** when ALL of the following are true: 1. Multiple techniques are in scope AND they have **different noise domains** 2. Output explicitly **routes to ecosystem composites** — a RoutingDirective, not a HunterDirective 3. Base score is **0** — signals build from zero, not from 55 4. No technique in the rule has a **validated composite** yet (if it does, the composite takes over) 5. A **decomposition tracker** is present — documents which composites retire each technique A router rule is **NOT valid** when: | Condition | Reason | |-----------|--------| | Validated composite exists for the technique | The composite takes over — retire the technique from the router | | Techniques share the same noise domain | Combine in a composite pack instead | | High-confidence SOC alert required | Composite sensors only — routers cannot produce reliable high-confidence alerts | | Used permanently instead of building composites | Router rules are coverage debt, not architecture | ### The Canonical Example — Ingress Tool Transfer The ingress tool transfer router rule (Hunt Pack 04) is the clearest example of a valid router rule in the MTDF ecosystem: graph TD Router["Hunt Pack 04 — Router Rule\nbitsadmin · certutil · curl · PowerShell\nBase score: 0 · Threshold: 30\nTriage surface — routes to composites"] --> B B{Which signal fires?} B -->|"/transfer flag"| C1["RoutingDirective:\nPIVOT TO T1197 BITSAdmin Composite\n✅ Built — REMOVE from router"] B -->|"-urlcache flag"| C2["RoutingDirective:\nPIVOT TO T1140 Certutil Composite\n🔴 Pending — keep in router"] B -->|"curl -o remote URL"| C3["RoutingDirective:\nPIVOT TO curl Composite\n🔴 Not built — keep in router"] B -->|"IsMasqueraded"| C4["RoutingDirective:\nPIVOT TO T1036 Masquerading Composite\n🔴 Pending — keep in router"] C1 --> D["Decomposition Tracker Updated:\nbitsadmin → RETIRED from router\ncertutil → Active\ncurl → Active\nMasquerade → Active"] **Why these tools cannot be combined in a composite:** | Tool | Enterprise Noise Profile | Suppression Logic Needed | |------|-------------------------|--------------------------| | bitsadmin.exe | SCCM, Windows Update, Intune | Managed endpoint lineage penalty | | certutil.exe | Developer certificate tooling, PKI ops | Dev machine + IT baseline penalty | | curl.exe | DevOps pipelines, Linux tooling | CI/CD runner context penalty | | powershell.exe | Everything | Extensive intent filtering required | These four tools require four completely different suppression models. Combining them into one composite produces a rule that cannot be tuned for one tool without creating blind spots for another. ### The Scoring Architecture Difference // ── ROUTER RULE — Base starts at ZERO ─────────────────────── // No minimum truth is established in a router rule. // Signals build the case incrementally from nothing. // The analyst acts when enough signals converge — but this is // triage, not a production alert. Low threshold reflects this. | extend RawScore = 0 + iff(IsMasqueraded == 1, 50, 0) // Renamed binary — strong + iff(HasRemoteURL == 1, 20, 0) // External download + iff(IsShellParent == 1, 15, 0) // Shell parent + iff(IsDangerousExtension == 1, 10, 0) // Executable drop | extend RiskScore = iif(RawScore < 0, 0, RawScore) | where RiskScore >= 30 // LOW THRESHOLD — this is triage // ── COMPOSITE SENSOR — Base starts at 55 ──────────────────── // Minimum truth has already been established in Phase 1. // The 55 base reflects that the anchor alone is inherently // suspicious. Reinforcement amplifies an already elevated signal. | extend RawScore = 55 // Base: /transfer + remote URL = structural truth + iff(IsHighRiskDomain == 1, 15, 0) + iff(IsUserWritableDrop == 1, 10, 0) + iff(IsShellParent == 1, 10, 0) | extend RiskScore = iif(RawScore < 0, 0, RawScore) | where RiskScore >= 75 // HIGH THRESHOLD — production alert ### Router Rule Template // ============================================================================ // ROUTER RULE: [Technique Family Name] // Architecture: Router Rule (Architecture 2) — Triage Surface // Scope: [list techniques covered] // Base score: 0 — signals build from zero // Threshold: 30 — lower than composite (triage, not production alert) // TEMPORARY: Retire each technique when dedicated composite ADX validated // // DECOMPOSITION STATUS: // ┌──────────────────┬──────────────────────────────┬────────────────────┐ // │ Technique │ Composite Status │ Action │ // ├──────────────────┼──────────────────────────────┼────────────────────┤ // │ [technique 1] │ [Composite name — built/pend]│ [Remove/Keep] │ // │ [technique 2] │ [Not built] │ [Keep in router] │ // └──────────────────┴──────────────────────────────┴────────────────────┘ // ============================================================================ let lookback = 7d; // ── PHASE 1: BROAD SURFACE FILTER ─────────────────────────────────────────── // Cover multiple techniques — different noise domains, same adversary goal [broad filter across technique family] // ── PHASE 2: SIGNAL ENRICHMENT ────────────────────────────────────────────── // IsMasqueraded: original filename check (unique value from Hunt Pack 04) | extend IsMasqueraded = toint( isnotempty(tolower(tostring( column_ifexists("ProcessVersionInfoOriginalFileName", "")))) and tolower(FileName) != tolower(tostring( column_ifexists("ProcessVersionInfoOriginalFileName", ""))) ) [other technique-specific signals...] // ── PHASE 3: ROUTING SCORE ────────────────────────────────────────────────── // Base = 0 for router rules — non-negotiable | extend RawScore = 0 + iff(IsMasqueraded == 1, 50, 0) [other signals...] | extend RiskScore = iif(RawScore < 0, 0, RawScore) | where RiskScore >= 30 // Triage threshold // ── PHASE 4: ROUTING DIRECTIVE ────────────────────────────────────────────── // Not a HunterDirective — a RoutingDirective // Tells analyst which composite to run for deep investigation | extend RoutingDirective = case( IsMasqueraded == 1, "→ PIVOT TO: T1036 Masquerading Composite", ProcessCommandLine has "/transfer", "→ PIVOT TO: T1197 BITSAdmin Transfer Composite", ProcessCommandLine has "-urlcache", "→ PIVOT TO: T1140 Certutil Composite — pending build", "→ INVESTIGATE: No composite yet — engineer one" ) // [RULE-1] arg_max for deterministic output — never any() | summarize arg_max(Timestamp, *) by DeviceId, AccountName | project Timestamp, DeviceName, AccountName, FileName, ProcessCommandLine, RiskScore, IsMasqueraded, RoutingDirective | sort by RiskScore desc ### Router + Composite Interaction Model sequenceDiagram participant SOC as SOC Analyst participant Router as Router Rule participant Comp as T1197 Composite participant IL as Incident Layer Router->>SOC: Fires RiskScore=65\nRoutingDirective: PIVOT TO T1197 SOC->>Comp: Runs T1197 on DeviceId=X Comp->>SOC: Fires RiskScore=90 CRITICAL\nHunterDirective: BITSAdmin /transfer confirmed SOC->>IL: Both events on same DeviceId IL->>SOC: Attack story stitched\nRouter breadth + Composite depth **Full Router Rule Framewor:** https://github.com/azdabat/Router-Rule-Franework/blob/main/Router%20Rule%20Framework.md **Full Deployment Specification:** https://github.com/azdabat/Production-READY-Composite-Threat-Hunting-Rules/blob/main/Operational_Correlation_Deployment.md ### Repository Architecture | Repository | Role in Framework | |------------|-------------------| | `Minimum-Truth-Detection-Framework-ADX-Validated-Composite-Rules` | Tier-1/Tier-2 deployable composites — ADX validated | | `Production-READY-Composite-Threat-Hunting-Rules` | Production-hardened rules with receipts | | `Attack-Ecosystems-and-POC` | Tier-3 novel threats + emerging tradecraft | | `THREAT-MODELLING-SOP-Behavioural-Patch-Resistant-TTPs` | Architectural doctrine + design SOPs | | `ATLAS-ATTACK-ECOSYSTEM` | Strategic ecosystem map — cousin relationships, coverage gaps | ## The Attack Ecosystem ATLAS While this repository provides the **tactical sensors**, understanding how these sensors fit together requires a strategic map. ### Why You Need The ATLAS The Framework provides the **Micro-View** (Rule Logic): - How do I detect Scheduled Task abuse? - What is the Minimum Truth for a Run Key? - Why does `scrcons.exe` loading `vbscript.dll` matter? The ATLAS provides the **Macro-View** (Ecosystem Strategy): - How does the Scheduled Task rule relate to its Cousin in Registry Persistence? - Which rules form the complete Lateral Movement ecosystem? - How do I deploy these composites to cover an entire attack surface without gaps? **[Enter The ATLAS — The Strategic Map of Attack Ecosystems](https://github.com/azdabat/ATLAS-ATTACK-ECOSSYSTEM)** **Full Ecosystem Design & Architecture:** https://github.com/azdabat/Production-READY-Composite-Threat-Hunting-Rules/blob/main/Ecosystem_Deaign_Architecture.md ╔══════════════════════════════════════════════════════════════════════════════╗ ║ FINAL PRINCIPLE ║ ║ ║ ║ Substrate answers: Did the execution surface exist? ║ ║ Intent answers: Was attacker capability created? ║ ║ Reinforcement answers: Is this contextually malicious? ║ ║ Scoring answers: How urgent is this? ║ ║ Narrative convergence: Is this an incident? ║ ║ ║ ║ Minimum Truth defines the attack. ║ ║ Reinforcement increases confidence. ║ ║ Prevalence scales triage. ║ ║ Cousins cover every adjacent surface. ║ ║ Primitives stitch the story across time. ║ ║ ║ ║ The rule is the sensor. The incident is the narrative. ║ ╚══════════════════════════════════════════════════════════════════════════════╝ *Copyright (c) 2026 Ala Dabat. All Rights Reserved.* *Licensed under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode)* *Attribution required · Non-commercial use only · ShareAlike*

标签：AMSI绕过, KQL, 事件响应, 后端开发, 告警规则, 威胁检测, 安全运营, 扫描框架, 风险评估