Kim-Hammar/awesome-rl-for-cybersecurity

GitHub: Kim-Hammar/awesome-rl-for-cybersecurity

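A curated collection of resources focused on applying reinforcement learning to cyber security, covering training environments, academic papers, books, talks, and other resource types.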

Stars: 1074 | Forks: 144

Awesome Reinforcement Learning for Cyber Security

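A curated list of resources dedicated to reinforcement learning applied to cyber security. Note that the list includes only work that uses reinforcement learning; general machine learning methods applied to cyber security are not included. For other related curated lists, see:

* [Awesome Machine Learning for Cyber Security](https://github.com/jivoi/awesome-ml-for-cybersecurity)
* [Awesome Adversarial Machine Learning](https://github.com/yenchenlin/awesome-adversarial-machine-learning)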

## Table of Contents

- [RL Environments](#-environments)
- [Papers](#-papers)
- [Books](#-books)
- [Blog posts](#-blogposts)
- [Talks](#-talks)
- [Miscellaneous](#-miscellaneous)

## [↑](#table-of-contents) Environments

### Continuous CyberBattleSim

C-CyberBattleSim

An enhanced version of Microsoft CyberBattleSim that integrates graph neural networks and language models to create generalizable, scalable continuous spaces for reinforcement learning. It features an extended scenario generation pipeline utilizing Shodan and the NVD, and provides a unified framework for RL training and evaluation.

Paper: (2025) Scalable and Generalizable RL Agents for Attack Path Discovery via Continuous Invariant Spaces
Documentation: Read The Docs

### CyGym

CyGym: A Simulation-Based Game-Theoretic Analysis Framework for Cybersecurity

CyGym is a cybersecurity encounter simulator leveraging the OpenAI Gym framework for game-theoretic reinforcement learning research in network defense. It features realistic network topologies, a broad array of vulnerabilities and exploits including zero-day attacks, and diverse defensive mechanisms. It introduces a PSRO-style equilibrium computation framework for strategic agent interactions, and a novel zero-day exploit modeling approach. Its realism and analytic power are demonstrated via deployment against the Volt Typhoon APT scenario.
Paper: (2025) CyGym: A Simulation-Based Game-Theoretic Analysis Framework for Cybersecurity

### CybORG++

CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents

CybORG++ is an advanced toolkit for reinforcement learning research focused on network defence. Building on the CAGE 2 CybORG environment, it introduces key improvements, including enhanced debugging capabilities, refined agent implementation support, and a streamlined environment that enables faster training and easier customization. Along with addressing several software bugs from its predecessor, CybORG++ introduces MiniCAGE, a lightweight version of CAGE 2.
Paper: (2024) CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents

### CyberShield

CYBERSHIELD: A Competitive Simulation Environment for Training AI in Cybersecurity

CyberShield encompasses a comprehensive environment with multiple computers, each hosting various services with unique vulnerabilities. Within this environment, two opposing agents, defender and attacker, participate in a strategic battle, each equipped with distinct actions aimed at outsmarting the other. CyberShield is optimized for competitive multi-agent training using RL algorithms.
Paper: (2024) CYBERSHIELD: A Competitive Simulation Environment for Training AI in Cybersecurity

### Cyberwheel

Cyberwheel: A Reinforcement Learning Simulation Environment

Cyberwheel is a Reinforcement Learning (RL) simulation environment built for training and evaluating autonomous cyber defense models on simulated networks. It was built with modularity in mind, to allow users to build on top of it to fit their needs, supporting various robust configuration files to build networks, services, host types, defensive agents, and more. Cyberwheel is being developed by Oak Ridge National Lab (ORNL).
Paper: (2024) Towards a High Fidelity Training Environment for Autonomous Cyber Defense Agents

### Pentesting Training Framework for Reinforcement Learning Agents (PenGym)

PenGym: Pentesting Training Framework for Reinforcement Learning Agents

PenGym is a framework for creating and managing realistic environments used for the training of Reinforcement Learning (RL) agents for penetration testing purposes. PenGym uses the same API as the Gymnasium fork of the OpenAI Gym library, thus making it possible to employ PenGym with all the RL agents that follow those specifications. PenGym is being developed by Japan Advanced Institute of Science and Technology (JAIST) in collaboration with KDDI Research, Inc.
Paper: (2024) PenGym: Pentesting Training Framework for Reinforcement Learning Agents
Paper: (2025) PenGym: Realistic training environment for reinforcement learning pentesting agents
Thesis: (2024) Realistic Pentesting Training Framework for Reinforcement Learning Agents

### The ARCD Primary-level AI Training Environment (PrimAITE)

The ARCD Primary-level AI Training Environment (PrimAITE)

The ARCD Primary-level AI Training Environment (PrimAITE) provides an effective simulation capability for the purposes of training and evaluating AI in a cyber-defensive role.

### CSLE: The Cyber Security Learning Environment

CSLE: The Cyber Security Learning Environment

CSLE is a platform for evaluating and developing reinforcement learning agents for control problems in cyber security. It can be considered a cyber range specifically designed for reinforcement learning agents. Everything from network emulation to simulation and implementation of network commands has been co-designed to provide an environment where it is possible to train and evaluate reinforcement learning agents on practical problems in cyber security.
Paper: (2022) Intrusion Prevention Through Optimal Stopping
Thesis: (2024) Optimal Security Response to Network Intrusions in IT Systems

### AutoPentest-DRL

AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning

AutoPentest-DRL is an automated penetration testing framework based on Deep Reinforcement Learning (DRL) techniques. AutoPentest-DRL can determine the most appropriate attack path for a given logical network, and can also be used to execute a penetration testing attack on a real network via tools such as Nmap and Metasploit. This framework is intended for educational purposes, so that users can study the penetration testing attack mechanisms. AutoPentest-DRL is being developed by the Cyber Range Organization and Design (CROND) NEC-endowed chair at the Japan Advanced Institute of Science and Technology (JAIST) in Ishikawa, Japan.

### NASimEmu

NASimEmu

NASimEmu is a framework for training deep RL agents in offensive penetration-testing scenarios. It includes both a simulator and an emulator so that a simulation-trained agent can be seamlessly deployed in emulation. Additionally, it includes a random generator that can create scenario instances varying in network configuration and size while fixing certain features, such as exploits and privilege escalations. Furthermore, agents can be trained and tested in multiple scenarios simultaneously.

Paper: (2023) NASimEmu: Network Attack Simulator & Emulator for Training Agents Generalizing to Novel Scenarios
Framework: NASimEmu
Implemented agents: NASimEmu-agents

### gym-idsgame

gym-idsgame

An Abstract Cyber Security Simulation and Markov Game for OpenAI Gym.
Paper: (2020) Finding Effective Security Strategies through Reinforcement Learning and Self-Play

### CyberBattleSim (Microsoft)

CyberBattleSim

CyberBattleSim is an experimentation research platform to investigate the interaction of automated agents operating in a simulated abstract enterprise network environment. The simulation provides a high-level abstraction of computer networks and cyber security concepts. Its Python-based OpenAI Gym interface allows for the training of automated agents using reinforcement learning algorithms.
Blogpost: (2021) Gamifying machine learning for stronger security and AI models

### gym-malware

gym-malware

Malware Env for OpenAI Gym.
Paper: (2018) Learning to Evade Static PE Machine Learning Malware Models via Reinforcement Learning

### malware-rl

malware-rl

An extended and updated `gym_malware` that supports recent LIEF versions and an enhanced collection of models (EMBER, MalConv and SOREL-20M).
Paper: (2018) Learning to Evade Static PE Machine Learning Malware Models via Reinforcement Learning

### gym-flipit

gym-flipit

Gym environment for FLIPIT: The Game of "Stealthy Takeover" invented by Marten van Dijk, Ari Juels, Alina Oprea, and Ronald L. Rivest.
Paper: (2019) QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game

### gym-threat-defense

gym-threat-defense

Gym environment for the environment described in the paper: (2019) Optimal Defense Policies for Partially Observable Spreading Processes on Bayesian Attack Graphs

### gym-nasim

gym-nasim

Thesis: (2018) Autonomous Penetration Testing using Reinforcement Learning

### gym-optimal-intrusion-response

gym-optimal-intrusion-response

An OpenAI Gym interface to an MDP/Markov Game model for optimal intrusion response of a realistic infrastructure simulated using system traces.
Paper: (2021) Learning Intrusion Prevention Policies through Optimal Stopping

### sql_env

sql_env

Paper: (2021) SQL Injections and Reinforcement Learning: An Empirical Evaluation of the Role of Action Structure

### cage-challenge

cage-challenge-1

The first Cyber Autonomous Gym for Experimentation (CAGE) challenge environment released at the 1st International Workshop on Adaptive Cyber Defense held as part of the 2021 International Joint Conference on Artificial Intelligence (IJCAI).

cage-challenge-2

The second Cyber Autonomous Gym for Experimentation (CAGE) challenge environment announced at the AAAI-22 Workshop on Artificial Intelligence for Cyber Security (AICS).
Paper: (2023) On Autonomous Agents in a Cyber Defence Environment

cage-challenge-3

The third Cyber Autonomous Gym for Experimentation (CAGE) challenge environment.

cage-challenge-4

The fourth Cyber Autonomous Gym for Experimentation (CAGE) challenge environment.

### ATMoS

ATMoS

Paper: (2020) ATMoS: Autonomous Threat Mitigation in SDN using Reinforcement Learning

### MAB-Malware

MAB-malware

Paper: (2022) MAB-Malware: A Reinforcement Learning Framework for Attacking Static Malware Classifiers

### ASAP

Autonomous Security Analysis and Penetration Testing framework (ASAP)

Paper: (2020) Autonomous Security Analysis and Penetration Testing

### Yawning Titan

Yawning Titan

Yawning Titan is an abstract, highly flexible cyber security simulator that is capable of simulating a range of cyber security scenarios.
Paper: (2022) Developing Optimal Causal Cyber-Defence Agents via Cyber Security Simulation

### Cyborg

Cyborg

Cyborg is a gym for autonomous cyber operations research that is driven by the need to efficiently support reinforcement learning to train adversarial decision-making models through simulation and emulation. This is a variation of the environments used by cage-challenge above.
Paper: (2021) CybORG: A Gym for the Development of Autonomous Cyber Agents

### FARLAND

FARLAND (github repository missing)

FARLAND is a framework for advanced reinforcement learning for autonomous network defense that uniquely enables the design of network environments to gradually increase the complexity of models, providing a path for autonomous agents to increase their performance from apprentice to superhuman level in the task of reconfiguring networks to mitigate cyberattacks.
Paper: (2021) Network Environment Design for Autonomous Cyberdefense

### SecureAI

SecureAI

SecureAI: Deep Reinforcement Learning for Self-Protection in Non-Stationary Cloud Architectures.
Paper: (2021) An Intrusion Response Approach for Elastic Applications Based on Reinforcement Learning

### CYST

CYST

CYST is a multi-agent discrete-event simulation framework tailored for the cybersecurity domain. Its goal is to enable high-throughput and realistic simulation of cybersecurity interactions in arbitrary infrastructures.

Paper: (2020) Session-level Adversary Intent-Driven Cyberattack Simulator
Code: HERE

### CLAP

CLAP: Curiosity-Driven Reinforcement Learning Automatic Penetration Testing Agent

CLAP is a reinforcement learning PPO agent that performs penetration testing in a simulated computer network environment, the Network Attack Simulator (NASim). The agent is trained to scan for vulnerabilities in the network and exploit them to gain access to various network resources.

Paper: (2022) Behaviour-Diverse Automatic Penetration Testing: A Curiosity-Driven Multi-Objective Deep Reinforcement Learning Approach
Code: HERE

### CyGIL

CyGIL: A Cyber Gym for Training Autonomous Agents over Emulated Network Systems

CyGIL is an experimental testbed of an emulated RL training environment for network cyber operations. CyGIL uses a stateless environment architecture and incorporates the MITRE ATT&CK framework to establish a high fidelity training environment, while presenting a sufficiently abstracted interface to enable RL training. Its comprehensive action space and flexible game design allow the agent training to focus on particular advanced persistent threat (APT) profiles, and to incorporate a broad range of potential threats and vulnerabilities. By striking a balance between fidelity and simplicity, it aims to leverage state of the art RL algorithms for application to real-world cyber defence.

Paper: (2021) CyGIL: A Cyber Gym for Training Autonomous Agents over Emulated Network Systems

### BRAWL

BRAWL

BRAWL seeks to create a compromise by creating a system to automatically create an enterprise network inside a cloud environment. OpenStack is the only currently supported environment, but it is being designed in such a way as to easily support other cloud environments in the future.

### DETERLAB

DeterLab: Cyber-Defense Technology Experimental Research Laboratory

Since 2004, the DETER Cybersecurity Testbed Project has worked to create the necessary infrastructure (facilities, tools, and processes) to provide a national resource for experimentation in cyber security. The next generation of DETER envisions several conceptual advances in testbed design and experimental research methodology, targeting improved experimental validity, enhanced usability, and increased size, complexity, and diversity of experiments.

Paper: (2010) The DETER project: Advancing the science of cyber security experimentation and test

### Mininet

Mininet creates a realistic virtual network, running real kernel, switch and application code, on a single machine (VM, cloud or native), in seconds, with a single command.

Paper: (2015) Emulation of Software Defined Networks Using Mininet in Different Simulation Environments

### Vine

VINE: A Cyber Emulation Environment for MTD Experimentation

Paper: (2015) VINE: A Cyber Emulation Environment for MTD Experimentation

### CRATE

CRATE Exercise Control – A cyber defense exercise management and support tool

Paper: (2020) CRATE Exercise Control – A cyber defense exercise management and support tool

### GALAXY

Galaxy: A Network Emulation Framework for Cybersecurity

Paper: (2018) Galaxy: A Network Emulation Framework for Cybersecurity

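## [↑](#table-of-contents) Papers

### Surveys

* [(2025) Autonomous Penetration Testing Using Reinforcement Learning: A Systematic Literature Review](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5208526)
* [(2024) A Survey for Deep Reinforcement Learning Based Network Intrusion Detection](https://arxiv.org/abs/2410.07612)
* [(2024) The Path To Autonomous Cyber Defense](https://arxiv.org/pdf/2404.10788.pdf)
* [(2023) A Review of Techniques and Policies on Cybersecurity Using Artificial Intelligence and Reinforcement Learning Algorithms](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10260699)
* [(2023) Automated Cyber Defence: A Review](https://arxiv.org/pdf/2303.04926.pdf)
* [(2022) The Confluence of Networks, Games and Learning: A Game-Theoretic Framework for Multi-Agent Decision Making Over Networks](https://arxiv.org/abs/2105.08158)
* [(2022) Cyber-Security and Reinforcement Learning: A Brief Survey](https://www.sciencedirect.com/science/article/pii/S0952197622002512)
* [(2022) Blockchain and Federated Deep Reinforcement Learning Based Secure Cloud-Edge-End Collaboration in Power IoT](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9801730)
* [(2022) Deep Reinforcement Learning for Cybersecurity Threat Detection and Protection: A Review](https://arxiv.org/pdf/2206.02733.pdf)
* [(2022) Research and Challenges of Reinforcement Learning in Cyber Defense Decision-Making for Intranet Security](https://www.mdpi.com/1999-4893/15/4/134)
* [(2021) Reinforcement Learning for Feedback-Enabled Cyber Resilience](https://arxiv.org/pdf/2107.00783.pdf)
* [(2021) Prospective Artificial Intelligence Approaches for Active Cyber Defence](https://arxiv.org/pdf/2104.09981.pdf)
* [(2019) Deep Reinforcement Learning for Cyber Security](https://arxiv.org/abs/1906.05799)

### Demonstration papers

* [(2023) Demonstration of the Cyber Security Learning Environment (CSLE) v.0.2.0](https://www.youtube.com/watch?v=iE2KPmtIs2A)
* [(2022) A System for Interactive Examination of Learned Security Policies](https://ieeexplore.ieee.org/document/9789707) [**(Video)**](https://www.youtube.com/watch?v=18P7MjPKNDg)

### Position papers

* [(2026) Autonomous Penetration Testing Using Artificial Intelligence: A Cybersecurity Perspective](https://www.sciencedirect.com/science/article/pii/S2542660526000624)
* [(2025) Comparing Traditional Hacking Tools and AI-Driven Alternatives](https://ieeexplore.ieee.org/abstract/document/11012027)
* [(2025) The Path To Autonomous Cyber Defense](https://ieeexplore.ieee.org/document/10612251)
* [(2023) Autonomous Cyber Defence: A Roadmap from Lab to Ops](https://cetas.turing.ac.uk/sites/default/files/2023-06/autonomous_cyber_defence_final_report.pdf)
* [(2022) Mathematical Foundations of Cyber Defense](https://www.ams.org/journals/notices/202206/rnoti-p1019.pdf)

### Regular Papers

* [(2025) Penetration Testing Using AI: A Case Study of LLM- and RL-Based Attack Agents](https://link.springer.com/chapter/10.1007/978-3-032-02725-2_5)
* [(2025) Exploring the Efficacy of Multi-Agent Reinforcement Learning for Autonomous Cyber Defence: A CAGE Challenge 4 Perspective](https://ojs.aaai.org/index.php/AAAI/article/view/35158)
* [(2025) 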
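Online Incident Response Planning under Model Misspecification through Bayesian Learning and Belief Quantization](https://arxiv.org/pdf/2508.14385)
* [(2025) Incident Response Planning Using a Lightweight Large Language Model with Reduced Hallucination](https://arxiv.org/abs/2508.05188)
* [(2025) Adaptive Network Security Policies via Belief Aggregation and Rollout](https://arxiv.org/abs/2507.15163)
* [(2025) Evaluating AI Agents for Cyber Defense: A Comparison of Deep Reinforcement Learning and LLM Approaches](https://doi.org/10.1007/978-3-032-10489-2_36)
* [(2025) Scalable and Generalizable RL Agents for Attack Path Discovery via Continuous Invariant Spaces](https://ieeexplore.ieee.org/document/11352493)
* [(2025) Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence](https://arxiv.org/pdf/2507.14658)
* [(2025) Comparing Traditional Hacking Tools and AI-Driven Alternatives](https://ieeexplore.ieee.org/abstract/document/11012027/)
* [(2025) Less is More? Rewards in RL for Cyber Defence](https://arxiv.org/abs/2503.03245)
* [(2024) Hierarchical Multi-Agent Reinforcement Learning for Cyber Network Defense](https://arxiv.org/abs/2410.17351)
* [(2024) Intrusion Response System for In-Vehicle Networks: An Uncertainty-Aware Deep Reinforcement Learning-Based Approach](https://ieeexplore.ieee.org/abstract/document/10773966)
* [(2024) Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning](https://arxiv.org/pdf/2412.04078)
* [(2024) Inherently Interpretable and Uncertainty-Aware Models for Online Learning in Cyber-Security Problems](https://arxiv.org/pdf/2411.09393)
* [(2024) Advancing the Automation Capability of Verifying Security Protocols](https://ieeexplore.ieee.org/document/10443063)
* [(2024) Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks](https://arxiv.org/pdf/2410.17431)
* [(2024) Intrusion Tolerance as a Two-Level Game](https://link.springer.com/chapter/10.1007/978-3-031-74835-6_1)
* [(2024) Entity-Based Reinforcement Learning for Autonomous Cyber Defence](https://arxiv.org/pdf/2410.17647)
* [(2024) Hierarchical Multi-Agent Reinforcement Learning for Cyber Network Defense](https://arxiv.org/pdf/2410.17351)
* [(2024) Multi-Agent Actor-Critics in Autonomous Cyber Defense](https://arxiv.org/pdf/2410.09134)
* [(2024) NHSC-PPO-Based Penetration Testing Path Discovery](https://dl.acm.org/doi/10.1145/3650400.3650693)
* [(2024) Using Deep Reinforcement Learning for Cyber-Attack Simulation to Enhance Cybersecurity](https://www.mdpi.com/2079-9292/13/3/555)
* [(2024) A Training Environment for Reinforcement Learning Cybersecurity Agents](https://ieeexplore.ieee.org/abstract/document/10690552)
* [(2024) Network Defense Decision-Making Based on Deep Reinforcement Learning and Dynamic Game Theory](https://ieeexplore.ieee.org/abstract/document/10700942)
* [(2024) Action Robust Reinforcement Learning for Air Mobility Deconfliction against Conflict-Induced Spoofing](https://ieeexplore.ieee.org/abstract/document/10682497)
* [(2024) Causally Aware Reinforcement Learning Agents for Autonomous Cyber Defence](https://www.sciencedirect.com/science/article/pii/S0950705124011559)
* [(2024) 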
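Reinforcement Learning for Autonomous Resilient Cyber Defence](https://i.blackhat.com/BH-US-24/Presentations/US-24-MilesFarmer-ReinforcementLearningForAutonomousResilientCyberDefence-wp.pdf)
* [(2024) Dynamic Fraud Detection: Integrating Reinforcement Learning into Graph Neural Networks](https://arxiv.org/pdf/2409.09892)
* [(2024) Towards Autonomous Cyber Defense: A Reinforcement Learning Environment for Defense Agents](https://ieeexplore.ieee.org/abstract/document/10667139)
* [(2024) Man-in-the-Middle Attack Detection in Model-Free Reinforcement Learning for the Linear Quadratic Regulator](https://ieeexplore.ieee.org/abstract/document/10644963)
* [(2024) Optimizing Mitigation Deployment on Enhanced ATT&CK Using Deep Reinforcement Learning](https://link.springer.com/article/10.1007/s00607-024-01344-4)
* [(2024) Game-Theoretic Reinforcement Learning-Based Mixed Strategy against Jamming Attacks for Formation Tracking Control](https://ieeexplore.ieee.org/abstract/document/10660492)
* [(2024) A Survey of UAV Security and Deep Reinforcement Learning](https://www.sciencedirect.com/science/article/pii/S1570870524002531)
* [(2024) Enhancing Underwater IoT Security: A Collaborative Pursuit Strategy Based on Multi-Agent Reinforcement Learning](https://ieeexplore.ieee.org/abstract/document/10644013)
* [(2024) Risk-Aware Federated Reinforcement Learning for Secure Internet of Vehicles Communications](https://www.computer.org/csdl/journal/tm/5555/01/10643312/1ZAxmYkGmOs)
* [(2024) Deep Reinforcement Learning-Based Multicast Moving Target Defense for Software-Defined Satellite Networks](https://ieeexplore.ieee.org/abstract/document/10622302)
* [(2024) Finding Optimal Security Policies for Autonomous Cyber Operations through Competitive Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10639381)
* [(2024) An Intelligent Reinforcement Learning-Based Threat Detection Method for Mobile Edge Networks](https://onlinelibrary.wiley.com/doi/abs/10.1002/nem.2294)
* [(2024) Towards a High Fidelity Training Environment for Autonomous Cyber Defense Agents](https://doi.org/10.1145/3675741.3675752)
* [(2024) Reinforcement Learning for Efficient and Effective Malware Investigation during Cyber Incident Response](https://arxiv.org/pdf/2408.01999)
* [(2024) Leveraging Deep Reinforcement Learning for Cyber-Attack Paths Prediction: Formulation, Generalization, and Evaluation](https://hal.science/hal-04662428/document)
* [(2024) Efficient Penetration Testing Path Planning Based on Reinforcement Learning with Episodic Memory](https://cdn.techscience.cn/files/CMES/2024/TSP_CMES-140-3/TSP_CMES_28553/TSP_CMES_28553.pdf)
* [(2024) How to Train your Antivirus: RL-based Hardening through the Problem-Space](https://kclpure.kcl.ac.uk/ws/portalfiles/portal/278114787/AutoRobust_RAID_Accepted.pdf)
* [(2024) Multi-Agent Reinforcement Learning for Cybersecurity: Approaches and Challenges](https://ceur-ws.org/Vol-3735/paper_09.pdf)
* [(2024) Evaluation of Reinforcement Learning for Autonomous Penetration Testing Using A3C, Q-learning and DQN](https://arxiv.org/pdf/2407.15656)
* [(2024) Optimizing Cyber Defense in Dynamic Active Directories through Reinforcement Learning](https://arxiv.org/pdf/2406.19596)
* [(2024) Secure Multi-Agent Reinforcement Learning for Wireless Applications against Adversarial Communications](https://ieeexplore.ieee.org/abstract/document/10584557)
* [(2024) 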
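Autonomous Network Defense in Cloud Data Center Environments Based on Reinforcement Learning](https://www.spiedigitallibrary.org/conference-proceedings-of-spie/13185/131850H/Autonomous-network-defense-in-cloud-data-center-environments-based-on/10.1117/12.3032677.full)
* [(2024) Mitigating DDoS While Guaranteeing QoS: A Deep Reinforcement Learning-Based Approach](https://ieeexplore.ieee.org/abstract/document/10588889)
* [(2024) Optimal Defender Strategies for CAGE-2 Using Causal Modeling and Tree Search](https://arxiv.org/abs/2407.11070)
* [(2024) Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning](https://arxiv.org/pdf/2407.05775v1)
* [(2024) A Self-Evolving Moving Target Defense Method against Unknown Attacks Based on Deep Reinforcement Learning](https://ieeexplore.ieee.org/abstract/document/10586877)
* [(2024) Attention-Enhanced Multi-Agent Reinforcement Learning against Observation Perturbations for Distributed Volt-VAR Control](https://ieeexplore.ieee.org/abstract/document/10587051)
* [(2024) CyberRL: Brain-Inspired Reinforcement Learning for Efficient Network Intrusion Detection](https://ieeexplore.ieee.org/abstract/document/10579883)
* [(2024) Leveraging Reinforcement Learning in Red Teaming for Advanced Ransomware Attack Simulations](https://arxiv.org/pdf/2406.17576)
* [(2024) Deep Reinforcement Learning for Adaptive Defense in Cybersecurity](https://dl.acm.org/doi/abs/10.1145/3660853.3660930)
* [(2024) AI for AI-Based Intrusion Detection as a Service: Reinforcement Learning to Configure Models, Tasks, and Capacities](https://www.sciencedirect.com/science/article/pii/S1084804524001139)
* [(2024) DeepIDPS: An Adaptive DRL-Based Intrusion Detection and Prevention System for SDN](https://cis.temple.edu/~jiewu/research/publications/Publication_files/ICC2024.pdf)
* [(2024) Tracking Attackers in an Intranet Using Multi-Agent Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10545725)
* [(2024) A Novel Two-Step Attack and Defense Strategy for Computer Networks](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10544975)
* [(2024) AdaRisk: Risk-Adaptive Deep Reinforcement Learning for Vulnerable Nodes Detection](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10549866)
* [(2024) Designing Autonomous Cyber Defense Agents Using Hybrid AI Models](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10540988)
* [(2024) A Reinforcement Learning Approach to IoT Security Using CyberBattleSim: A Simulation-Based Study](https://ieeexplore.ieee.org/abstract/document/10541295)
* [(2024) Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine](https://arxiv.org/pdf/2405.15908)
* [(2024) Evolving Malware Detection through Instant Dynamic Graph Inverse Reinforcement Learning](https://www.sciencedirect.com/science/article/pii/S0950705124006257)
* [(2024) Using Reinforcement Learning and Intelligent Techniques for Smart Prevention of DDoS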
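Attacks](https://journals.flvc.org/FLAIRS/article/view/135349)
* [(2024) Research on the Application of Reinforcement Learning Strategies in Network Situation Risk Awareness and Prevention](https://link.springer.com/article/10.1007/s44196-024-00492-x)
* [(2024) Reinforcement Learning-Based Autonomous Attacker to Uncover Computer Network Vulnerabilities](https://link.springer.com/article/10.1007/s00521-024-09668-0)
* [(2024) Trustworthy Autonomous Driving via Defense-Aware Robust Reinforcement Learning against Worst-Case Observation Perturbations](https://www.sciencedirect.com/science/article/pii/S0968090X24001530)
* [(2024) DRL²FC: An Attack-Resilient Controller for Automatic Generation Control Based on Deep Reinforcement Learning](https://arxiv.org/pdf/2404.16974)
* [(2024) Deep Reinforcement Learning-Based Explainable Cross-Layer Intrusion Response System for Industrial Control Systems](https://ieeexplore.ieee.org/abstract/document/10508089)
* [(2024) A Hierarchical Multi-Agent Reinforcement Learning-Based Method for Network Attack-Defense Games and Collaborative Defense Decision-Making](https://www.sciencedirect.com/science/article/pii/S016740482400172X)
* [(2024) Leveraging Deep Reinforcement Learning Technique for Intrusion Detection in SCADA Infrastructure](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10504835)
* [(2024) Intrusion Detection System Alert Prioritization Based on an Off-Policy Actor-Critic Deep Reinforcement Learning Method](https://www.sciencedirect.com/science/article/pii/S016740482400155X)
* [(2024) Security Assessment of Industrial Control Systems Applying Reinforcement Learning](https://www.mdpi.com/2227-9717/12/4/801)
* [(2024) Foundations of Cyber Resilience: The Confluence of Game, Control, and Learning Theories](https://arxiv.org/pdf/2404.01205.pdf)
* [(2024) Intrusion Tolerance for Networked Systems through Two-Level Feedback Control](https://arxiv.org/abs/2404.01741)
* [(2024) Optimal Attack Path Planning Based on Reinforcement Learning and a Cyber Threat Knowledge Graph Combining ATT&CK for Air Traffic Management Systems](https://ieeexplore.ieee.org/abstract/document/10473161)
* [(2024) Comparison of DQN-Improved Algorithms for Stochastic Game-Based Automated Edge Intelligence-Enabled IoT Malware Spread-Suppression Strategies](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10478522)
* [(2024) A Deep Reinforcement Learning Approach for Cyber-Attack Detection](https://online-journals.org/index.php/i-joe/article/view/48229)
* [(2024) WENDIGO: Deep Reinforcement Learning for Denial-of-Service Query Discovery in GraphQL](https://kclpure.kcl.ac.uk/ws/portalfiles/portal/251249221/Wendigo.pdf)
* [(2024) Security Enhancement in Energy-Efficient Wireless Sensor Networks Based on Deep Reinforcement Learning Strategies](https://www.mdpi.com/1424-8220/24/6/1993)
* [(2024) Symbiotic Game and Foundation Models for Cyber Deception Operations in Strategic Cyber Warfare](https://arxiv.org/pdf/2403.10570.pdf)
* [(2024) Mirage: Cyber Deception against Autonomous Cyber Attacks in Emulation and Simulation](https://link.springer.com/article/10.1007/s12243-024-01018-4)
* [(2024) PenGym: Pentesting Training Framework for Reinforcement Learning Agents](https://www.jaist.ac.jp/~razvan/publications/pengym_framework_rl_agents.pdf)
* [(2024) How to Train your Antivirus: RL-based Hardening through the Problem-Space](https://arxiv.org/pdf/2402.19027.pdf)
* [(2024) 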
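Conjectural Online Learning with First-Order Beliefs in Asymmetric Information Stochastic Games](https://arxiv.org/abs/2402.18781)
* [(2024) Delegating Federated Reinforcement Learning to Devise Cybersecurity Strategies](https://ieeexplore.ieee.org/document/10440912)
* [(2024) Transforming Cybersecurity Dynamics: Enhanced Self-Play Reinforcement Learning in Intrusion Detection and Prevention System](https://www.researchgate.net/publication/378288610_Transforming_Cybersecurity_Dynamics_Enhanced_Self-Play_Reinforcement_Learning_in_Intrusion_Detection_and_Prevention_System)
* [(2024) Automated Security Response through Online Learning with Adaptive Conjectures](https://arxiv.org/abs/2402.12499)
* [(2024) IoTWarden: A Deep Reinforcement Learning Based Real-Time Defense System to Mitigate Trigger-Action IoT Attacks](https://arxiv.org/pdf/2401.08141.pdf)
* [(2024) Discovering Command and Control (C2) Channels on Tor and Public Networks Using Reinforcement Learning](https://arxiv.org/pdf/2402.09200.pdf)
* [(2024) Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey](https://arxiv.org/abs/2310.07745)
* [(2024) Reinforcement Learning Meets Network Intrusion Detection: A Transferable and Adaptable Framework for Anomaly Behavior Identification](https://ieeexplore.ieee.org/abstract/document/10399344)
* [(2024) Use of Graph Neural Networks in Aiding Defensive Cyber Operations](https://arxiv.org/pdf/2401.05680.pdf)
* [(2024) LLM-Powered Code Vulnerability Repair with Reinforcement Learning and Semantic Reward](https://arxiv.org/pdf/2401.03374.pdf)
* [(2024) Tackling Resource-Exhaustion Bugs with Black-Box Fuzzing and Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10381445)
* [(2024) Enhancing Road Safety and Cybersecurity in Traffic Management Systems: Leveraging the Potential of Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10381696)
* [(2023) Multi-Agent Reinforcement Learning for Maritime Operational Technology Cyber Security](https://arxiv.org/abs/2401.10149)
* [(2023) CO-DECYBER: Co-operative Decision Making for Cybersecurity Using Deep Multi-Agent Reinforcement Learning](https://www.researchgate.net/publication/374373412_CO-DECYBER_Co-operative_Decision_Making_for_Cybersecurity_using_Deep_Multi-agent_Reinforcement_Learning?_tp=eyJjb250ZXh0Ijp7ImZpcnN0UGFnZSI6InByb2ZpbGUiLCJwYWdlIjoicHJvZmlsZSJ9fQ)
* [(2023) Optimal Deception Asset Deployment in Cybersecurity: A Nash Q-Learning Approach in Multi-Agent Stochastic Games](https://www.mdpi.com/2076-3417/14/1/357)
* [(2023) An Adaptive Deep Reinforcement Learning Algorithm for Cyber-Attack Defense in Distribution Systems with High Penetration of Distributed Energy Resources](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10368040)
* [(2023) WebGuardRL: An Innovative Reinforcement Learning-Based Approach for Advanced Web Attack Detection](https://dl.acm.org/doi/abs/10.1145/3628797.3628982)
* [(2023) PSP-Mal: Evading Malware Detection via Prioritized Experience-Based Reinforcement Learning with Shapley Prior](https://dl.acm.org/doi/abs/10.1145/3627106.3627178)
* [(2023) Canaries and Whistles: Resilient Drone Communication Networks with (or without) Deep Reinforcement Learning](https://dl.acm.org/doi/abs/10.1145/3605764.3623986)
* [(2023) Using an Improved Double Dueling Deep Q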
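Network for Effective Defense Strategies in Cybersecurity](https://www.sciencedirect.com/science/article/pii/S0167404823004881)
* [(2023) Autonomous Network Defense against Dynamic Multi-Strategy Infrastructure DDoS Attacks](https://ieeexplore.ieee.org/abstract/document/10288937)
* [(2023) Distributed Web Hacking by Adaptive Consensus-Based Reinforcement Learning](https://www.sciencedirect.com/science/article/pii/S0004370223001789)
* [(2023) Reward Shaping for Happier Autonomous Cyber Security Agents](https://arxiv.org/pdf/2310.13565.pdf)
* [(2023) MalBoT-DRL: Malware Botnet Detection Using Deep Reinforcement Learning in IoT Networks](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10283893)
* [(2023) Raiju: Reinforcement Learning-Guided Post-Exploitation for Automating Security Assessment of Network Systems](https://arxiv.org/pdf/2309.15518.pdf)
* [(2023) A Security Defense Strategy Algorithm for the Internet of Things Based on Deep Reinforcement Learning](https://www.sciencedirect.com/science/article/pii/S266729522300065X?via%3Dihub)
* [(2023) Enhancing Exfiltration Path Analysis Using Reinforcement Learning](https://arxiv.org/pdf/2310.03667.pdf)
* [(2023) Automated Penetration Testing with Fine-Grained Control through Deep Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10272349&tag=1)
* [(2023) Security Awareness in Smart Homes and Internet of Things Networks through Swarm-Based Cybersecurity Penetration Testing](https://www.mdpi.com/2078-2489/14/10/536)
* [(2023) A Soft Actor-Critic Reinforcement Learning Algorithm for Network Intrusion Detection](https://www.sciencedirect.com/science/article/pii/S0167404823004121)
* [(2023) Network Intrusion Detection System Using Reinforcement Learning Techniques](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10245608)
* [(2023) PENTESTGPT: An LLM-Empowered Automatic Penetration Testing Tool](https://arxiv.org/pdf/2308.06782.pdf)
* [(2023) On Autonomous Agents in a Cyber Defence Environment](https://arxiv.org/pdf/2309.07388.pdf)
* [(2023) EPPTA: Efficient Partially Observable Reinforcement Learning Agent for Penetration Testing Applications](https://d197for5662m48.cloudfront.net/documents/publicationstatus/147896/preprint_pdf/b159e549387e455fd76cdb936f0a8b33.pdf)
* [(2023) How to Disturb Network Reconnaissance: A Moving Target Defense Approach Based on Deep Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10247015)
* [(2023) Scalable Learning of Intrusion Responses through Recursive Decomposition](https://arxiv.org/abs/2309.03292)
* [(2023) Deep Reinforcement Learning for Intrusion Detection in IoT: Best Practices, Lessons Learnt, and Open Challenges](https://www.sciencedirect.com/science/article/pii/S1389128623004619)
* [(2023) An Automated Intrusion Detection and Prevention Model for Enhanced Network Security and Threat Assessment](https://www.ijcna.org/Manuscripts/IJCNA-2023-O-42.pdf)
* [(2023) A FlipIt Game Deception Strategy Selection Method Based on Deep Reinforcement Learning](https://downloads.hindawi.com/journals/ijis/2023/5560416.pdf)
* [(2023) 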
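Intelligent Security-Aware Routing: Using Model-Free Reinforcement Learning](https://ieeexplore.ieee.org/abstract/document/10230195)
* [(2023) When Moving Target Defense Meets Attack Prediction in Digital Twins: A Convolutional and Hierarchical Reinforcement Learning Approach](https://ieeexplore.ieee.org/abstract/document/10234402)
* [(2023) Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments](https://arxiv.org/pdf/2308.12086.pdf)
* [(2023) Deep Reinforcement Learning for Intelligent Penetration Testing Path Design](https://www.mdpi.com/2076-3417/13/16/9467)
* [(2023) Social Engineering Attack-Defense Strategies Based on Reinforcement Learning](https://www.techscience.com/csse/v47n2/53636/html)
* [(2023) Real-Time Defensive Strategy Selection via Deep Reinforcement Learning](https://dl.acm.org/doi/abs/10.1145/3600160.3600176)
* [(2023) CyberForce: A Federated Reinforcement Learning Framework for Malware Mitigation](https://arxiv.org/pdf/2308.05978.pdf)
* [(2023) Simulating All Archetypes of SQL Injection Vulnerability Exploitation Using Reinforcement Learning Agents](https://link.springer.com/article/10.1007/s10207-023-00738-3)
* [(2023) Research on Active Defense Decision-Making Method for Cloud Boundary Networks Based on Reinforcement Learning of Intelligent Agent](https://www.sciencedirect.com/science/article/pii/S2667295223000430?via%3Dihub)
* [(2023) Adversarial Deep Reinforcement Learning for Cyber Security in Software Defined Networks](https://arxiv.org/pdf/2308.04909.pdf)
* [(2023) Using a POMDP-Based Approach to Address Uncertainty-Aware Adaptation for Self-Protecting Software](https://arxiv.org/pdf/2308.02134.pdf)
* [(2023) EIReLaND: Evaluating and Interpreting Reinforcement-Learning-Based Network Defenses](https://www.csl.sri.com/users/gehani/papers/ACD-2023.EIReLaND.pdf)
* [(2023) An SDN/NFV-Based Framework to Autonomously Defend against Slow-Rate DDoS Attacks Using Reinforcement Learning](https://www.sciencedirect.com/science/article/pii/S0167739X23003047)
* [(2023) Whole Campaign Emulation with Reinforcement Learning for Cyber Test](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10208253)
* [(2023) Neuroevolution for Autonomous Cyber Defense](https://dl.acm.org/doi/pdf/10.1145/3583133.3590596)
* [(2023) Reinforcement Learning-Based Attack Graph Analysis for a Wastewater Treatment Plant](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10192325)
* [(2023) QL vs. SARSA: Performance Evaluation for Intrusion Prevention Systems in Software-Defined IoT Networks](https://ieeexplore.ieee.org/abstract/document/10183144)
* [(2023) TSGS: Two-Stage Security Game Solution Based on Deep Reinforcement Learning for the Internet of Things](https://www.sciencedirect.com/science/article/pii/S0957417423014677)
* [(2023) Security-Aware Resource Allocation Scheme Based on DRL in Cloud-Edge-Terminal Cooperative Vehicular Networks](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10176313)
* [(2023) Network Intrusion Detection System Using Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10170630)
* [(2023) Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents](https://arxiv.org/pdf/2304.01244.pdf)
* [(2023) Learning Near-Optimal Intrusion Responses against Dynamic Attackers](https://ieeexplore.ieee.org/document/10175554)
* [(2023) 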
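A Curriculum Framework for Autonomous Network Defense Using Multi-Agent Reinforcement Learning](https://ieeexplore.ieee.org/abstract/document/10165310)
* [(2023) Enhancing Situation Awareness in Beyond Visual Range Air Combat with Reinforcement Learning-Based Decision Support](https://ieeexplore.ieee.org/abstract/document/10156497)
* [(2023) Reinforcement Learning for Undetectable Attacks against Automatic Generation Control](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10159364)
* [(2023) Digital Twins for Security Automation](https://ieeexplore.ieee.org/abstract/document/10154288)
* [(2023) Inroads into Autonomous Network Defence Using Explained Reinforcement Learning](https://arxiv.org/pdf/2306.09318v1.pdf)
* [(2023) Automated Adversary-in-the-Loop Cyber-Physical Defense Planning](https://dl.acm.org/doi/pdf/10.1145/3596222)
* [(2023) RLAuth: A Risk-Based Authentication System Using Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10151855)
* [(2023) SQIRL: Grey-Box Detection of SQL Injection Vulnerabilities Using Reinforcement Learning](http://www.doc.ic.ac.uk/~maffeis/papers/usenix23.pdf)
* [(2023) Dual Reinforcement Learning-Based Attack Path Prediction for 5G Industrial Cyber-Physical Systems](https://ieeexplore.ieee.org/abstract/document/10149069)
* [(2023) Detecting State-of-Charge False Reporting Attacks via a Reinforcement Learning Approach](https://ieeexplore.ieee.org/abstract/document/10149139)
* [(2023) Learning to Defend by Attacking (and Vice-Versa): Transfer of Learning in Cybersecurity Games](https://arxiv.org/pdf/2306.02165.pdf)
* [(2023) NASimEmu: Network Attack Simulator & Emulator for Training Agents Generalizing to Novel Scenarios](https://arxiv.org/abs/2305.17246)
* [(2023) A Collaborative Stealthy DDoS Detection Method Based on Reinforcement Learning at the Edge of the Internet of Things](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10133833)
* [(2023) Intelligent SDWN Routing Algorithm Based on Network Situational Awareness and Deep Reinforcement Learning](https://arxiv.org/pdf/2305.10441.pdf)
* [(2023) Trojan Playground: A Reinforcement Learning Framework for Hardware Trojan Insertion and Detection](https://arxiv.org/pdf/2305.09592.pdf)
* [(2023) Decentralized Anomaly Detection in Cooperative Multi-Agent Reinforcement Learning](https://people.kth.se/~gyuri/Pub/KazariSD-DistributedDetectionMARL-IJCAI23.pdf)
* [(2023) Evolving Prevention Strategies for 6G Networks through Stochastic Games and Reinforcement Learning](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10115274)
* [(2023) Cyber-Attack Detection Using the Bellman Optimality Equation in Reinforcement Learning](https://www.atlantis-press.com/article/125986296.pdf)
* [(2023) Grey-Box Penetration Testing of Cloud Access Control Combining IAM Modeling and Deep Reinforcement Learning](https://arxiv.org/pdf/2304.14540.pdf)
* [(2023) Multi-Agent CyberBattleSim for RL Cyber Operation Agents](https://arxiv.org/pdf/2304.11052.pdf)
* [(2023) Reinforcement Learning Solution for Cyber-Physical Systems Security against Replay Attacks](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10105656)
* [(2023) AIRS: Explanation for Deep Reinforcement Learning Based Security Applications](https://www.usenix.org/system/files/sec23fall-prepub-36-yu-jiahao.pdf)
* [(2023) 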
SSQLi：一种基于学习的 SQL 注入黑盒对抗攻击方法](https://www.mdpi.com/1999-5903/15/4/133) * [(2023) 论强化学习在攻击和防御负荷频率控制中的应用](https://arxiv.org/pdf/2303.15736.pdf) * [(2023) 基于深度强化学习的面向基于容器的云的最优主动防御安全框架](https://www.mdpi.com/2079-9292/12/7/1598) * [(2023) AutoCAT：用于自动化探索缓存时序攻击的强化学习](https://hsienhsinlee.github.io/MARS/pub/hpca2023.pdf) * [(2023) 应用强化学习以增强抵御对抗性仿真的网络安全](https://www.mdpi.com/1424-8220/23/6/3000) * [(2023) 离线 RL+CKG：用于网络安全任务的混合 AI 模型](https://ebiquity.umbc.edu/_file_directory_/papers/1180.pdf) * [(2023) 使用基于图的攻击仿真学习自动化防御策略](https://www.ndss-symposium.org/wp-content/uploads/2023/09/wosoc2023-23006-paper.pdf) * [(2023) 针对恶意软件图像的网络自动网络弹性防御方法](https://ieeexplore.ieee.org/abstract/document/10043078) * [(2023) 针对 DoS 攻击在多跳网络上的能量调度：深度强化学习方法](https://www.sciencedirect.com/science/article/pii/S0893608023000916) * [(2023) 将网络安全视为井字棋游戏：在网络对抗性人工智能系统中使用自主前进（攻击）和后退（防御）的渗透测试](https://ieeexplore.ieee.org/document/10034922) * [(2023) 动态对抗不确定性下用于网络系统防御的深度强化学习](https://arxiv.org/pdf/2302.01595.pdf) * [(2023) 有本事就来抓我：使用 Q-Learning 算法改进网络安全中的对手](https://www.researchgate.net/publication/368330555_Catch_Me_If_You_Can_Improving_Adversaries_in_Cyber-Security_With_Q-Learning_Algorithms/references) * [(2023) 使用强化学习进行信息物理系统的安全分析](https://www.mdpi.com/1424-8220/23/3/1634) * [(2023) 超越冯·诺依曼时代：类脑超维计算的救援](https://dl.acm.org/doi/pdf/10.1145/3566097.3568354) * [(2023) 使用网络攻击模式的语义嵌入和深度强化学习增加攻击者在 SSH 蜜罐上的参与度](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10022206) * [(2023) 迈向动态夺旗赛训练环境以用于强化学习攻击性安全 Agent](https://ieeexplore.ieee.org/abstract/document/10020389) * [(2023) 利用深度强化学习在侦察与漏洞利用阶段实现渗透测试自动化](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10013801) * [(2023) HAXSS：用于 XSS 载荷生成的分层强化学习](http://wwwhomes.doc.ic.ac.uk/~maffeis/papers/trustcom22.pdf) * [(2023) 基于迁移双重深度 Q 网络的车联网 DDoS 检测方法](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10005139) * [(2023) 用于高效且有效的大型网络自动化渗透测试的分层强化学习](https://link.springer.com/article/10.1007/s10844-022-00738-0) * [(2023) 在 Web 应用漏洞评估中动态生成攻击载荷的原型 
Agent](https://www.researchgate.net/publication/379712682_Prototyping_an_Agent_for_Dynamic_Generation_of_Attack-Payloads_in_Web_Application_Vulnerability_Assessment) * [(2023) 通过利用领域自适应理论威慑渗透测试中的对抗性学习](https://ieeexplore.ieee.org/document/10137792) * [(2023) 深度强化学习在攻击和保护基于结构特征的恶意 PDF 检测器中的应用](https://www.sciencedirect.com/science/article/abs/pii/S0167739X22003740) * [(2023) ReinforSec：一种通过强化学习生成合成恶意软件样本和拒绝服务攻击的自动化生成器](https://www.mdpi.com/1424-8220/23/3/1231) * [(2022) 改进基于 POMDPs 的深度循环 Q 网络用于自动化渗透测试](https://www.mdpi.com/2076-3417/12/20/10339) * [(2022) 使用强化学习的综合临床环境安全分析](https://www.mdpi.com/2306-5354/9/6/253) * [(2022) 使用 AI 强化渗透测试](https://www.researchgate.net/publication/362611895_Reinforcing_Penetration_Testing_Using_AI) * [(2022) DUSC-DQN：一种用于智能渗透测试路径设计的改进深度 Q 网络](https://ieeexplore.ieee.org/document/9846482) * [(2022) 使用深度强化学习进行攻击图博弈的最优策略选择](https://ieeexplore.ieee.org/document/10074866) * [(2022) 深度强化学习在 FlipIt 安全博弈中的应用](https://arxiv.org/pdf/2002.12909.pdf) * [(2022) DRAGON：用于自主电网运行与攻击检测的深度强化学习](https://dl.acm.org/doi/10.1145/3564625.3567969) * [(2022) 一种无模型的入侵响应系统方法](https://www.sciencedirect.com/science/article/pii/S2214212622000400) * [(2022) 用于在网络靶场场景中模拟正常与恶意行为的强化学习 Agent](https://ceur-ws.org/Vol-3260/paper1.pdf) * [(2022) 基于强化学习的供应链网络顺序拓扑攻击](https://ieeexplore.ieee.org/abstract/document/9970706) * [(2022) 防御以制胜：在防御高级持续性威胁时限制信息泄露](https://ieeexplore.ieee.org/abstract/document/9987540) * [(2022) 如何利用强化学习攻击和防御下一代无线电接入网络切片](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9984930) * [(2022) 知识引导的双人强化学习在网络攻击与防御中的应用](https://ebiquity.umbc.edu/_file_directory_/papers/1173.pdf) * [(2022) 超越 CAGE：调查已学习的自主网络防御策略的泛化能力](https://arxiv.org/pdf/2211.15557.pdf) * [(2022) 从自动化到自主网络防御的桥梁：表格型 Q-Learning 的基础分析。](https://dl.acm.org/doi/pdf/10.1145/3560830.3563732) * [(2022) 自主渗透测试中针对大动作空间的级联强化学习 Agent。](https://www.mdpi.com/2076-3417/12/21/11265) * [(2022) 软件定义网络中的无模型深度强化学习。](https://www.semanticscholar.org/reader/fd2fc84bc8366962b90c1c8228ff12ad17154cbb) * [(2022) 
带有威胁规避的分层强化学习指导。](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9940160) * [(2022) 通过强化学习、攻击图和网络地形揭示监控检测路线。](https://arxiv.org/pdf/2211.03027.pdf) * [(2022) 自主智能网络防御中动态决策的认知模型。](https://www.researchgate.net/profile/Baptiste-Prebot/publication/364965185_Cognitive_Models_of_Dynamic_Decisions_in_Autonomous_Intelligent_Cyber_Defense/links/636165142f4bca7fd0229e7b/Cognitive-Models-of-Dynamic-Decisions-in-Autonomous-Intelligent-Cyber-Defense.pdf) * [(2022) 使用深度强化学习优化网络安全事件响应决策。](https://ijece.iaescore.com/index.php/IJECE/article/view/28164/16141) * [(2022) 针对未知攻击的鲁棒移动目标防御：一种元强化学习方法](https://www.cs.tulane.edu/~zzheng3/publication/metaRL-MTD.pdf) * [(2022) 学习博弈以防御网络系统中的高级持续性威胁](https://ieeexplore.ieee.org/abstract/document/9923774) * [(2022) 使用深度强化学习符合 IEEE P2668 标准的多层 IoT-DDoS 防御系统](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9916301) * [(2022) 信息物理系统的隐私增强入侵检测与防御：一种深度强化学习方法](https://downloads.hindawi.com/journals/scn/2022/4996427.pdf) * [(2022) DeepThrottle：用于路由节流以防御 SDN 中 DDoS 攻击的深度强化学习](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9894298) * [(2022) 突破自适应与成本感知的硬件辅助零日恶意软件检测：一种基于强化学习的方法](https://www.researchgate.net/profile/Zhangying-He/publication/364290246_Breakthrough_to_Adaptive_and_Cost-Aware_Hardware-Assisted_Zero-Day_Malware_Detection_A_Reinforcement_Learning-Based_Approach/links/634384d82752e45ef6a78bc6/Breakthrough-to-Adaptive-and-Cost-Aware-Hardware-Assisted-Zero-Day-Malware-Detection-A-Reinforcement-Learning-Based-Approach.pdf) * [(2022) 缓解 5G 异构网络中的干扰攻击：一种联邦深度强化学习方法](https://ieeexplore.ieee.org/abstract/document/9914678) * [(2022) 基于深度强化学习的逃避生成对抗网络用于僵尸网络检测](https://arxiv.org/pdf/2210.02840.pdf) * [(2022) 使用改进的 D3QN 在 SDN 中缓解自适应威胁](https://www.spiedigitallibrary.org/conference-proceedings-of-spie/12339/1233911/Adaptive-threat-mitigation-in-SDN-using-improved-D3QN/10.1117/12.2652679.full?SSO=1) * [(2022) 通过强化学习对 IoT 
设备边缘服务器进行安全攻击的综合调查](https://www.researchgate.net/profile/Anit-Kumar-6/publication/363832239_A_Comprehensive_Survey_on_Security_Attacks_to_Edge_Server_of_IoT_Devices_through_Reinforcement_Learning/links/632ffdab86b22d3db4de4061/A-Comprehensive-Survey-on-Security-Attacks-to-Edge-Server-of-IoT-Devices-through-Reinforcement-Learning.pdf) * [(2022) 基于深度强化学习的智能电网蠕虫检测](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9880818) * [(2022) 物理层安全下基于深度强化学习的 IRS 辅助移动边缘计算](https://www.sciencedirect.com/science/article/pii/S1874490722001732) * [(2022) 用于入侵检测的强化学习：更长的模型寿命与更少的更新](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9893186) * [(2022) AutoDefense：基于强化学习的自动反应式防御对抗网络攻击](https://cs.ucf.edu/~mohaisen/doc/cns22.pdf) * [(2022) ProAPT：利用深度强化学习预测 APT 威胁](https://arxiv.org/pdf/2209.07215.pdf) * [(2022) 用于边缘云中 VSI-DDoS 检测的强化 Transformer 学习](https://ieeexplore.ieee.org/document/9878326/) * [(2022) H4rm0ny：用于逃避恶意软件生成和检测的多智能体学习的竞争性零和双人马尔可夫博弈](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9850345) * [(2022) 强化学习在硬件安全中的应用：机遇、发展与挑战](https://arxiv.org/pdf/2208.13885.pdf) * [(2022) Attrition：使用强化学习攻击静态硬件木马检测技术](https://arxiv.org/pdf/2208.12897.pdf) * [(2022) 深度强化学习在高级网络安全威胁检测与防护中的应用](https://link.springer.com/article/10.1007/s10796-022-10333-x) * [(2022) ReCEIF：强化学习控制的有效入口过滤](https://www.computer.org/csdl/proceedings-article/lcn/2022/09843478/1G9C5AMjieI) * [(2022) AutoCAT：用于自动化探索缓存时序攻击的强化学习](https://arxiv.org/pdf/2208.08025.pdf) * [(2022) GPDS：MEC 网络中抗干扰安全计算的多智能体深度强化学习博弈](https://www.sciencedirect.com/science/article/pii/S0957417422015044) * [(2022) 基于强化学习的对抗性恶意软件样本生成以应对黑盒检测器](https://www.sciencedirect.com/science/article/pii/S0167404822002632) * [(2022) SAC-AP：基于 Soft Actor Critic 的警报优先级排序深度强化学习](https://arxiv.org/pdf/2207.13666.pdf) * [(2022) 如何在 SD-IoV 中智能缓解 DDoS：一种移动目标防御方法](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9829332) * [(2022) ReLFA：在 SDN-IoT 中通过 Renyi 熵和深度强化学习抵抗链路泛洪攻击](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9837856) * 
[(2022) 基于人工智能的动态网络漏洞管理过程优化框架](https://arxiv.org/pdf/2208.02369.pdf) * [(2022) 基于多智能体深度强化学习的窃听博弈](https://ieeexplore.ieee.org/abstract/document/9833927) * [(2022) 基于动态奖励深度确定性策略梯度的隐藏攻击序列检测方法](https://www.hindawi.com/journals/scn/2022/1488344/) * [(2022) 通过强化学习与博弈论对抗 DoS 攻击的信息物理系统安全状态估计](https://www.mdpi.com/2076-0825/11/7/192) * [(2022) 通过网络安全仿真开发最优因果网络防御 Agent](https://www.researchgate.net/publication/361638424_Developing_Optimal_Causal_Cyber-Defence_Agents_via_Cyber_Security_Simulation) * [(2022) 使用决斗双重深度 Q 学习启用入侵检测系统](https://www.emerald.com/insight/content/doi/10.1108/DTS-05-2022-0016/full/pdf?title=enabling-intrusion-detection-systems-with-dueling-double-deep-italicqitalic-learning) * [(2022) 多智能体深度强化学习驱动的缓解网络攻击对电动汽车充电站的不利影响](https://arxiv.org/pdf/2207.07041.pdf) * [(2022) 基于深度强化学习的 XSS 对抗样本攻击](https://www.sciencedirect.com/science/article/pii/S0167404822002255) * [(2022) 分析网络安全中的多智能体强化学习与协同进化](https://dl.acm.org/doi/pdf/10.1145/3512290.3528844) * [(2022) AlphaSOC：基于强化学习的信息物理系统网络安全自动化](https://ieeexplore.ieee.org/abstract/document/9797597?casa_token=CLYC6uNfXhgAAAAA:t8ohceSJb-eI-NeyhUFtizY_786VsCnFfLDe_zAh33be__HI31foWepaXvIhQ4PCF69_s3Vm) * [(2022) 工业控制系统中的在线网络攻击检测：一种深度强化学习方法](https://www.hindawi.com/journals/mpe/2022/2280871/) * [(2022) 检测网络攻击：一种基于强化学习的入侵检测系统](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9813892) * [(2022) 使用深度强化学习与随机博弈稳健增强入侵检测系统](https://ieeexplore.ieee.org/abstract/document/9809923) * [(2022) irs-partition：利用深度 Q 网络和系统分区的入侵响应系统](https://www.sciencedirect.com/science/article/pii/S2352711022000796) * [(2022) 基于深度强化学习对抗云中侦察攻击的防御性欺骗框架](http://scis.scichina.com/en/2022/170305.pdf) * [(2022) 有本事就破解我：强化学习的模仿游戏](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9797367) * [(2022) 基于深度强化学习的 QoS 感知 SDN-IoT 安全路由](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8935210) * [(2022) 资源受限网络中使用不完全信息博弈的通用网络安全方案](https://link.springer.com/content/pdf/10.1007/s12065-021-00684-w.pdf) * [(2022) 
使用强化学习与攻击图进行渗透测试的分层参考模型](https://arxiv.org/pdf/2206.06934.pdf) * [(2022) 基于深度强化学习的低速 DDoS 攻击缓解的灵活 SDN 框架](https://www.sciencedirect.com/science/article/pii/S1084804522000960) * [(2022) 通过博弈与最优停止学习安全策略](https://arxiv.org/abs/2205.14694) * [(2022) 通过分布式深度强化学习方法针对 FDI 攻击的微电网系统弹性最优防御策略](https://ieeexplore.ieee.org/abstract/document/9783467) * [(2022) 孤立直流微电网中智能攻击的数据驱动型网络攻击检测](https://ieeexplore.ieee.org/abstract/document/9782082) * [(2022) 基于奖励随机化强化学习的多域网络空间攻防博弈](https://arxiv.org/pdf/2205.10990.pdf) * [(2022) 在基于图的攻击仿真中使用强化学习进行网络威胁响应](https://ieeexplore.ieee.org/abstract/document/9789835) * [(2022) 通过最优停止实现入侵预防](https://ieeexplore.ieee.org/document/9779345) * [(2022) 学习玩自适应网络欺骗博弈](https://optlearnmas22.github.io/files/paper10.pdf) * [(2022) 不完全信息下雷达抗干扰动态博弈的神经虚拟自我博弈](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9775208) * [(2022) 一种用于防御多场景负荷重分配攻击的强化学习方法](https://ieeexplore.ieee.org/abstract/document/9776523) * [(2022) 基于多智能体深度强化学习的 MIMO 系统中的主动窃听博弈](https://ieeexplore.ieee.org/abstract/document/977039) * [(2022) FEAR：分布式软件定义网络中具有深度 Q 网络的联邦网络攻击反应](https://ieeexplore.ieee.org/abstract/document/9768169) * [(2022) EvadeRL：利用深度强化学习逃避 PDF 恶意软件分类器](https://www.hindawi.com/journals/scn/2022/7218800/) * [(2022) Link：使用强化学习进行跨站脚本漏洞的黑盒检测](https://dl.acm.org/doi/pdf/10.1145/3485447.3512234) * [(2022) MERLIN - 使用强化学习的恶意软件逃避](https://arxiv.org/pdf/2203.12980.pdf) * [(2022) DeepAir：软件定义网络中用于自适应入侵响应的深度强化学习](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9732448) * [(2022) DroidRL： Android 恶意软件检测的强化学习驱动特征选择](https://arxiv.org/pdf/2203.02719.pdf) * [(2022) MAB-Malware：用于攻击静态恶意软件分类器的强化学习框架](https://arxiv.org/pdf/2003.03100.pdf) * [(2022) 行为多样的自动化渗透测试：一种好奇心驱动的多目标深度强化学习方法](https://arxiv.org/pdf/2202.10630.pdf) * [(2022) 无线安全中的安全探索：一种具有分层结构的安全强化学习算法](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9705557) * [(2022) 利用攻击图和强化学习发现数据泄露路径](https://arxiv.org/pdf/2201.12416.pdf) * [(2022) 用于储能系统对抗 DoS 
攻击的去中心化弹性二次控制的多智能体强化学习](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9676705) * [(2021) 基于随机博弈系统与深度循环 Q 网络的网络防御决策](https://www.sciencedirect.com/science/article/pii/S0167404821003047) * [(2021) 使用多目标强化学习环境发现反射型跨站脚本漏洞](https://www.sciencedirect.com/science/article/pii/S0167404821003679) * [(2021) 通过深度强化学习增强 NOP 指令的插入以混淆恶意软件](https://www.sciencedirect.com/science/article/pii/S0167404821003679) * [(2021) 使用深度强化学习实现后渗透阶段的自动化](https://www.sciencedirect.com/science/article/pii/S0167404820303813) * [(2021) 作为超越 5G 主动防御要素的移动目标防御](https://ieeexplore.ieee.org/document/9579381) * [(2021) 流行病攻击下的网络弹性：深度强化学习网络拓扑适应](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9686036) * [(2021) 一种基于强化学习的弹性应用入侵响应方法](https://ieeexplore.ieee.org/abstract/document/9659882) * [(2021) 强化学习辅助的动态蜜罐适应阈值优化以增强 IoBT 网络安全](https://ieeexplore.ieee.org/abstract/document/9660066) * [(2021) 基于强化学习的分层种子调度用于灰盒模糊测试](https://www.cs.ucr.edu/~heng/pubs/afl-hier.pdf) * [(2021) SquirRL：使用深度强化学习对区块链激励机制进行自动化攻击分析](https://www.ndss-symposium.org/wp-content/uploads/ndss2021_3C-4_24188_paper.pdf) * [(2021) 强化学习在计算机系统入侵检测问题中的应用](https://link.springer.com/chapter/10.1007/978-981-16-2380-6_66) * [(2021) 基于 FlipIt 模型和 Q-learning 方法的 APT 攻击主动检测定时策略](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9670619) * [(2021) 用于入侵检测的协作多智能体强化学习 ](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9660402) * [(2021) ATMoS+：使用置换等变和不变深度强化学习在 SDN 中实现可泛化的威胁缓解](http://rboutaba.cs.uwaterloo.ca/Papers/Journals/2021/TsangCommMag21.pdf) * [(2021) 基于随机博弈与深度强化学习的网络安全防御决策方法](https://downloads.hindawi.com/journals/scn/2021/2283786.pdf) * [(2021) 通过神经虚拟自我博弈求解大规模扩展式网络安全博弈](https://arxiv.org/abs/2106.00897) * [(2021) 一种应用于工业控制系统跨层防御机制的高效并行强化学习方法](https://ieeexplore.ieee.org/abstract/document/9650577) * [(2021) 使用多智能体强化学习的基于 SDN 的移动目标防御](https://www.researchgate.net/publication/349991931_SDN-based_Moving_Target_Defense_using_Multi-agent_Reinforcement_Learning) * [(2021) 
强化学习在工业控制网网络安全编排中的应用](https://arxiv.org/abs/2106.05332) * [(2021) 使用深度强化学习实现权限提升的自动化](https://arxiv.org/abs/2110.01362) * [(2021) SDN-IoT 中用于瞬时负载检测与预防的多智能体强化学习框架](https://www.mdpi.com/2227-7080/9/3/44) * [(2021) 使用带有攻击图的强化学习进行关键资产分析](https://arxiv.org/abs/2108.09358) * [(2021) 基于深度 Q 学习的强化学习方法在网络入侵检测中的应用](https://arxiv.org/abs/2111.13978) * [(2021) 航空计算网络中基于深度强化学习的入侵检测](https://ieeexplore.ieee.org/document/9520324) * [(2021) 深度强化学习在保护具有分布式控制平面的软件定义工业网络中的应用](https://ieeexplore.ieee.org/document/9618870) * [(2021) 通过深度强化学习实现的自主网络进攻策略](https://www.spiedigitallibrary.org/conference-proceedings-of-spie/11746/1174622/Autonomous-network-cyber-offence-strategy-through-deep-reinforcement-learning/10.1117/12.2585173.full?SSO=1) * [(2021) CyGIL：在仿真网络系统上训练自主 Agent 的网络健身房](https://arxiv.org/abs/2109.03331) * [(2021) 约束满足驱动的自主网络防御强化学习](https://arxiv.org/abs/2104.08994)
* [(2021) 好奇的 SDN 用于缓解网络攻击](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9742225) * [(2021) 学到了就来抓我：具备学习能力的 CPS 中的实时攻击检测与缓解](https://ieeexplore.ieee.org/document/9622383) * [(2021) SyzVegas：利用强化学习打破内核模糊测试的胜算](https://www.usenix.org/system/files/sec21-wang-daimeng.pdf) * [(2021) 面向自主网络防御的网络环境设计](https://arxiv.org/pdf/2103.07583.pdf) * [(2021) CybORG：用于开发自主网络安全 Agent 的健身房环境](https://arxiv.org/pdf/2108.09118.pdf) * [(2021) SQL 注入与强化学习：动作结构作用的实证评估](https://link.springer.com/chapter/10.1007/978-3-030-91625-1_6) * [(2021) 使用基于 MuZero 智能体走向 SDN 网络的自主防御](https://ieeexplore.ieee.org/abstract/document/9499101) * [(2021) 智能电网中防御高级持续性威胁：一种强化学习方法](https://ieeexplore.ieee.org/document/9549271) * [(2021) 用于自动化渗透测试的深度分层强化学习 Agent](https://arxiv.org/abs/2109.06449) * [(2021) 针对基于图的物联网僵尸网络检测方法的对抗性攻击与防御](https://ieeexplore.ieee.org/document/9514255) * [(2021) 使用非对称兵棋仿真与 Soar 强化学习及协进化算法模拟物流企业](https://dl.acm.org/doi/pdf/10.1145/3449726.3463172) * [(2021) 深度强化学习缓解信息物理 DER 电压不平衡攻击](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9482815) * [(2021) 通过强化学习实现监控中人机群体协作的混合主导平衡](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9594355) * [(2021) 基于近端策略的深度强化学习在群体机器人中的应用 ](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9499288) * [(2021) 使用深度强化学习规避 Web 应用防火墙](https://ieeexplore.ieee.org/document/9720473) * [(2021) 基于 Q-learning 方法的复杂网络序列节点攻击](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9401544) * [(2021) 通过最优停止学习入侵预防策略](https://arxiv.org/pdf/2106.07160.pdf) * [(2021) 在用于渗透测试的强化学习中使用网络地形](https://arxiv.org/abs/2108.07124) * [(2021) 基于强化学习的自适应移动目标防御对抗 DDoS 攻击](https://www.researchgate.net/publication/349576214_Reinforcement_learning_based_self-adaptive_moving_target_defense_against_DDoS_attacks) * [(2021) 针对工业医疗系统的建模、检测与缓解威胁：结合软件定义网络与强化学习的方法](https://ieeexplore.ieee.org/document/9470933) * [(2021) 无人机网络的轻量级 IDS：一种周期性深度强化学习方法](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9463947) * [(2021) 
DESOLATER：基于深度强化学习的资源分配与移动目标防御部署框架](https://ieeexplore.ieee.org/document/9418999) * [(2021) RAIDER：强化辅助的鱼叉网络钓鱼检测器](https://arxiv.org/abs/2105.07582) * [(2021) 基于时空流量规律的车联网 DDoS 缓解：一种特征自适应强化学习方法](https://ieeexplore.ieee.org/document/9408414) * [(2021) 云环境下 DoS 攻击下基于强化学习与稀疏约束的电力系统结构优化](https://www.sciencedirect.com/science/article/pii/S1569190X21000034) * [(2021) 基于半监督深度强化学习的网络异常流量检测模型](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9577211) * [(2021) 一种使用带有严重性分析器的 Q-Learning 的自适应蜜罐](https://www.researchgate.net/publication/350743085_An_adaptive_honeypot_using_Q-Learning_with_severity_analyzer) * [(2021) 基于博弈论 Actor-Critic 的无线 SDN 物联网网络入侵响应方案 (GTAC-IRS)](https://ieeexplore.ieee.org/document/9162048) * [(2021) 强化学习在用于检测高级持续性威胁的动态信息流跟踪博弈中的应用](https://arxiv.org/pdf/2007.00076.pdf) * [(2021) 深度强化学习用于抵御对手的备份策略](https://arxiv.org/pdf/2102.06632.pdf) * [(2021) 通过动态伪装实现的在攻击下未知动态系统的安全学习控制策略](https://arxiv.org/pdf/2102.00573.pdf) * [(2020) 特征欺骗问题中的学习与规划](https://arxiv.org/pdf/1905.04833.pdf) * [(2020) 机器学习网络攻击与防御策略](https://www.sciencedirect.com/science/article/pii/S0167404818309799) * [(2020) SDN 启用网络中用于攻击缓解的强化学习](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9165383) * [(2020) 通过直接控制强化学习实现基于主机的 DDoS 缓解](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8935157) * [(2020) 移动社交网络中基于博弈论与强化学习的安全边缘缓存](https://ieeexplore.ieee.org/document/9036917) * [(2020) 基于强化学习生成对抗样本的新型黑盒攻击](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9123270) * [(2020) 针对僵尸网络逃避攻击的深度强化对抗学习](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9226405) * [(2020) 深度强化学习用于自适应网络防御与攻击者模式识别](https://link.springer.com/book/10.1007/978-3-030-19353-9) * [(2020) 基于强化学习方法的 Flip 攻击检测](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9303818) * [(2020) FlipIt 中的强化学习](https://arxiv.org/pdf/2002.12909.pdf) * [(2020) 边缘计算中利用 DCNN Q-Learning 的 CPSS LR-DDoS 检测与防御](https://ieeexplore.ieee.org/document/9016201) * [(2020) 贝叶斯 Stackelberg 
马尔可夫博弈中用于自适应移动目标防御的多智能体强化学习](https://arxiv.org/abs/2007.10457) * [(2020) 基于强化学习的欺骗资源智能部署策略](https://ieeexplore.ieee.org/document/9001034) * [(2020) 防御高级持续性威胁：使用多阶段迷宫网络博弈的最优网络安全加固](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9219722) * [(2020) 通过强化学习实现针对信息物理系统的自动化对手仿真](https://arxiv.org/abs/2011.04635) * [(2020) DRL-FAS：一种基于深度强化学习的人脸反欺诈新框架](https://arxiv.org/abs/2009.07529) * [(2020) Q-Bully：一种基于强化学习的网络欺凌检测框架](https://ieeexplore.ieee.org/document/9154092) * [(2020) 基于强化学习的应用层 DDoS 防御](https://ieeexplore.ieee.org/document/9213026) * [(2020) DQ-MOTAG：基于深度强化学习对抗 DDoS 攻击的移动目标防御](https://ieeexplore.ieee.org/document/9172847) * [(2020) 面向信息物理系统安全的混合博弈论与强化学习方法](https://ieeexplore.ieee.org/document/9110453) * [(2020) 通过强化学习实现突破后的自动化渗透测试](https://ieeexplore.ieee.org/abstract/document/9162301) * [(2020) DeepBLOC：一种通过在随机博弈上进行深度强化学习来保护 CPS 的框架](https://ieeexplore.ieee.org/document/9162219) * [(2020) 深度强化学习用于 DER 网络攻击缓解](https://arxiv.org/abs/2009.13088) * [(2020) 使用基于学习的 POMDP 针对多阶段攻击的自适应网络防御](https://dl.acm.org/doi/abs/10.1145/3418897) * [(2020) 使用知识图谱与强化学习进行恶意软件分析](https://ebiquity.umbc.edu/_file_directory_/papers/1053.pdf) * [(2020) 自主安全分析与渗透测试](https://ieeexplore.ieee.org/document/939428) * [(2020) POMDP + 信息衰减：在自主渗透测试中纳入防御者行为](https://ojs.aaai.org/index.php/ICAPS/article/view/6666/6520) * [(2020) ATMoS：使用强化学习在 SDN 中进行自主威胁缓解](https://ieeexplore.ieee.org/document/9110426) * [(2020) 使用夺旗挑战赛利用强化学习进行渗透测试建模：无模型学习与先验知识之间的权衡](https://arxiv.org/pdf/2005.12632.pdf) * [(2020) 通过强化学习与自我博弈寻找有效的安全策略](https://arxiv.org/abs/2009.08120) * [(2020) AFRL：用于 FANET 中智能干扰防御的自适应联邦强化学习](https://ieeexplore.ieee.org/document/9143577) * [(2020) 强化学习实现高效网络渗透测试](https://www.mdpi.com/2078-2489/11/1/6) * [(2020) Agent Web 模型 -- 为强化学习建模 Web 黑客攻击](https://arxiv.org/abs/2009.11274) * [(2020) 使用监督学习的随机动态信息流跟踪博弈用于检测高级持续性威胁](https://arxiv.org/abs/2007.12327) * [(2020) 用于 VANET 的基于强化学习的 PHY 
认证](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8961122) * [(2020) 深度强化学习用于风力集成电力系统的网络安全评估](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9261465) * [(2020) 智能安全审计：具有深度神经网络逼近器的强化学习](https://ieeexplore.ieee.org/abstract/document/9139683) * [(2020) 高级持续性威胁的快速检测：一种半马尔可夫博弈方法](https://ieeexplore.ieee.org/document/9095996) * [(2020) 分布式强化学习用于具有多个远程状态估计的信息物理系统对抗 DoS 攻击者](https://ieeexplore.ieee.org/abstract/document/9174773) * [(2020) 5G 车联网中的安全众感：当深度强化学习遇上区块链](https://ieeexplore.ieee.org/document/9311241) * [(2020) 基于深度强化学习的云基础设施入侵检测系统](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9027452) * [(2020) 深度强化学习在有监督问题入侵检测中的应用](https://www.sciencedirect.com/science/article/pii/S0957417419306815) * [(2019) 一种基于 Q-learning 的博弈论方法以使非法智能合约失效](https://www.sciencedirect.com/science/article/pii/S0020025519304645) * [(2019) 基于模型的入侵响应中深度强化学习的性能评估](https://www.cse.msstate.edu/wp-content/uploads/2019/11/ic12.pdf) * [(2019) 深度 Q 学习与粒子群优化用于在线社交网络中的僵尸网络检测](https://ieeexplore.ieee.org/document/8944493) * [(2019) 在移动的干草堆中寻找针：使用对抗性强化学习确定警报的优先级](https://arxiv.org/abs/1906.08805) * [(2019) 基于强化学习的虚假数据注入攻击对自动电压控制评估](https://ieeexplore.ieee.org/document/8248780) * [(2019) 对抗阶段博弈中电网防御策略的学习研究](https://ieeexplore.ieee.org/document/8834202) * [(2019) 学习应对对抗性攻击](https://arxiv.org/abs/1906.12061) * [(2019) 通过深度强化学习在安全博弈中学习分布式协作策略](https://ieeexplore.ieee.org/abstract/document/8753973) * [(2019) 一种高效的基于强化学习的僵尸网络检测方法](http://nrl.northumbria.ac.uk/id/eprint/41349/1/JNCA_1.pdf) * [(2019) 面向主动、自适应和自主网络防御的战略性学习](https://arxiv.org/abs/1907.01396) * [(2019) QFlip：FlipIt 安全博弈的一种自适应强化学习策略](https://arxiv.org/abs/1906.11938) * [(2019) 使用深度强化学习解决网络警报分配马尔可夫博弈](https://link.springer.com/chapter/10.1007/978-3-030-32430-8_11) * [(2019) 通过半马尔可夫决策过程的强化学习实现自适应蜜罐交互](https://link.springer.com/chapter/10.1007/978-3-030-32430-8_13) * [(2019) 通过深度强化学习检测钓鱼网站](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8754075) * [(2019) 基于对抗性深度强化学习的自适应移动目标防御](https://arxiv.org/abs/1911.11972) * [(2019) 
使用强化学习的自主渗透测试](https://arxiv.org/abs/1905.05965) * [(2019) 智能电网安全中的多阶段博弈：一种强化学习解决方案](https://ieeexplore.ieee.org/document/8603817) * [(2019) 使用强化学习实现渗透测试自动化](https://stefann.eu/files/Automating%20Penetration%20Testing%20using%20Reinforcement%20Learning.pdf) * [(2019) 软件定义网络中基于强化学习的 DoS 缓解](https://www.springerprofessional.de/en/reinforcement-learning-based-dos-mitigation-in-software-defined-/17630266) * [(2019) 强化学习中的对抗性攻击与防御——从 AI 安全视角看](https://cybersecurity.springeropen.com/track/pdf/10.1186/s42400-019-0027-x.pdf) * [(2019) 信息物理电力系统中对抗性重复博弈的基于学习的解决方案](https://par.nsf.gov/servlets/purl/10280062) * [(2019) 强化学习用于电力系统的信息物理安全评估](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8810568) * [(2019) 在大型感知数据上赋能强化学习以进行入侵检测](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8761575) * [(2019) 基于深度强化学习的智能电网网络攻击恢复策略](https://ieeexplore.ieee.org/document/8915727) * [(2019) 深度强化学习用于众感系统中的部分可观测数据投毒攻击](https://ieeexplore.ieee.org/document/8945245) * [(2019) 使用强化学习在分布式 CSOC 之间实现平衡最佳性能的自适应警报管理](https://ieeexplore.ieee.org/document/8762232) * [(2018) 使用 Q-Learning 强化学习 Agent 模拟 SQL 注入漏洞利用](https://arxiv.org/abs/2101.03118) * [(2018) 移动边缘缓存中基于强化学习的安全](https://ieeexplore.ieee.org/document/8403961) * [(2018) 使用基于强化学习的动态进化神经网络检测在线钓鱼邮件](https://www.sciencedirect.com/science/article/pii/S0167923618300010) * [(2018) 基于强化学习的攻击图分析方法](https://researchonline.gcu.ac.uk/ws/portalfiles/portal/26084628/H.Tianfield_attack_graph.pdf) * [(2018) 软件定义网络中用于自主防御的强化学习](https://arxiv.org/abs/1808.05770) * [(2018) 通过强化学习学习逃避静态 PE 机器学习恶意软件模型](https://arxiv.org/abs/1801.08917) * [(2018) 使用风险状态与强化学习的自主计算机网络防御](https://www.ccdcoe.org/uploads/2018/10/17_BEAUDOIN-Autonomic-Computer-Network-Defence.pdf) * [(2018) 用于智能渗透测试的强化学习](https://ieeexplore.ieee.org/document/8611595) * [(2018) 自主智能网络安全防御 Agent (AICA) 参考架构](https://arxiv.org/abs/1803.10664) * [(2018) 基于深度强化学习的信息物理系统在未知网络攻击下的最优防御](https://ieeexplore.ieee.org/document/8285298) * [(2018) 网络攻击下自主系统中观察者设计的对抗性强化学习](https://arxiv.org/abs/1809.06784) * 
[(2018) 用于自主网络防御的机器学习](https://www.nsa.gov/portals/75/documents/resources/everyone/digital-media-center/publications/the-next-wave/TNW-22-1.pdf) * [(2018) 智能电网中的在线网络攻击检测：一种强化学习方法](https://arxiv.org/abs/1809.05258) * [(2018) 软件定义网络中基于深度强化学习的智能 DDoS 泛洪缓解](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8514971) * [(2018) 用于网络安全中入侵响应的异策略 Q-learning 技术](https://www.semanticscholar.org/paper/Off-Policy-Q-learning-Technique-for-Intrusion-in-Stefanova-Ramachandran/737667620f7696ad2089808eb810f8a95ee2a1e3#extracted) * [(2018) 面向智能干扰的车联网中的 UAV 中继与强化学习](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8246580) * [(2018) 基于多智能体强化学习的关键基础设施网络安全博弈论方法](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8442695) * [(2018) 移动边缘缓存中基于强化学习的安全](https://arxiv.org/pdf/1801.05915.pdf) * [(2018) 机器人 CTF (RCTF)，一个用于机器人黑客攻击的游乐场](https://arxiv.org/abs/1810.02690) * [(2018) NIDSRL：使用强化学习的基于网络的入侵检测系统 ](https://www.researchtrend.net/ijeece/pdf/IJEECE-1093-DARSHANA%20KAMAVISDAR.pdf) * [(2018) 用于信息物理攻击意图预测与恢复的 IRL 方法](https://ieeexplore.ieee.org/document/8430922) * [(2018) QRASSH - 由 Q-Learning 驱动的自适应 SSH 蜜罐](https://ieeexplore.ieee.org/document/8430922) * [(2018) 使用强化学习隐藏蜜罐功能](https://www.semanticscholar.org/paper/Using-Reinforcement-Learning-to-Conceal-Honeypot-Dowling-Schukat/a081d7606d18dc6e30a7b0395faf7909e84c721c) * [(2018) 通过高效强化学习参数应对自动化恶意软件以改进自适应蜜罐功能](https://www.researchgate.net/publication/326494108_Improving_adaptive_honeypot_functionality_with_efficient_reinforcement_learning_parameters_for_automated_malware) * [(2018) 通过强化学习增强基于机器学习的恶意软件检测模型](https://dl.acm.org/doi/abs/10.1145/3290480.3290494) * [(2017) 基于强化学习与帕累托优化的网络防御策略选择](https://pdfs.semanticscholar.org/4f3c/53bba5acfa7507c4c487c71eaf74771dc382.pdf) * [(2017) 网络安全仿真中的对抗性强化学习](https://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/CyberSec_ICAART.pdf) * [(2017) 在资源受限环境中使用强化学习检测隐蔽的僵尸网络](https://dl.acm.org/doi/abs/10.1145/3140549.3140552) * [(2017) 基于 Q-learning 
的智能电网针对序列拓扑攻击的脆弱性分析](https://ieeexplore.ieee.org/ielaam/10206/7726079/7563294-aam.pdf) * [(2017) 基于多智能体强化学习的认知抗干扰](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7925694) * [(2017) 基于强化学习的移动卸载用于基于云的恶意软件检测](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8254503) * [(2017) 一种基于深度强化学习的安全移动众感博弈](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8006228) * [(2017) 贝叶斯攻击图上自适应网络防御的在线算法](http://php.scripts.psu.edu/muz16/pdf/ZH-ea-MTD17.pdf) * [(2016) 马尔可夫安全博弈：空间安全问题中的学习](https://www.google.com/search?q=Markov+Security+Games%3A+Learning+in+Spatial+Security+Problems&oq=Markov+Security+Games%3A+Learning+in+Spatial+Security+Problems&aqs=chrome..69i57.2463j0j7&sourceid=chrome&ie=UTF-8) * [(2016) 使用强化学习实现网络安全分析师的动态调度以最小化风险](https://dl.acm.org/doi/pdf/10.1145/2882969) * [(2016) 在动态威胁环境中平衡安全性与性能以实现敏捷性](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7579776) * [(2016) 基于强化学习的宽带自主认知无线电抗干扰](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7636793) * [(2016) 无线网络中利用强化学习的物理层欺骗检测](https://ieeexplore.ieee.org/document/7398138) * [(2015) 强化学习在增强认知无线电网络安全性中的应用](https://reader.elsevier.com/reader/sd/pii/S156849461500589X?token=F6600716BEC8310CAFDEA5B8B8FFC78C469E6D0EA7E2EC3A8E3A3CCA0A35E2411C618F4EBEF6E833959BDA8C0464DF5D&originRegion=eu-west-1&originCreation=20211214113944) * [(2015) 合作认知无线电网络中基于强化学习的抗干扰功率控制](https://link.springer.com/article/10.1007/s11227-015-1420-1) * [(2015) 带有学习的博弈论用于网络安全监控](https://assured-cloud-computing.illinois.edu/files/2014/03/Game-Theory-with-Learning-for-Cyber-Security-Monitoring.pdf) * [(2015) 无线网络中利用强化学习的欺骗检测](https://ieeexplore.ieee.org/document/7417078) * [(2015) 基于学习的恶意软件检测移动云卸载](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7179384) * [(2014) 用于对抗心脏出血的自适应网络防御强化学习算法](https://dl.acm.org/doi/10.1145/2663474.2663481) * [(2014) 使用模糊 Q-learning 的合作博弈论方法用于检测和预防无线传感器网络中的入侵](https://wsc9.softcomputing.net/eaai2014.pdf) * [(2014) Q-Learning：从计算机网络安全到软件安全](https://ieeexplore.ieee.org/document/7033124) * 
[(2013) 多智能体路由节流：针对 DDoS 攻击的去中心化协同响应](https://www.aaai.org/ocs/index.php/IAAI/IAAI13/paper/download/6244/6434) * [(2013) 随机博弈中的混合学习及其在网络安全中的应用](https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.471.2032&rep=rep1&type=pdf) * [(2013) 竞争性移动网络博弈：利用强化学习拥抱抗干扰与干扰策略](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6682689) * [(2012) 使用日志文件与强化学习的入侵检测系统](http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=707AB2AFCC69B73C2E1C6A832304001C?doi=10.1.1.677.8112&rep=rep1&type=pdf) * [(2012) 认知无线电网络中使用强化学习算法的抗干扰](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=6331885) * [(2011) 认知无线电网络中信道接入的一种抗干扰策略](https://www.springerprofessional.de/en/an-anti-jamming-strategy-for-channel-access-in-cognitive-radio-n/3778338) * [(2011) 分布式战略学习及其在网络安全中的应用](https://ieeexplore.ieee.org/document/5991373) * [(2010) 动态基于策略的 IDS 配置](https://ieeexplore.ieee.org/document/5399894) * [(2008) 强化学习在 Peer-to-Peer 网络漏洞评估中的应用](http://web.engr.oregonstate.edu/~afern/papers/iaai08.pdf) * [(2007) 使用隐马尔可夫模型和协同强化学习防御 DDoS 攻击](https://www.researchgate.net/publication/221451383_Defending_DDoS_Attacks_Using_Hidden_Markov_Models_and_Cooperative_Reinforcement_Learning) * [(2006) 一种有限观测的入侵检测博弈](http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.419.3971) * [(2005) 一种使用系统调用序列进行基于主机的入侵检测的强化学习方法](https://link.springer.com/chapter/10.1007/11538059_103) * [(2005) 用于入侵检测的多智能体强化学习](https://dl.acm.org/doi/10.5555/1898681.1898696) * [(2000) 下一代入侵检测：网络攻击的自主强化学习](https://neuro.bstu.by/ai/To-dom/My_research/Papers-0/For-research/D-mining/Anomaly-D/Intrusion-detection/033.pdf) ### 博士论文 * [(2024) IT 系统中针对网络入侵的最优安全响应](https://kth.diva-portal.org/smash/record.jsf?pid=diva2%3A1912164&dswid=7946) * [(2023) 使用带有在线强化学习的诱饵检测复杂网络攻击](https://scholarworks.utep.edu/open_etd/3878/) * [(2022) 竞争性多人游戏中的异常检测](https://open.bu.edu/handle/2144/45316) * [(2022) 安全自动化与自主系统](https://escholarship.org/content/qt7vt506c3/qt7vt506c3.pdf) * [(2014) 用于网络入侵响应的分布式强化学习](https://etheses.whiterose.ac.uk/8109/1/phd-thesis-malialis.pdf) 
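* [(2009) Multi-Agent Reinforcement Learning for Intrusion Detection](https://etheses.whiterose.ac.uk/690/)

### Master Theses

* [(2024) A Realistic Penetration Testing Training Framework for Reinforcement Learning Agents](https://dspace.jaist.ac.jp/dspace/bitstream/10119/19357/5/paper.pdf)
* [(2024) Application of Machine Learning Algorithms to Intrusion Detection Systems](http://hdl.handle.net/11110/2898)
* [(2024) MetaNet: A Meta-Learning Model for Automated Penetration Testing of Networked Systems](https://www.diva-portal.org/smash/get/diva2:1871151/FULLTEXT01.pdf)
* [(2024) Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence](https://espace.rmc-cmr.ca/jspui/bitstream/11264/1843/1/Thesis_Faizan_Learning_to_Communicate.pdf)
* [(2023) RESONANT: Reinforcement Learning Based Moving Target Defense for Credit Card Fraud Detection](https://vtechworks.lib.vt.edu/server/api/core/bitstreams/4b38d0f3-a6d5-4083-b7c4-9a688e857336/content)
* [(2023) Competitive Reinforcement Learning for Autonomous Cyber Operations](https://espace.rmc.ca/jspui/bitstream/11264/1227/1/Competitive_RL_for_ACO.pdf)
* [(2023) Learning Cyber Defence Tactics from Scratch with Cooperative Multi-Agent Reinforcement Learning](https://espace.rmc.ca/jspui/bitstream/11264/1082/1/Thesis_Final_Wiebe.pdf)
* [(2022) Automating the Exploitation of SQL Injection Vulnerabilities with Reinforcement Learning](https://www.duo.uio.no/handle/10852/100271)
* [(2022) Self-Play Reinforcement Learning for Finding Intrusion Prevention Strategies](https://limmen.dev/assets/papers/Master_Thesis_Jakob_Stymne_Final_2_June.pdf)
* [(2022) Reinforcement Learning-Aided Dynamic Analysis of Evasive Malware](https://webthesis.biblio.polito.it/22588/1/tesi.pdf)
* [(2021) Reinforcement Learning for Intrusion Detection](https://era.library.ualberta.ca/items/9ad1c6b8-143e-45d3-921d-c4e754c59328/view/440db5ae-1cb6-4643-9868-871e82b59615/Yang_Bin_202109_MSc.pdf)
* [(2021) Bayesian Reinforcement Learning Methods for Network Intrusion Prevention](https://limmen.dev/assets/papers/Antonio_Nesti_Lopes_2021_Master_Thesis.pdf)
* [(2019) Learning to Hack](https://stefann.eu/files/RL%20vs%20GA%20in%20GT%20Cybersec%20Thesis.pdf)
* [(2018) Autonomous Penetration Testing Using Reinforcement Learning](https://arxiv.org/pdf/1905.05965.pdf)
* [(2018) Analysis of Network Intrusion Detection System with Machine Learning Algorithms (Deep Reinforcement Learning Algorithm)](http://www.diva-portal.org/smash/get/diva2:1255686/FULLTEXT02.pdf)

### Bachelor Theses

* [(2022) Simulating Network Lateral Movements through the CyberBattleSim Web Platform](https://dspace.mit.edu/bitstream/handle/1721.1/143191/Esteban-jesteban-meng-eecs-2022-thesis.pdf?sequence=1&isAllowed=y)
* [(2018) Autonomous Penetration Testing Using Reinforcement Learning](https://arxiv.org/pdf/1905.05965.pdf)

### Posters

* [(2023) Poster: Defending the Unknown: Exploring Reinforcement Learning Agents' Deployment in Realistic, Unseen Networks](https://ceur-ws.org/Vol-3652/paper2.pdf)
* [(2023) Poster: Generating Experiences for Autonomous Network Defense](https://dl.acm.org/doi/abs/10.1145/3576915.3624381)
* [(2023) Learning Near-Optimal Intrusion Responses Against Dynamic Attackers](https://limmen.dev/assets/papers/CDIS_conf_23_Hammar_Stadler_poster_12_may.pdf)
* [(2022) Autonomous Network Defence Using Reinforcement Learning](https://dl.acm.org/doi/abs/10.1145/3488932.3527286)
* [(2022) Intrusion Prevention through Optimal Stopping](https://limmen.dev/assets/papers/CDIS_Conference_Poster_24_May_Hammar_Stadler.pdf)
* [(2022) Intrusion Prevention through Optimal Stopping](https://limmen.dev/assets/papers/ML_Day_KTH_Poster_17_Jan_2022_Hammar_Stadler.pdf)
* [(2021) Learning Intrusion Prevention Policies through Optimal Stopping](https://limmen.dev/assets/papers/poster_dlrl_21_optimal_stopping_KimHammar_jul_21.pdf)
* [(2021) RELACCS: Reinforcement Learning for Cybersecurity](https://www.osti.gov/servlets/purl/1882079)

## [↑](#table-of-contents) Books

* [(2021) Game Theory and Machine Learning for Cyber Security (Chapter 5 covers RL)](https://www.wiley.com/en-sg/Game+Theory+and+Machine+Learning+for+Cyber+Security-p-9781119723929)
* [(2019) Reinforcement Learning for Cyber-Physical Systems with Cybersecurity Case Studies](https://www.bkstr.com/rivierstore/product/reinforcement-learning-for-cyber-physical-systems-494947-1)
* [(2010) Network Security: A Decision and Game-Theoretic Approach](https://www.amazon.com/Network-Security-Decision-Game-Theoretic-Approach/dp/0521119324)

## [↑](#table-of-contents) Blog posts

* [(2024) How Can Reinforcement Learning Help Protect Against Cyber Attacks?](https://www.linkedin.com/advice/1/how-can-reinforcement-learning-help-protect-tkdxe)
* [(2023) Beyond GenAI: The Rise of Autonomous Cyber Defense Agents](https://www.cybersecuritypulse.net/p/beyond-genai-the-rise-of-autonomous)
* [(2021) Gamifying Machine Learning for Stronger Security and AI Models](https://www.microsoft.com/security/blog/2021/04/08/gamifying-machine-learning-for-stronger-security-and-ai-models/)
* [(2021) Automating Cyber Security with Reinforcement Learning](https://winder.ai/automating-cyber-security-with-reinforcement-learning/)
* [(2021) Towards a Method for Computing Effective Intrusion Prevention Policies Using Reinforcement Learning](https://limmen.dev/towards-a-method-for-computing-effective-intrusion-prevention-policies-using-rl)

## [↑](#table-of-contents) Talks

* [(2024) Attacking Reinforcement Learning with Adversarial Policies – by Wong Wai Tuck](https://www.youtube.com/watch?v=jKvuS1LirYk)
* [(2024) CYBRAL: Leveraging Advanced AI for Automated Cybersecurity](https://www.youtube.com/watch?v=N0aVB1Wrrck)
* [(2024) From Cyber Situational Awareness to Adaptive Cyber Defense](https://www.youtube.com/watch?v=T9bmqccjfkg)
* [(2024) Operational Cybersecurity for DER with Optimization, Control Theory, and Machine Learning](https://www.youtube.com/watch?v=3nF6lk_bSuc)
* [(2024) Automated Security Response through Online Learning with Adaptive Conjectures](https://www.youtube.com/watch?v=K2JnC6z72fI&t=795s)
* [(2024) CSLE v0.5](https://www.youtube.com/watch?v=l_g3sRJwwhc)
* [(2024) Machine Learning for Cyber Defense: A Network Security and Endpoint Security Perspective](https://www.youtube.com/watch?v=UC-oE2-sP_0)
* [(2023) Multi-Agent Reinforcement Learning for Maritime Operational Technology Cyber Security](https://www.youtube.com/watch?v=kWIKEdIzXNY)
* [(2023) Learning Automated Intrusion Response](https://www.youtube.com/watch?v=_Y_1I_BEb58)
* [(2023) CSLE v0.2](https://www.youtube.com/watch?v=iE2KPmtIs2A&t=3s)
* [(2023) Enhancing Cyber Defence Capabilities](https://www.youtube.com/watch?v=kPAEhLypD3A)
* [(2023) Learning Near-Optimal Intrusion Responses for Large-Scale IT Infrastructures via Decomposition](https://www.youtube.com/watch?v=LDCgOygjn3k&)
* [(2023) Reinforcement Learning for Autonomous Cyber Defense](https://www.youtube.com/watch?v=PcqeG5_A4tE)
* [(2023) Digital Twins and Reinforcement Learning for Security Automation](https://www.youtube.com/watch?v=Gi-_KSNYVCk&)
* [(2023) Automating Digital Crime Investigations Using Reinforcement Learning (RL)](https://www.youtube.com/watch?v=lB7aZnT8VIw)
* [(2023) Applying Multi-Agent Reinforcement Learning (MARL) in a Cyber Wargame Engine](https://www.youtube.com/watch?v=FQYub1u5-GE)
* [(2023) Intrusion Response through Optimal Stopping](https://www.youtube.com/watch?v=Qzp_wiNW91o)
* [(2022) Exploring Autonomous Cyber Defense with Explainable Reinforcement Learning (CAMLIS 2022)](https://www.youtube.com/watch?v=i59PtruGd1o)
* [(2022) The Journey Towards a Self-Driving SOC](https://www.youtube.com/watch?v=lW3-N_ZqvRQ)
* [(2022) Deep Reinforcement Learning for Cybersecurity](https://www.youtube.com/watch?v=FPDRkx8ocng)
* [(2022) CNSM 2022, Adapting Security Policies in Dynamic IT Environments - Hammar & Stadler](https://www.youtube.com/watch?v=r1FD2-b-25g&)
* [(2022) Self-Learning Systems for Cyber Defense](https://youtu.be/tpal1DoNBy8)
* [(2022) Poster: Research on Intelligent Cyber Range Simulation Using Reinforcement Learning](https://www.youtube.com/watch?v=-BSlO98lqMM)
* [(2022) Paper Study - A Survey of Reinforcement Learning for Cybersecurity](https://www.youtube.com/watch?v=gyLyR9yM7QU)
* [(2022) The Role of AI in Cyber Defence | AI and Cybersecurity | Vincent Lenders](https://www.youtube.com/watch?v=GpvpSBaif6M)
* [(2022) Learning Security Strategies through Game Play and Optimal Stopping - Hammar & Stadler](https://www.youtube.com/watch?v=Qz6huGXjhec)
* [(2022) Reinforcement Learning for Complex Security Games and Beyond](https://www.youtube.com/watch?v=fVLwKRLDYSg)
* [(2022) NOMS22 Demo - A System for Interactive Examination of Learned Security Policies - Hammar & Stadler](https://www.youtube.com/watch?v=18P7MjPKNDg)
* [(2022) Reinforcement Learning Applications: Cyber Security](https://www.youtube.com/watch?v=-V2BJ96zWno)
* [(2021) Artificial Intelligence Applications to Cybersecurity (AI ATAC) Prize Challenges I&II](https://underline.io/lecture/14589-artificial-intelligence-applications-to-cybersecurity-(ai-atac)-prize-challenges-iandii)
* [(2021) NordSec 2021 - SQL Injections and Reinforcement Learning](https://www.youtube.com/watch?v=pb9Z2rjJaWo)
* [(2021) Deep Hierarchical Reinforcement Agents for Automated Penetration Testing](https://www.youtube.com/watch?v=i_Qu9uF50AI)
* [(2021) USENIX Security '21 - SyzVegas: Beating Kernel Fuzzing Odds with Reinforcement Learning](https://www.youtube.com/watch?v=72Ngu3305TU)
* [(2021) CyGIL: A Cyber Gym for Training Autonomous Agents over Emulated Network Systems](https://www.youtube.com/watch?v=JzZHjPGjoWg&t=675s)
* [(2021) Simulating a Logistics Enterprise Using an Asymmetrical Wargame Simulation with Soar Reinforcement Learning and Coevolutionary Algorithms](https://www.youtube.com/watch?v=H8qDeIsZQk8)
* [(2021) Incorporating Deception into CyberBattleSim for Autonomous Defense](https://www.youtube.com/watch?v=mDY1OH7x4ZI)
* [(2021) CybORG: A Gym for the Development of Autonomous Cyber Agents](https://www.youtube.com/watch?v=a9EhsiB3XhA)
* [(2021) Defending the Cyber Front Lines with AI - CyCon 2021](https://www.youtube.com/watch?v=9t6v_EDs74I)
* [(2021) Informing Autonomous Deception Systems with Cyber Expert Performance Data](https://www.youtube.com/watch?v=kTwn_ADKVgg&t=2s)
* [(2021) ACD 2021 Keynote - Prof. George Cybenko - Attrition Warfare in Adaptive Cyber Defense](https://www.youtube.com/watch?v=OYp2Drgr-RM&t=789s)
* [(2021) Reinforcement Learning Approaches to Intrusion Detection](https://www.youtube.com/watch?v=AsfQAdraFhw)
* [(2021) Applying Deep Reinforcement Learning (DRL) in a Cyber Wargaming Engine](https://scp.cc.gatech.edu/2021/03/19/applying-deep-reinforcement-learning-drl-in-a-cyber-wargaming-engine/)
* [(2021) Automated Penetration Testing Using Reinforcement Learning](https://www.youtube.com/watch?v=Ys3vo1oHdOU)
* [(2021) Training an Autonomous Penetration Testing Tool with Deep Reinforcement Learning](https://www.youtube.com/watch?v=EiI69BdWKPs&t=1754s)
* [(2021) Learning Intrusion Prevention Policies through Optimal Stopping](https://www.youtube.com/watch?v=_zL4qR5-jU8&t=286s)
* [(2020) Finding Effective Security Strategies through Reinforcement Learning and Self-Play](https://www.youtube.com/watch?v=9ihiIPVRB58)
* [(2020) Autonomous Security Analysis and Penetration Testing (ASAP) - Ankur Chowdhary](https://www.youtube.com/watch?v=6EyOPqLm2jg)
* [(2020) Autonomous Security Analysis and Penetration Testing: A Reinforcement Learning Approach](https://www.youtube.com/watch?v=TISD4VCT5UE)
* [(2020) Autonomous Penetration Testing Based on Artificial Intelligence](https://www.youtube.com/watch?v=NMcrTOtpcXI)
* [(2019) Cost-Effective Malware Detection Using Deep Reinforcement Learning](https://www.youtube.com/watch?v=gVY1M2NikaM)
* [(2019) Trying to Make Meterpreter into an Adversarial Example](https://www.youtube.com/watch?v=eYAZ3BTUq6c)
* [(2019) A Reinforcement Learning Framework for Smart, Secure, and Efficient Cyber-Physical Autonomy](https://www.youtube.com/watch?v=fdpW8hMxvEw&t=454s)
* [(2019) Adaptive Honeypot Engagement through Reinforcement Learning of Semi-Markov Decision Processes](https://www.youtube.com/watch?v=GPKT3uJtXqk)
* [(2018) Autonomous Cyber Defense: AI and the Immune System Approach](https://www.youtube.com/watch?v=Wa6WHJfakbA)
* [(2018) Bonware to the Rescue: The Future Autonomous Cyber Defense Agents | Dr. Alexander Kott | CAMLIS 2018](https://www.youtube.com/watch?v=W9iYTO9vEbA)
* [(2018) CSIAC Webinar - Learning to Win: Making the Case for Autonomous Cyber Security Solutions](https://www.youtube.com/watch?v=K-ma_ZVzqec)

## [↑](#table-of-contents) Miscellaneous

* [(2024) IEEE COMSOC Special Interest Group on Artificial Intelligence and Machine Learning in Security](https://cn.committees.comsoc.org/special-interest-groups-sigs/cognitive-network-security-sig/)
* [(2024) GameSec '24: The 15th Conference on Decision and Game Theory for Security (GameSec-24)](https://www.gamesec-conf.org/)
* [(2024) Autonomous Resilient Cyber Defence (ARCD)](https://www.qinetiq.com/en/what-we-do/services-and-products/autonomous-resilient-cyber-defence)
* [(2023) Proceedings of the Conference on Applied Machine Learning in Information Security (CAMLIS 2023)](https://dblp.org/db/conf/camlis/camlis2023.html)
* [(2023) Proceedings of the 2nd Workshop on Adaptive Cyber Defense](https://arxiv.org/html/2308.09520)
* [(2023) GameSec '23: The 14th Conference on Decision and Game Theory for Security (GameSec-23)](https://www.gamesec-conf.org/) [Proceedings](https://link.springer.com/book/10.1007/978-3-031-50670-3)
* [(2023) AISec '23: 16th ACM Workshop on Artificial Intelligence and Security](https://aisec.cc/)
* [(2022) Proceedings of the Conference on Applied Machine Learning in Information Security (CAMLIS 2022)](https://dblp.org/db/conf/camlis/camlis2022.html)
* [(2022) AI for Cyber Defence Mailing List](https://groups.google.com/g/ai-for-cyberdefence)
* [(2022) AI for Cyber Defence (AICD) Research Centre - Alan Turing Institute](https://www.turing.ac.uk/aicd)
* [(2022) DARPA's CASTLE: Cyber Agents for Security Testing and Learning Environments Using Reinforcement Learning](https://www.darpa.mil/news-events/2022-10-24)
* [(2022) AISec '22: 15th ACM Workshop on Artificial Intelligence and Security](https://dl.acm.org/doi/abs/10.1145/3548606.3563683)
* [(2022) ICML Workshop on Machine Learning for Cybersecurity](https://sites.google.com/view/icml-ml4cyber/home?authuser=0)
* [(2022) AAAI Workshop on Artificial Intelligence for Cyber Security (AICS)](http://aics.site/AICS2022/)
* [(2022) ECMLPKDD Workshop on Machine Learning for Cybersecurity (MLCS)](https://mlcs.lasige.di.fc.ul.pt/)
* [(2021) IJCAI 1st International Workshop on Adaptive Cyber Defense](https://arxiv.org/html/2108.08476)
* [(2021) ICONIP Workshop on Artificial Intelligence for Cyber Security (AICS)](https://www.csmining.org/cdmc2021/index.php?id=16)
* [(2021) ECMLPKDD Workshop on Machine Learning for Cybersecurity (MLCS)](https://mlcs.lasige.di.fc.ul.pt/2021/index.html)
* [(2021) Self-Learning AI](https://www.darktrace.com/en/self-learning-ai/)
* [(2021) AI/ML for Cybersecurity: Challenges, Solutions, and Novel Ideas (SIAM Data Mining 2021)](https://arxiv.org/html/2104.13254)
* [(2020) ECMLPKDD Workshop on Machine Learning for Cybersecurity (MLCS)](https://mlcs.lasige.di.fc.ul.pt/2020/)
* [(2020) Self-Learning Systems for Cyber Defense](https://www.kth.se/cdis/research/self-learning-systems-for-cyber-defense-1.1050591)
* [(2020) Workshop on Artificial Intelligence for Cyber Security (AICS)](https://arxiv.org/html/2002.08320)
* [(2019) ECMLPKDD Workshop on Machine Learning for Cybersecurity (MLCS)](https://mlcs.lasige.di.fc.ul.pt/2019/)
* [(2019) Workshop on Artificial Intelligence for Cyber Security (AICS)](https://arxiv.org/html/1812.07469)

## Contribute

Contributions are very welcome. Please use GitHub issues and pull requests.

### List of Contributors

Thanks for all your contributions and for keeping this project up to date.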

