Kenny-JT/idea-stress-test

GitHub: Kenny-JT/idea-stress-test

An agent skill for Claude Code / OpenCode that stress-tests new ideas through an adversarial critique loop. Its core is a filter layer that separates real risks from over-worry.

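# Idea Stress Test

This is an agent skill that stress-tests a **new idea or project direction** through an adversarial critique loop before you record it or start building it. It does one thing an ordinary "devil's advocate" prompt does not: it **filters** the criticism, so you neither make a bad decision because you only heard praise nor drown in paranoid noise.

## How it works

```
You float an idea
  -> a Critic assumes it will fail and attacks it across 8 angles
  -> the Assistant FILTERS objections into: real-risk / over-worry / your-call
  -> you answer targeted questions
  -> re-judge; loop the critic if needed
  -> until the idea is mature
  -> record it, or build it
```

The **filter layer** in the middle is the core. An off-the-shelf "devil's advocate" prompt simply dumps every objection on you. Here, the assistant sorts the critic's attacks into:

- **Real risk**: a load-bearing assumption, high probability × high impact, or a one-way door (an irreversible action), and one that applies to *your* specific situation.
- **Over-worry**: low probability, reversible, premature, or not yet relevant at your scale. These are flagged *with the reason*, so you can choose to ignore them.
- **Your call**: depends on your intent, risk tolerance, or priorities.

It also scales its intensity to the stakes (a 20-minute reversible experiment does not need three rounds of the gauntlet) and caps the number of loops so it cannot spin forever.

## Requirements

**A critic.** The loop needs an adversary. In order of preference:

1. **A strong-reasoning critic subagent** (best). If your platform can spawn a sub-task / consultant agent on a frontier model, the skill delegates the critique to it; a separate context produces harsher, cleaner criticism. **In OpenCode (with OhMyOpenCode), use the `oracle` agent** as the critic (optionally supported by `explore` + `librarian`); in Claude Code, spawn a Task subagent on a frontier model.
2. **Inline fallback** (works everywhere). No subagent? The assistant runs the critique itself in an explicit adversarial phase, then switches back to filter mode. The skill already includes this instruction, so no extra setup is needed.

**Optional, for grounding the critique in reality:**

- Codebase search (does something similar already exist?)
- Web/docs research (real-world failures of similar ideas)

Both are optional; the skill degrades gracefully without them.

## Installation

This is a standard single-file agent skill (one directory + `SKILL.md`).

- **Claude Code / Claude.ai**: put the `idea-stress-test/` folder in your skills directory (e.g. `.claude/skills/`).
- **OpenCode**: put it in `~/.config/opencode/skills/` (or your project's `.opencode/skills/`).

The folder name **must** match the `name` field in `SKILL.md` (`idea-stress-test`). If your tool loads skills at startup, restart it or open a new session.

## Usage

- **Automatic**: the skill is invoked automatically when you float a genuinely new idea or direction: "I have an idea...", "thinking about building X", "is this sound?", "stress-test this", "red-team it".
- **Manual**: `/idea-stress-test`.
- For low-stakes or reversible ideas, it offers a quick check instead of the full gauntlet.

## Example

It runs the critic, then shows you the **filtered** result rather than the raw dump:

```
[Maturity] Early — one load-bearing assumption is unverified, and there's a one-way-door risk.

[Real risks] (severity-sorted)
1. Auto-SENDING on your behalf is a one-way door.
   - why it matters: a wrong auto-reply can't be unsent; one bad email to a client is high-impact and irreversible.
   - what you need to answer: does it send automatically, or draft and wait for your approval?
2. "The replies sound like me" is the assumption the whole value rests on.
   - why it matters: if they read as robotic, people trust your emails less — the feature backfires.
   - what you need to answer: OK with draft-only until you've validated tone on ~50 real threads?

[I think these are over-worry] (push back if you disagree)
- "It won't scale to millions of users" — you're building it for yourself; scale isn't a real constraint.
- "Gmail could change their DOM and break it" — true, but cheap to fix later; not a reason to not start.

[Targeted questions]
1. Auto-send, or draft-only?
2. OK to gate it behind manual approval until the tone is proven?
```

```
[Maturity] Mature — it held up. The one-way-door risk is gone (draft-only); the tone risk is now a validation step, not a blind bet.
Watch-item: confirm drafting actually saves time vs. just writing the reply yourself.
Record this, or start building?
```

Note what the filter did: it surfaced the two risks that could actually hurt you, **dismissed** two generic objections (with reasons, so you can overrule it), and asked only the questions that would change the decision.

## Customization

- **Record target**: by default, the skill records a mature idea to "the project's notes/memory file". Point it at wherever you keep project context (a `NOTES.md`, an Obsidian vault, an issue tracker, and so on).
- **Language**: the skill talks to you in your language; the critic prompt is in English (swap it out if you prefer).
- **Attribution**: set the `author` field in the `SKILL.md` frontmatter and the copyright line in `LICENSE`.

## License

MIT. See `LICENSE`.

The eight-angle adversarial framework is inspired by public "devil's advocate / red-team" prompt patterns; the filter-layer design (real risk vs. over-worry vs. your call) is what sets it apart.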
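
## Illustrative sketch of the filter layer

The three-way filter described under "How it works" is carried out by the assistant's own judgment, guided by `SKILL.md`; the skill does not ship any code for it. Purely to make the criteria concrete, here is a minimal Python sketch of how the same triage could be written as explicit rules. Every name in it (`Objection`, `triage`, the individual fields, and the `0.25` threshold) is a hypothetical chosen for illustration, not part of the skill.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum


class Verdict(Enum):
    REAL_RISK = "real-risk"
    OVER_WORRY = "over-worry"
    YOUR_CALL = "your-call"


@dataclass
class Objection:
    """One critic objection. All fields are hypothetical, for illustration only."""
    text: str
    probability: float             # rough likelihood, 0.0 to 1.0
    impact: float                  # rough severity, 0.0 to 1.0
    reversible: bool = True        # False means a one-way door
    load_bearing: bool = False     # the idea collapses if this assumption is wrong
    applies_to_you: bool = True    # about your situation and scale, not a generic worry
    preference_call: bool = False  # hinges on intent, risk tolerance, or priorities


def triage(o: Objection, threshold: float = 0.25) -> tuple[Verdict, str]:
    """Sort one objection into a bucket and return the reason alongside it."""
    if o.preference_call:
        return Verdict.YOUR_CALL, "depends on your intent, risk tolerance, or priorities"
    if not o.applies_to_you:
        return Verdict.OVER_WORRY, "generic or premature for your situation and scale"
    if o.load_bearing:
        return Verdict.REAL_RISK, "load-bearing assumption"
    if not o.reversible:
        return Verdict.REAL_RISK, "one-way door"
    if o.probability * o.impact >= threshold:
        return Verdict.REAL_RISK, "high probability x high impact"
    return Verdict.OVER_WORRY, "low expected cost and reversible"


# The four objections from the email auto-reply example, with made-up scores.
objections = [
    Objection("Auto-sending can't be undone", 0.3, 0.9, reversible=False),
    Objection("Replies may not sound like me", 0.5, 0.8, load_bearing=True),
    Objection("Won't scale to millions of users", 0.9, 0.9, applies_to_you=False),
    Objection("Gmail could change their DOM", 0.3, 0.2),
]

for o in objections:
    verdict, reason = triage(o)
    print(f"[{verdict.value}] {o.text} ({reason})")
```

Returning the reason together with the verdict mirrors the README's point that over-worry items are flagged with their rationale, so you can push back on any classification you disagree with.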

Tags: AI agents, Claude Code, brainstorming, prompt engineering, strategic decision points, decision support, project evaluation