lakeraai/pint-benchmark

GitHub: lakeraai/pint-benchmark

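A benchmark framework for objectively evaluating the performance of LLM prompt injection detection systems, providing multilingual, multi-category attack samples and a standardized evaluation method.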

Stars: 164 | Forks: 21

# :beer: Lakera PINT Benchmark

The **Prompt Injection Test (PINT) Benchmark** provides a neutral way to evaluate the performance of a prompt injection detection system, such as [Lakera Guard](https://www.lakera.ai/), without relying on known public datasets that these tools can use to optimize for evaluation performance.

## PINT Benchmark scores

| Name | PINT Score | Test Date |
| ---- | ---------- | --------- |
| [Lakera Guard](https://lakera.ai/) | 95.2200% | 2025-05-02 |
| [AWS Bedrock Guardrails](https://aws.amazon.com/bedrock/guardrails/) | 89.2404% | 2025-05-02 |
| [Azure AI Prompt Shield Documents + User Prompts](https://learn.microsoft.com/en-us/azure/ai-services/content-safety/concepts/jailbreak-detection#prompt-shields-for-documents) | 89.1241% | 2025-05-02 |
| [`protectai/deberta-v3-base-prompt-injection-v2`](https://huggingface.co/protectai/deberta-v3-base-prompt-injection-v2) | 79.1366% | 2025-05-02 |
| [Llama Prompt Guard 2 (86M)](https://huggingface.co/meta-llama/Llama-Prompt-Guard-2-86M) | 78.7578% | 2025-05-05 |
| [Google Model Armor](https://cloud.google.com/security-command-center/docs/model-armor-overview) | 70.0664% | 2025-08-27 |
| [Aporia Guardrails](https://www.aporia.com/) | 66.4373% | 2025-05-02 |
| [Llama Prompt Guard](https://huggingface.co/meta-llama/Prompt-Guard-86M) | 61.8168% | 2025-05-02 |