FindSkills

Prompt injection detection skill

Two-layer content safety for agent input and output. Use when (1) a user message attempts to override, ignore, or bypass previous instructions (prompt injection), (2) a user message references system prompts, hidden instructions, or internal configuration, (3) receiving messages from untrusted users in group chats or public channels, (4) generating responses that discuss violence, self-harm, sexual content, hate speech, or other sensitive topics, or (5) deploying agents in public-facing or multi-user environments where adversarial input is expected.

作者 clawhub 社区 来源: clawhub
质量: 中等 安全: 社区 分类: 通信 更新时间: 2026-02-15
在 ClawHub 上查看 JSON API

通信分类的相关技能

depth-estimation
Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM vid...
claude-to-im
Bridge Claude Code / Codex to IM platforms — chat with AI coding agents from Tel...
myclaw-guardian
🛡️ Guardian watchdog for OpenClaw Gateway — auto-monitor, self-repair via docto...
mailbox-cli
email skill
interclaw
Secure, sequenced, PGP-signed email mesh for agent-to-agent coordination
Iblai Openclaw Email
Triage every email with the cheapest model that can handle it and save 90%+

查看全部通信技能