3,611 tools and skills for media tasks
聚合并展示头条号及其他平台粉丝、阅读和收益数据,生成可视化自媒体运营报告。
Execute RunningHub ComfyUI workflows via API. Use when you need to run ComfyUI workflows on RunningHub cloud platform, s
Provides local image analysis, OCR text extraction, object detection descriptions, image comparison, metadata reading, a
SenseAudio Voice - 语音合成 (TTS) + 语音识别 (ASR),支持语言自动切换
SenseAudio Voice - 语音合成 (TTS) + 语音识别 (ASR),支持语言自动切换
Execute RunningHub ComfyUI workflows via API. Use when you need to run ComfyUI workflows on RunningHub cloud platform, s
Safety Guard URLs or files with the safety-guard CLI (web, PDFs, images, audio, YouTube).
Safety Guard URLs or files with the safety-guard CLI (web, PDFs, images, audio, YouTube).
Intelligently dispatch WeChat messages by extracting recipients and content, handling text, images, or files with confir
Converts a YouTube video into a multi-voice AI podcast using a local API.
语音笔记助手,自动录音转文字并整理成结构化笔记,预览模式和撤销功能!
MiniMax AI图片生成,支持 image-01 和 image-01-live 模型。 image-01: 画面表现细腻,支持文生图、图生图 image-01-live: 手绘、卡通等画风增强,支持文生图并进行画风设置 需要 Mini
Spawn subagents with personas from a local workspace library or the Emblem persona marketplace. Use when a task needs a
Turn any website into a CLI command. 36 platforms, 103 commands — Twitter, Reddit, GitHub, YouTube, Zhihu, Bilibili, Wei
Draft and publish posts to 小红书 (Xiaohongshu/RED). Use when creating content for 小红书, drafting posts, generating cover im
Generate images via NewAPI Banana API (nano-banana, Gemini). Supports text-to-image and image-to-image.
抖音视频下载工具 - 解析抖音链接,下载视频并发送
Create a brand jingle (5–15 seconds) from a brand name and tone keywords. Use when users want a brand sound logo, audio
Foreign language pronunciation coach — listen to standard TTS pronunciation, record yourself, get word-by-word feedback
Guided voice cloning workflow — from recording tips to first playback. Use when users want to clone their voice, create
Analyze social media sentiment related to stocks, providing sentiment scores, trend changes, discussion heat, and bot de
Generate a read-only OpenCard profile card from local OpenClaw identity and session files, rendering a normalized HTML o
Generate a complete bedtime story audio program from a keyword — with intro, narration, character voices, and a sleepy o
Generate a complete song from a text description — AI writes lyrics then composes music. Use when users want to create a
Analyze a Douyin video from a shared URL by extracting page metadata, attempting audio capture, transcribing the audio i
Generate illustrated children's story videos with AI images and TTS narration using ComfyUI running locally.
AI驱动的社交媒体管理,自动生成内容日历,推荐最佳发布时间,智能回复互动,分析并优化表现,支持多平台发布。
Analyze local or linked video files and convert them into structured summaries of features, functions, workflows, or top
Capture frames or clips from RTSP/ONVIF cameras.
个人表达能力训练教练。支持即兴话题练习(AI评分+反馈)、职场/社交场景角色扮演模拟、表达框架速查、 自定义话题管理、进步追踪与数据分析、每日表达力Tips推送。 语音优先,通过 Whisper 转写分析口语特征(填充词、停顿、流畅度)。
Automate comprehensive desktop tasks on Windows/macOS/Linux with safe, logged mouse, keyboard, OCR, image recognition, m
Use this skill whenever a user wants to verify whether content (text, images, audio, video, or documents) was created by
Create compelling marketing hooks and content structures using the Puzzle-Driven Model. Use when creating social media p
说中文出外语语音——按住说中文,2-3秒内播放英/日/韩语音。支持场景模式、双向对话、常用句收藏。
create short persona-driven voice replies with random or user-selected roles, using an openai-compatible llm endpoint fo
Fetch weather data and generate a spoken weather broadcast using SenseAudio TTS.
Text-to-Speech (TTS) and Speech-to-Text (ASR) using coze-coding-dev-sdk. Returns results directly to stdout.
Create images from text prompts. Use when you need to generate/draw/create images based on descriptions.
Convert any webpage to a clean, high-quality PDF file and send it directly. Use when the user asks to view a website, sc
Detect YouTube Music links as MV (music video) or song (audio). Use when user shares YouTube Music links (music.youtube.
Intelligently detects and removes "豆包 AI 生成" watermarks from QQ videos with user-customizable regions, preserv
Converts PDF documents (invoices, reports) to High-Fidelity OFD format with pixel-perfect precision.
银发族语音助手——老年人对着手机说话就能发消息、查天气、设闹钟、听戏曲,无需学任何操作。
Get real-time trending topics and hot searches from major Chinese social media platforms including Weibo, Zhihu, Baidu,
Give your AI agent eyes to see the entire internet. Install and configure upstream tools for Twitter/X, Reddit, YouTube,
Translate books (PDF/DOCX/EPUB) into any language using parallel sub-agents. Converts input -> Markdown chunks ->
内容合规审查 skill。当用户需要对文档或文本进行金融合规审查时使用,包括上传文件(pdf/docx)或直接输入文本,选择审核策略,提交审核并等待结果,最终展示风险提示、违规条目、违规原因和修改建议。触发词包括:合规审查、内容审查、审核文
Assist with web login processes that require CAPTCHA verification. Uses Chrome DevTools MCP to capture screenshots, reco
Custom TTS using Gemini 2.5 Flash for high-quality, persona-driven voice output.
把 Markdown 渲染成漂亮网页、导出 Word/PDF、图片做幻灯片。专为 AI Agent 输出设计。