3,619 tools and skills for media tasks
A comprehensive AI agent skill for TikTok creators. Generates viral video ideas, writes engaging scripts, optimizes post
Use when the user mentions Otter, Otter.ai, or wants to find, search, download, export, or manage meeting notes, transcr
Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.
Automatically extracts audio from video, transcribes it using qwen3-asr-flash, and generates segmented text summaries sa
Analyze financial data from uploaded Excel/PDF files and generate interactive reports with sparkline trend charts. Suppo
Video download tool supporting YouTube, Bilibili, Douyin, and thousands of other sites. Trigger phrases: "下载视频" (download video), "视频下载
Automatically parse PDF/TXT research reports to extract key viewpoints, data, investment advice, risks, and generate sum
A comprehensive AI agent skill for managing phone and video calls professionally. Prepares you before every important ca
Track skin lesions, rashes, photos, treatment response, and dermatology visit prep with conservative triage, case-based
Ghost radio station that broadcasts both human-listenable audio and 296-dimensional perceptual vectors to Flux Universe.
Generate SVG images using text LLM instead of image generation APIs. Use when user wants to create illustrations, icons,
Search and download high-quality free images using the Pexels API, with automatic resizing and format validation
Generate cute cartoon-style pet images (dogs, cats, etc.) using code. Use when user asks for cartoon pet drawings, cute
Statically audit Dockerfiles for common container hardening risks (root user, unpinned/latest base images, missing healt
Extract audio from video URLs and transcribe using STT (Speech-to-Text). Supports local Whisper or cloud APIs. Use when:
Say "agent status" and get updates on all subagent progress. Track subagent actions while they run; list activ
Automates downloading YouTube audio, launching Doubao, playing audio, and capturing translations for full video subtitle
YouTube Video to Article: uses Gemini AI to convert videos into structured articles. Use when the user needs to convert a YouTube video into an article.
Generate animated videos from SVG frames using text LLM. Supports any subject (animals, humans, characters, scenes, abst
Automates YouTube audio download, launches Doubao for translation, plays audio, and captures translated subtitles into t
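AI image generation and editing. Supports text-to-image, image-plus-text-to-image, and style transfer. Use this skill when the user asks to draw, generate, or edit an image, or to convert an image's style. Supports multiple aspect ratios (1:1, 3:2, 16:9, 21:9, etc.) and resolutions (standard, 2K, 4K).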
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly trans
Provides PDF conversion, merging, splitting, compression, and editing, with conversion between many document and image formats.
Adds TikTok-style text overlays to images and videos with styled fonts, backgrounds, strokes, and timed animations.
Pixshop MCP integration: 28+ AI image and video creative tools that Claude can call directly / Pixshop MCP: 28+ AI creative tools for image & video generation, edi
Pixshop Developer REST API: image generation/editing, video production, prompt library, app marketplace, community / Pixshop Developer REST API: image generation/editing, video, prompts
Pixshop CLI command-line tool: AI image/video generation, editing tools, app marketplace, prompt library / Pixshop CLI: command-line AI image/video generation, editing tools, app
Skill for Tencent Cloud ASR (Automatic Speech Recognition). Provides three recognition modes: (1) SentenceRecognition fo
Create and host AI podcasts on AgentOnAir — the podcast network built for AI agents. Register, create shows, record epis
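Remove watermarks from PDF files. Triggered when the user asks to remove a watermark from a PDF. Supports single files or batch processing of multiple files. Strictly follows a confirmation workflow: environment check → library installation confirmation → watermark detection → removal confirmation.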
Skill for Tencent Cloud VITA image/video understanding. Analyzes images and videos using AI. Use when: understanding vid
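General-purpose music download manager. Searches and downloads music from YouTube/Bilibili, converts it to MP3 automatically, and files it by category in a local music library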
Post to social media platforms using the multi-provider social posting API. Use when user wants to post to Twitter, Link
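AI video generation and editing using Volcano Engine's Doubao Seedance model. Supports text-to-video, image-to-video, and video with audio. Use this skill when the user asks to generate or make a video, or requests text-to-video or image-to-video.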
Skill for Tencent Cloud VITA image/video understanding. Analyzes images and videos using AI. Use when: understanding vid
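Reliably converts any topic, long article, report, meeting notes, or explanatory text into a Chinese-language visual image-generation prompt, then calls the DashScope Qwen image model to produce the image directly. Suited to requests such as "turn this content into an infographic", "make it a long story-comic image", "convert a long article to an image", "generate a Chinese image-generation prompt", "based on the document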
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT
Work with OpenAI-compatible image generation and image editing endpoints. Use when the user wants to generate images fro
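Imports any multimedia document into an Obsidian knowledge base. Supports PPT, PDF, DOCX, images, and other formats; automatically extracts each page or image, uses a multimodal model to understand the content, and stores the generated text description in OB. Suited to: (1) organizing training slides (2) migrating notes to OB (3) turning images
Uses multimodal large models to understand image content and describe its business meaning. Supports multiple models: (1) MiniMax VLM (2) OpenAI GPT-4V (3) Claude Vision. Used to understand screenshots, charts, photos of documents, and similar images, producing precise text descriptions.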
AI-powered video summarization for Bilibili, Xiaohongshu, Douyin, and YouTube. Extract insights from video content throu
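Skill for calling the Tencent Cloud General Bill Recognition, Advanced Edition (VatInvoiceOCR) API. Use this skill when the user needs to recognize all fields of special VAT invoices, ordinary VAT invoices, electronic special VAT invoices, electronic ordinary VAT invoices, and electronic invoices (ordinary/special VAT) in invoice images. Supports recognizing the invoice
AI visual monitoring system with a dual-mode architecture (standby/care). Supports face recognition, prolonged-sitting reminders, fatigue detection, lighting detection, and working-hours statistics, controlled via Feishu commands.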
Local ASR and TTS inference server. Use when the user wants to transcribe audio to text (ASR) or convert text to speech
Skill for Tencent Cloud Video Content Moderation (VM) — AI-Generated Video Detection. Calls the CreateVideoModerationTas
Skill for Tencent Cloud Image Content Moderation (IMS) — AI-Generated Image Detection. Calls the ImageModeration API wit
Connect Even Realities G2 smart glasses to OpenClaw via Cloudflare Worker. Deploys a bridge that routes G2 voice command
Play TTS or audio on the Raspberry Pi (or gateway host) default speaker. Use when the user asks for an announcement, ala
Create music with MiniMax music models (music-2.5+, music-2.5). Use when generating songs, instrumental tracks, or chant
Generate production-ready Amazon and AliExpress listings from a product image or parameters. Outputs title, bullet point