3,611 tools and skills for media tasks
fal.ai API integration with managed API key authentication. Run AI models for image generation, video generation, audio
CLI for the Seer media request management API. Search movies and TV shows, create and manage media requests, manage user
文章配图推荐。根据文章主题、内容关键词,推荐合适的配图来源和搜索关键词,帮助用户找到符合文章意境的图片。当用户提到「配图」「找图」「文章图片」「封面图」「插图」时激活。
Analyze audio quality, detect noise types, and provide improvement recommendations. Use when users need to check audio q
语音录音转录并保存到 Notion 数据库。使用 faster-whisper 转录,自动提取关键信息并写入数据库。
Control Ezviz PTZ cameras via the open platform, supporting device listing, status, PTZ control, presets, and cruise pla
Generate audiobooks from novels and long-form text with chapter management and character voices. Use when users mention
Transcribe meetings with speaker identification and generate summaries with action items. Use when users need meeting tr
Create language learning audio with adjustable speed, pronunciation examples, and bilingual content. Use when users need
Diagnose why short-video retention drops and suggest practical fixes. Use when views start but audience leaves early.
Build and troubleshoot SenseAudio speech recognition integrations, including HTTP transcription (`/v1/audio/transcriptio
Create videos using ShortVideo API. Supports product-to-video, image-to-ad-video, and replicate-video. Use this skill wh
Edit and enhance images and videos with AI via muapi.ai — prompt-based editing, upscaling, background removal, face swap
Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjo
Direct high-fidelity cinematic video with AI — translates creative intent into technical cinematographic directives for
离线使用 OpenAI Whisper 免费转录本地视频音频,支持多格式多语言,生成时间戳字幕及AI内容摘要。
Generates dual-disease transcriptomic and ML research designs for shared biomarkers, hub genes, and mechanisms, outputti
Generates complete Mendelian Randomization + single-cell transcriptomics (scRNA-seq) research designs from a user-provid
远程配置萤石摄像机参数,支持布防状态、镜头遮蔽、全天录像和移动侦测灵敏度等9种设备设置。
Expert Cinema Director skill for Seedance 2.0 (ByteDance) — high-fidelity video generation using technical camera gramma
使用豆包2.0模型解析视频。当需要执行分析视频内容等需要理解视频视觉信息时调用该技能。你必须在持有本地视频路径或网络视频链接时才能调用该技能
Convert Markdown documents to presentation slides (PDF/PPTX/HTML) using Marp. Supports Mermaid diagrams (gantt, flowchar
Manage YouTube video categories. Use this skill to list available video categories. Useful when working with YouTube vid
直接调用通义万相2.6视频生成模型(Qwen Wan 2.6),支持文生视频和图生视频,无需中间API代理。适用于需要直接对接阿里云大模型的视频创作场景。
Import local PDF files into Zotero from the command line on Windows/macOS/Linux via the Zotero local connector (127.0.0.
Generate a pack of professional or aesthetic photos from a single reference image while preserving the exact identity of
Analyze audio quality, detect noise types, and provide improvement recommendations. Use when users need to check audio q
通过 Zotero 本地连接器(127.0.0.1),在 Windows/macOS/Linux 上使用命令行将本地 PDF 文件导入 Zotero。用户需要导入单个文件、批量导入文件夹、导入到已有分类、列出分类或校验最近导入的附件时使用。
Clone voices from short audio samples and generate personalized audio content. Use when users want to clone voices, crea
Build real-time voice chatbot applications with natural conversation flow and customizable personalities. Use when users
Generate professional video narration with timing synchronization and style matching. Use when users need voiceovers, vi
Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via
Predict intelligence skill for AI agents. Generates professional PDF reports with probability-ranked predictions, D3 vis
Integration guide for SenseAudio Open Platform APIs, including TTS (sync/SSE/WebSocket), ASR (HTTP/WebSocket), realtime
Detect and solve simple image captchas during browser automation. Use when flows encounter 4-6 character text, distorted
图片处理助手:将受限目录的图片复制到允许的目录,然后使用 image 工具进行分析。适用但不限于 QQBot 下载的本地图片。
BizyAir 图生图(Image-to-Image)助手。将本地图片上传后作为参考,使用 AI 生成新的图片。当用户说"根据这张图片生成"、"图生图"、"参考图片生成"、&quo
Generate customizable social preview images for Open Graph, Twitter, GitHub, and more using a fluent builder API.
Improve ecommerce product image clarity for listings and ads. Use when teams need sharper images without changing the pr
Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-capt
Generate, edit, and compose images using Gemini models. Activate when user asks to generate images, draw, create logos/p
Cultural radar of Pernambuco blending football, Manguebeat, and regional music with poetic insights inspired by Recife a
Manage YouTube watermarks. Use this skill to set or unset watermarks for channel videos. Useful when working with YouTub
Create OpenClaw skills from best practice videos or image sequences. Use when creating skill from video, generating skil
Generate news-style social media images (1080x1350) with Thai text overlay and matching captions. Use when asked to crea
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
Automate TikTok slideshow marketing for any app or product. Researches competitors, generates AI images, adds text overl
Speech recognition CLI for AI agent automation. Transcribe audio from stdin, files, or URLs.
Guide for SenseAudio voice selection, plan-level voice entitlement checks, and cloned voice usage constraints in TTS cal
Generate synchronized subtitles (SRT/VTT/ASS) from video audio with precise timestamps. Use when users need subtitles, c