3,619 tools and skills for media tasks
Optimize ecommerce product images for platforms like Taobao and Amazon by analyzing design elements and predicting click
Use when generating speech from text (text-to-speech) via IMA Open API. Use for: voice synthesis, TTS, 朗读 (read aloud), 语音合成 (speech synthesis), 配音 (dubbing), 有声内容 (audio content)
Video-driven face animation using DreamAct API. Animate multiple input images with expressions, lip movements, and head
Professional markdown rendering, export, and publishing. Convert markdown to styled HTML, PDF, DOCX, and slide decks wit
Image generation via Volcengine Seedream API. Use this when you need to perform Text-to-Image (T2I), Image-to-Image (I2I
Generate videos from first and last frames using Head Tail to Video (Wan2.1) API. Automatically generates 4-second video
Generate videos from text using Text to Video (Wan2.1) API. Automatically generates 4-second videos from text descriptio
Sync, transcribe, and intelligently organize voice memos, audio/video files, and video URLs.
Generate talking videos from images using Talking Image API. Create talking videos from audio and images, supporting non
Generate videos from images using Image to Video (Wan2.1) API. Automatically generates 4-second videos from text descrip
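Remove image backgrounds with the rembg AI model and generate PNG images with transparent backgrounds. On first use, run setup/install.py to initialize the environment.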
Convert text or subtitle files into speech audio with options for voice cloning, emotion control, speed adjustment, and
Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers i
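Workflow for batch collection of Douyin account videos and transcript analysis. Activate this skill when the user provides a Douyin account homepage link, asks to fetch the latest N video links, wants video transcripts extracted (speech-to-text), or wants video content summarized. Depends on the browser tool (to fetch the video list) and mcporter +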
Fetches the latest news using news-aggregator-skill, formats it into a podcast script in Markdown format, and uses the t
Translate and dub videos from one language to another, replacing the original audio with TTS while keeping the video int
Generate and edit images using Google's Nano Banana 2 (Imagen) model — the latest high-quality AI image generation model
Convert UI screenshots/images into fully functional HTML/CSS copies. This skill is used when a user provides images of a
Turn a Bilibili video URL or BV number into a summarized XMind mind map. Use when the user wants to collect subtitles, c
Creaa AI - Generate and edit images, plus generate videos via Creaa.ai API. Text-to-image, image edit, text-to-video, im
Control the user's real Safari browser on macOS using AppleScript and screencapture. Read pages, click elements, type te
BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a
Generate complete blog posts with SEO optimization, image suggestions, meta descriptions, and platform-specific formatti
Generate styled QR images/cards for ClawHub skills so users can scan and install instantly. Supports install-command pay
Convert text or subtitle files into speech audio with options for voice cloning, emotion control, speed, and timeline-ac
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image / Imagen Pro) — the premium AI image generat
Generate, animate, and edit AI videos using Kuaishou's Kling 3.0 and Kling Video O3 — featuring cinematic motion quality
Generate AI videos using ByteDance's Seedance 1.5 Pro — a native audio-visual joint generation model with cinematic came
X2C Distribution and Wallet API — publish video to X2C platform, manage assets (balance, claim X2C, swap to USDC, withdr
Voice cloning and TTS using MiniMax API. User must provide a voice name when cloning; after success, voice_name->voic
Searches Xiaohongshu (小红书) for a given keyword, extracts the top N posts (including texts, images, and user comments), an
Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech ge
Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.
Configure and start a MIPI CSI camera, USB camera, or stereo depth camera (RealSense/ZED/Orbbec) on the RDK X5, and preview the live feed in a web browser. Use when the user wants to start a c
Use when building applications that generate images, videos, or other AI-generated content. Reach for this skill when yo
Inspect Yandex Music via the MarshalX yandex-music library. Use when the user wants Yandex Music search, current track l
Generate AI videos using Vidu — featuring text-to-video, image-to-video, reference-to-video, and start-end-to-video with
Generate educational videos programmatically using Remotion + React. Use when creating tutorial videos, explainer conten
AI face swap service - Use verging.ai AI face swap directly from command line. Supports local video files and images, re
Generate AI videos and images using Alibaba's Wan 2.6 and Wan 2.5 — featuring text-to-video, image-to-video, video-to-vi
Generate and edit AI images using ByteDance's Seedream — featuring text-to-image, image editing, and batch sequential ge
Manage and schedule social media content across Twitter, 小红书, 微博, B站, Binance Square, and Blog with calendar, status, st
Manage WordPress sites via REST API and WP-CLI. Use when: (1) publishing, editing, or deleting posts/pages, (2) managing
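Extracts the core features of two faces and blends them naturally, with support for multiple style adaptations. Improves creative interactivity and content reach; widely used in creative marketing, interactive entertainment, and social sharing scenarios.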
Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio
Extract speech-to-text from Douyin (TikTok China) videos, get watermark-free download links, and download videos. Use wh
Generate BNBOT lobster-bot mascot images in consistent style using reference images. Use when the user asks to generate
Image generation with IMA Open API using only Nano Banana series: Nano Banana, Nano Banana Pro, Nano Banana 2. Budget (N
Remotely manage Docker Compose instances via SSH. Execute docker compose commands, view logs, update images, and exec in
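Generate complete short-video scripts, opening hooks, titles, storyboards, CTAs, and trending-topic suggestions for Douyin, Kuaishou, YouTube Shorts, and Bilibili.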