3,611 tools and skills for media tasks
Generate images and videos via WangYi Banana API (nano-banana, SORA2). Supports text-to-image, image-to-image, text-to-v
This skill should be used when the user wants to "build a resume", "create a resume", "update m
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).
Send prompts to a local ComfyUI instance to generate images based on user descriptions.
Run pdf2zh-next through a unified script. Supports single or multiple PDFs, directory batch processing, and glob filtering; when no provider is specified, config.toml applies; when a provider is specified, it is passed to the main program via the official --<Services> argument;
Create explainer videos with narration and AI-generated visuals. Triggers on: "解说视频", "explainer video"
Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate im
Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "
Send user prompts to local ComfyUI at http://127.0.0.1:8188 using workflow.json to generate images.
Generate original background music for short videos from a natural language description. Use when creators need royalty-
Recommend the best SenseAudio voice for any scenario or emotion. Use when users ask which voice to use, e.g. "儿童故事
Zero-config text-to-speech: give text, get an mp3 file. Handles natural-language voice selection ("用女声", "
Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user
Fix Chinese polyphone (多音字) mispronunciation in TTS by auto-detecting ambiguous characters and applying pinyin annotatio
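Multi-source intelligent content processor: supports WeChat Official Accounts, web pages, YouTube, PDF, Markdown, and more; automatically uploads to NotebookLM and generates podcasts, PPT slides, mind maps, and other formats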
Automated drama video generator - from script to multi-character drama videos with OpenAI TTS, Whisper, and Remotion
This skill should be used when the user wants to optimize an epub file for e-ink readers (墨水屏电子书). It handles image dedu
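Create a video generation task from user-provided text and images (image URLs and local paths supported), and submit it to a remote video service using an API key.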
Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables fr
Control JoyIn AI robots (W-1 Walle / M-1 Mini) — movement, follow, photo, video, live stream, TTS, agent config, and dev
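Resume optimization assistant. Helps write and revise resumes, export to PDF, and prepare interview Q&A. Use for: updating a resume, adding project experience, adjusting layout, exporting to PDF, and preparing interview Q&A.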
Import sources (URLs, YouTube, files, text) into Google NotebookLM and generate user-selected artifacts: podcast, video,
Comprehensive consumer electronics industry sourcing guide for international buyers – provides detailed information abou
Edit PDFs with natural-language instructions using the nano-pdf CLI.
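Generate or edit images from prompts using the provided Nano Banana API key. Supports text-to-image and image-to-image; the default model is Banana 2 (香蕉2), switchable to Banana Pro (香蕉pro).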
Convert text to speech with Microsoft Edge neural voices via the node-edge-tts package. Use when a user wants audio outp
Download videos, magnet links, and torrents to various cloud storages via 115/Quark offline APIs, yt-dlp, rclone, and ar
Use this skill when the task is to merge PDFs from a nested directory tree into a single PDF with hierarchical bookmarks
100+ expert prompts to turn OpenClaw into an AI coworker across 44 professions. Instantly deploy agents for tech, busine
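Batch image compression and format conversion tool. Supports batch resizing, compression quality settings, format conversion, a preview mode, and undo.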
Generate images from text prompts using AI models via OpenRouter or Kie.ai. Use when the user asks to generate, create,
Control macOS GUI apps visually — take screenshots, click, scroll, type. Use when the user asks to interact with any Mac
Multi-model OCR benchmark and comparison tool. Run OCR on images using Claude (Opus/Sonnet/Haiku via Bedrock), Gemini (P
AI assistant for creating, scheduling, and analyzing social media content tailored for Twitter, Instagram, LinkedIn, and
AI-powered assistant for creating, scheduling, and analyzing social media content with platform-specific optimization an
Create and maintain dedicated archives for each pet, including profiles, daily journals, photos, feeding logs, health re
Extract text from images using Tesseract.js OCR. Supports Chinese (simplified/traditional) and English.
Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditio
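End-to-end production workflow for Lofi ambience videos on the With me. YouTube channel. Xiaomi (Content) orchestrates everything, from topic selection through publishing. Includes AI image generation, Envato audio download, FFmpeg compositing, SEO asset packs, YouTube upload scheduling, Shorts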
Generate creative, brand-compliant image prompts for marketing visuals by blending artistic inspiration with CanMarket’s
Create AI videos with Veo 3 on Pixwith using text or image input, schema-guided parameters, and async result polling.
AI video, image generation. 40+ models — Sora, Veo 3, Kling, Seedance, GPT Image, Hailuo, WAN. Text-to-video, image-to-v
Voice reply skill - automatically generates speech for every reply and saves it to the desktop; supports Noiz AI TTS
Build a reusable UI inspiration library that both archives and retrieves design references. Use when the user wants to s
Generate and edit AI images with Nano Banana 2 on Pixwith using text prompts, reference images, schema-aware options, an
Documents the tool services in Teamgram Server including idgen (Snowflake ID), status (online TTL), dfs (Minio file stor
Control real Android phones through the Mobilerun API. Supports tapping, swiping, typing, taking screenshots, reading th
AI Video generation toolkit — generate videos from text prompts or input images using multiple AI models (Veo 3.1, Veo 3
Generate images, faceswap, edit photos, animate expressions, and do style transfer via a self-hosted ComfyUI instance on