Media AI Skills - 3,611 Tools

wangyi-banana2

Generate images and videos via WangYi Banana API (nano-banana, SORA2). Supports text-to-image, image-to-image, text-to-v

by clawhub · community · Quality: medium

Placed Resume Builder

This skill should be used when the user wants to "build a resume", "create a resume", "update m

by clawhub · community · Quality: medium

nano-banana-pro (Laomo Tech patched edition)

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

by clawhub · community · Quality: medium

Comfyskill

Send prompts to a local ComfyUI instance to generate images based on user descriptions.

by clawhub · community · Quality: medium

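pdf2zh-next Translation

Run pdf2zh-next through a unified script. Supports single or multiple PDFs, directory batch processing, and glob filtering; when no provider is specified, config.toml applies; when a provider is specified, it is passed to the main program via the official --<Services> argument;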

by clawhub · community · Quality: medium

Explainer

Create explainer videos with narration and AI-generated visuals. Triggers on: "解说视频", "explainer video"

by clawhub · community · Quality: medium

Image Gen

Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate im

by clawhub · community · Quality: medium

ListenHub Asr

Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe",

by clawhub · community · Quality: medium

Comfy Skill

Send user prompts to local ComfyUI at http://127.0.0.1:8188 using workflow.json to generate images.

by clawhub · community · Quality: medium

BGM Maker

Generate original background music for short videos from a natural language description. Use when creators need royalty-

by clawhub · community · Quality: medium

Voice Picker

Recommend the best SenseAudio voice for any scenario or emotion. Use when users ask which voice to use — e.g. "儿童故事

by clawhub · community · Quality: medium

Quick TTS

Zero-config text-to-speech: give text, get an mp3 file. Handles natural-language voice selection ("用女声",

by clawhub · community · Quality: medium

Canvas Design Anthropic

Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user

by clawhub · community · Quality: medium

Polyphone TTS

Fix Chinese polyphone (多音字) mispronunciation in TTS by auto-detecting ambiguous characters and applying pinyin annotatio

by clawhub · community · Quality: medium

Anything To Notebooklm

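Multi-source content processor: supports WeChat Official Account articles, web pages, YouTube, PDF, Markdown, and more; automatically uploads to NotebookLM and generates podcasts, PPT slides, mind maps, and other formats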

by clawhub · community · Quality: medium

drama generator

Automated drama video generator - from script to multi-character drama videos with OpenAI TTS, Whisper, and Remotion

by clawhub · community · Quality: medium

epub-eink-optimizer

This skill should be used when the user wants to optimize an epub file for e-ink readers (墨水屏电子书). It handles image dedu

by clawhub · community · Quality: medium

magic-image2video

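Create a video generation task from user-provided text and images (image URLs and local paths are supported) and submit it to a remote video service using an API key.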

by clawhub · community · Quality: medium

Pdf Anthropic

Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables fr

by clawhub · community · Quality: medium

JoyIn Robot Control

Control JoyIn AI robots (W-1 Walle / M-1 Mini) — movement, follow, photo, video, live stream, TTS, agent config, and dev

by clawhub · community · Quality: medium

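Resume Optimization Assistant

Resume optimization assistant. Helps write and revise resumes, export them to PDF, and prepare interview Q&A. Suitable for: updating a resume, adding project experience, adjusting layout, exporting to PDF, and preparing interview Q&A.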

by clawhub · community · Quality: medium

NotebookLM Studio

Import sources (URLs, YouTube, files, text) into Google NotebookLM and generate user-selected artifacts: podcast, video,

by clawhub · community · Quality: medium

China Consumer Electronics Sourcing

Comprehensive consumer electronics industry sourcing guide for international buyers – provides detailed information abou

by clawhub · community · Quality: medium

Nano Pdf 1.0.0

Edit PDFs with natural-language instructions using the nano-pdf CLI.

by clawhub · community · Quality: medium

Nano Banana Image T8

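Generate or edit images from a prompt using the provided Nano Banana API key. Supports text-to-image and image-to-image; the default model is Banana 2 (香蕉2), switchable to Banana Pro (香蕉pro).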

by clawhub · community · Quality: medium

Edge TTS

Convert text to speech with Microsoft Edge neural voices via the node-edge-tts package. Use when a user wants audio outp

by clawhub · community · Quality: medium

Video Fetch

Download videos, magnet links, and torrents to various cloud storages via 115/Quark offline APIs, yt-dlp, rclone, and ar

by clawhub · community · Quality: medium

Nested PDF Merger

Use this skill when the task is to merge PDFs from a nested directory tree into a single PDF with hierarchical bookmarks

by clawhub · community · Quality: medium

Openclaw Coworker Prompts

100+ expert prompts to turn OpenClaw into an AI coworker across 44 professions. Instantly deploy agents for tech, busine

by clawhub · community · Quality: medium

Image Optimizer Tool

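Batch image compression and format conversion tool. Supports batch resizing, compression quality adjustment, format conversion, preview mode, and undo.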

by clawhub · community · Quality: medium

SMTools Image Generation Skill

Generate images from text prompts using AI models via OpenRouter or Kie.ai. Use when the user asks to generate, create,

by clawhub · community · Quality: medium

Mac Use 1.0.0

Control macOS GUI apps visually — take screenshots, click, scroll, type. Use when the user asks to interact with any Mac

by clawhub · community · Quality: medium

OCR Benchmark

Multi-model OCR benchmark and comparison tool. Run OCR on images using Claude (Opus/Sonnet/Haiku via Bedrock), Gemini (P

by clawhub · community · Quality: medium

Social Media Manager

AI assistant for creating, scheduling, and analyzing social media content tailored for Twitter, Instagram, LinkedIn, and

by clawhub · community · Quality: medium

Social Media Manager

AI-powered assistant for creating, scheduling, and analyzing social media content with platform-specific optimization an

by clawhub · community · Quality: medium

Pet Companion Journal

Create and maintain dedicated archives for each pet, including profiles, daily journals, photos, feeding logs, health re

by clawhub · community · Quality: medium

OCR Test

Extract text from images using Tesseract.js OCR. Supports Chinese (simplified/traditional) and English.

by clawhub · community · Quality: medium

OCR - Local (No API Key)

Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditio

by clawhub · community · Quality: medium

With me. YouTube Production

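End-to-end production workflow for lofi ambience videos on the With me. YouTube channel. Xiaomi (Content) handles all coordination, from topic selection to publishing. Includes AI image generation, Envato audio download, FFmpeg compositing, SEO package, YouTube upload scheduling, Shorts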

by clawhub · community · Quality: medium

phy-creative-image

Generate creative, brand-compliant image prompts for marketing visuals by blending artistic inspiration with CanMarket’s

by clawhub · community · Quality: medium

(Google) Veo 3 Video Gen

Create AI videos with Veo 3 on Pixwith using text or image input, schema-guided parameters, and async result polling.

by clawhub · community · Quality: medium

Pixwith Ai Video Generation

AI video, image generation. 40+ models — Sora, Veo 3, Kling, Seedance, GPT Image, Hailuo, WAN. Text-to-video, image-to-v

by clawhub · community · Quality: medium

Voice Reply

Voice reply skill: automatically generates speech for every reply and saves it to the desktop. Supports Noiz AI TTS.

by clawhub · community · Quality: medium

Pixwith Ai Image Generation

AI video, image generation. 40+ models — Sora, Veo 3, Kling, Seedance, GPT Image, Hailuo, WAN. Text-to-video, image-to-v

by clawhub · community · Quality: medium

UI Inspiration Library

Build a reusable UI inspiration library that both archives and retrieves design references. Use when the user wants to s

by clawhub · community · Quality: medium

Nano banana 2

Generate and edit AI images with Nano Banana 2 on Pixwith using text prompts, reference images, schema-aware options, an

by clawhub · community · Quality: medium

Teamgram Tool Services

Documents the tool services in Teamgram Server including idgen (Snowflake ID), status (online TTL), dfs (Minio file stor

by clawhub · community · Quality: medium

Mobilerun

Control real Android phones through the Mobilerun API. Supports tapping, swiping, typing, taking screenshots, reading th

by clawhub · community · Quality: medium

Video Generation (t2v & i2v)

AI Video generation toolkit — generate videos from text prompts or input images using multiple AI models (Veo 3.1, Veo 3

by clawhub · community · Quality: medium

ComfyUI Bridge

Generate images, faceswap, edit photos, animate expressions, and do style transfer via a self-hosted ComfyUI instance on

by clawhub · community · Quality: medium