Media AI Skills - 3,611 Tools

UI Element Ops

Parse UI screenshots into structured element JSON (type, OCR text, bbox) and operate desktop UI from parsed elements. Us

by murongg · community · Quality: medium

summerizeryoutube

Generates structured summaries and context-based Q&A from YouTube transcripts with multi-language support, ensuring

by Gangadharpadshetty · community · Quality: medium

Travel Itinerary Planner

Generate complete, image-rich travel plans from trip dates and destination, including day-by-day itinerary, transportati

by clawhub · community · Quality: medium

Youtube Lecture Analyzer

Extracts and structures key points, evidence, and actionable insights from YouTube lecture subtitles for review and teac

by wallfacer-web · community · Quality: medium

Youtube Lecture Analyzer

Extracts and analyzes YouTube lecture subtitles to identify key points, evidence, and actionable insights for review, wr

by wallfacer-web · community · Quality: medium

VoiceClaw

Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp), and gene

by Asif2BD · community · Quality: medium

Boosta Long to Shorts

Create, monitor, and troubleshoot Boosta API video-processing jobs from natural-language requests. Use this skill when a

by hundevmode · community · Quality: medium

Google Search Skill

Performs Google searches with rich results including news, images, videos, places, shopping, scholar papers, patents, an

by minilozio · community · Quality: medium

Google Search Skill

Perform Google searches with Serper.dev API, delivering rich results including news, images, videos, places, shopping, s

by minilozio · community · Quality: medium

mmVoiceMaker

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use

by BLUE-coconut · community · Quality: medium

mmMusicMaker

Create music with MiniMax music models (e.g., music-2.5). Use when generating songs or instrumental tracks from lyrics a

by clawhub · community · Quality: medium

Selenium Browser

Start a Selenium-controlled Chrome browser, open a URL, take a screenshot, and report progress. Supports headless mode a

by andreybespalov89 · community · Quality: medium

Nano Banana 2

Generate images with Google Gemini 3.1 Flash Image Preview (Nano Banana 2) via inference.sh CLI. Capabilities: text-to-i

by okaris · community · Quality: medium

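Professional Lottery Assistant (专业彩票助手)

Professional lottery assistant: supports Double Color Ball (双色球) draw-result lookup, lottery ticket OCR, prize checking, and draw reminders. Trigger words: 彩票, 双色球, 开奖, 中奖, lottery.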

by clawhub · community · Quality: medium

Material Report

Analyze ad material videos and produce a markdown report with framework, material traits, and acquisition keywords, then

by fevolq · community · Quality: medium

feishu-audio

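Converts an audio file into a voice message that can be played in Feishu. It first converts the file to opus format with ffmpeg, then uploads it to Feishu, and finally sends an audio message. Use when the user wants to receive a playable voice message in Feishu.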

by clawhub · community · Quality: medium

video-audio-replace

Replace video audio with TTS voice while preserving original timing. Includes subtitle generation from video using Whisp

by clawhub · community · Quality: medium

Web Scout

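Give an AI agent web-wide data collection in one step. Built on Agent Reach; supports Twitter/X, Reddit, YouTube, Bilibili, Xiaohongshu, Douyin, GitHub, LinkedIn, Boss Zhipin, RSS, web-wide search, and other platforms. One-command install, zero A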

by aizain · community · Quality: medium

Creative Agents

Integration scripts for the creative agent swarm managed by overstory (Claude Code). Use when configuring or running res

by clawhub · community · Quality: medium

Weread Exporter Skill

WeRead (微信读书) book export tool: supports EPUB, PDF, MOBI, TXT, and Markdown formats

by semicolonstail · community · Quality: medium · 2 stars

Xiaohongshu Video Finder

Find fresh, viral Xiaohongshu videos by niche with filters for type, date, and popularity, and extract URLs with valid x

by travislius · community · Quality: medium

Zoom Meeting create/delete

Create or delete Zoom meetings via Server-to-Server OAuth. Use this skill when the user asks to: schedule a Zoom call, c

by clawhub · community · Quality: medium

AI Music Generation

Use for AI music generation via IMA Open API. Supports text_to_music with 3 models. IMPORTANT — Default model selection

by allenfancy-gan · community · Quality: medium

Video Pipeline Bundle

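One-stop video workflow skill bundle. Integrates the full pipeline of video editing, transcription, subtitle burn-in, and concatenation, with step-by-step execution and user confirmation. Includes: (1) auto-editor - trims silent segments from video; (2) Faster Whisper + MiniMax LLM - speech to subtitles; (3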

by Leochens · community · Quality: medium

Give eyes to your openclaw

Give your agent eyes — capture screenshots, voice, and annotations from any screen, monitor, or device via MCP.

by wolverin0 · community · Quality: medium

Video Intelligence

Download videos and get transcripts, summaries, or metadata from YouTube, TikTok, Instagram, and X (Twitter). Use when t

by nogara · community · Quality: medium

Paprika Recipe Importer

Convert recipe text (pasted text, video transcript, image description, or any raw content) into a .paprikarecipes file t

by nogara · community · Quality: medium

Zight - video instructions capability for agents

Extract structured data from Zight share links (a.cl.ly and share.zight.com), including title, stream URLs, AI smart sum

by phin · community · Quality: medium

Whisper STT

Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text

by nickylin · community · Quality: medium

Excel Translator

Translates Excel files (.xlsx) from English to Chinese while preserving all formatting, images, and charts. Use for any

by ilove323 · community · Quality: medium

pymupdf

Render PDF pages to images, extract embedded images, annotate PDFs, and perform advanced PDF inspection using pymupdf (f

by Maverick-AI-Tech · community · Quality: medium

pypdf

Extract text, metadata, and pages from PDF files using pypdf. Use for tasks such as reading PDF content, extracting spec

by Maverick-AI-Tech · community · Quality: medium

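MiniMax TTS (China Edition)

Calls the MiniMax speech synthesis API. Supports multiple Chinese voices and high-quality text-to-speech, with streaming and non-streaming audio output.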

by whille · community · Quality: medium

Ai Image Generation Prompts 9142af54

Generate an image of a whimsical, fairytale-inspired floral arch with pastel-colored flowers and delicate ivy.

by clawhub · community · Quality: medium

Bilibili Downloader

Download videos, audio, subtitles, and covers from Bilibili using bilibili-api. Use when working with Bilibili content f

by BenAngel65 · community · Quality: medium

Onivor Instagram Growth

Automate Instagram growth with a multi-agent workflow. Researches trending content across Instagram Explore, TikTok, Hac

by rullys · community · Quality: medium

Onivor Twitter Growth Engine

Automate X/Twitter personal brand growth with a multi-agent workflow. Researches trending content across X, Hacker News,

by rullys · community · Quality: medium

Onivor X Growth Engine

Automate X/Twitter personal brand growth with a multi-agent workflow. Researches trending content across X, Hacker News,

by rullys · community · Quality: medium

Video Downloader

Download online videos with quality and format controls using yt-dlp for reliable local saves.

by ivangdavila · community · Quality: medium

Hummingbot Developer

Developer skill for running Hummingbot and Gateway from source, building wheel and Docker images, and testing against Hu

by fengtality · community · Quality: medium

Onivor Playwright CLI

Browser automation via Playwright CLI. Open pages, interact with elements, take screenshots, and more. Ideal for coding

by rullys · community · Quality: medium

XHS Video Downloader

Download videos from Xiaohongshu (小红书) pages. Use when the user wants to save or download a video from a xiaohongshu.com

by HViktorTsoi · community · Quality: medium

Music Video Generator

Generate AI music videos from any MCP client. Turn text prompts into cinematic music videos with multiple styles and mod

by liuyinjiwen06 · community · Quality: medium

Playlistable

Create AI-powered Spotify playlists and discover music via Playlistable MCP. Use when the user wants to generate playlis

by Brackyt · community · Quality: medium

Libvips Image

High-performance image processing with libvips. Use for resizing, converting, watermarking, thumbnails, and batch image

by h1bomb · community · Quality: medium

Qwen3 Tts Mlx

Local Qwen3-TTS speech synthesis on Apple Silicon via MLX. Use for offline narration, audiobooks, video voiceovers, and

by h1bomb · community · Quality: medium

Gemini Watermark

Remove visible Gemini AI watermarks from images via reverse alpha blending. Use for cleaning Gemini-generated images, re

by h1bomb · community · Quality: medium

Slides Generator

Create Hummingbot-branded PDF slides from markdown with Mermaid diagram support. Use for presentations, decks, and techn

by fengtality · community · Quality: medium

Instant Client Audit Report Generator — Close Deals Before the First Call

Generates detailed AI-powered audit reports on website, SEO, ads, social media, reviews, tech stack, and competitors to

by g4dr · community · Quality: medium

Book Writing

Plan, draft, and revise complete books with chapter architecture, voice consistency, and finish-ready revision workflows

by ivangdavila · community · Quality: medium