3,611 tools and skills for media tasks
Parse UI screenshots into structured element JSON (type, OCR text, bbox) and operate desktop UI from parsed elements. Us
Generates structured summaries and context-based Q&A from YouTube transcripts with multi-language support, ensuring
Generate complete, image-rich travel plans from trip dates and destination, including day-by-day itinerary, transportati
Extracts and structures key points, evidence, and actionable insights from YouTube lecture subtitles for review and teac
Extracts and analyzes YouTube lecture subtitles to identify key points, evidence, and actionable insights for review, wr
Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp), and gene
Create, monitor, and troubleshoot Boosta API video-processing jobs from natural-language requests. Use this skill when a
Performs Google searches with rich results including news, images, videos, places, shopping, scholar papers, patents, an
Perform Google searches with Serper.dev API, delivering rich results including news, images, videos, places, shopping, s
Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use
Create music with MiniMax music models (e.g., music-2.5). Use when generating songs or instrumental tracks from lyrics a
Start a Selenium‑controlled Chrome browser, open a URL, take a screenshot, and report progress. Supports headless mode a
Generate images with Google Gemini 3.1 Flash Image Preview (Nano Banana 2) via inference.sh CLI. Capabilities: text-to-i
专业彩票助手 - 支持双色球开奖查询、彩票OCR识别、中奖核对、开奖提醒。触发词:彩票、双色球、开奖、中奖、lottery。
Analyze ad material videos and produce a markdown report with framework, material traits, and acquisition keywords, then
将音频文件转换为飞书可播放的语音消息。先用 ffmpeg 转为 opus 格式,再上传到飞书,最后发送 audio 消息。适用于用户想要在飞书中收到可播放的语音消息的场景。
Replace video audio with TTS voice while preserving original timing. Includes subtitle generation from video using Whisp
给 AI Agent 一键装上全网采集能力。基于 Agent Reach,支持 Twitter/X、Reddit、YouTube、B站、小红书、抖音、GitHub、LinkedIn、Boss直聘、RSS、全网搜索等平台。一条命令安装,零 A
Integration scripts for the creative agent swarm managed by overstory (Claude Code). Use when configuring or running res
微信读书书籍导出工具 - 支持 EPUB、PDF、MOBI、TXT、Markdown 格式
Find fresh, viral Xiaohongshu videos by niche with filters for type, date, and popularity, and extract URLs with valid x
Create or delete Zoom meetings via Server-to-Server OAuth. Use this skill when the user asks to: schedule a Zoom call, c
Use for AI music generation via IMA Open API. Supports text_to_music with 3 models. IMPORTANT — Default model selection
视频一站式工作流技能包。整合视频剪辑、转写、烧录、拼接全流程,支持分步执行和用户确认。 包含:(1) auto-editor - 视频剪辑去除静音片段;(2) Faster Whisper + MiniMax LLM - 语音转字幕; (3
Give your agent eyes — capture screenshots, voice, and annotations from any screen, monitor, or device via MCP.
Download videos and get transcripts, summaries, or metadata from YouTube, TikTok, Instagram, and X (Twitter). Use when t
Convert recipe text (pasted text, video transcript, image description, or any raw content) into a .paprikarecipes file t
Extract structured data from Zight share links (a.cl.ly and share.zight.com), including title, stream URLs, AI smart sum
Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text
Translates Excel files (.xlsx) from English to Chinese while preserving all formatting, images, and charts. Use for any
Render PDF pages to images, extract embedded images, annotate PDFs, and perform advanced PDF inspection using pymupdf (f
Extract text, metadata, and pages from PDF files using pypdf. Use for tasks such as reading PDF content, extracting spec
调用MiniMax语音合成API,支持中文多音色、高质量文本转语音,提供流式和非流式音频输出。
Generate an image of a whimsical, fairytale-inspired floral · arch with pastel-colored flowers and delicate ivy.
Download videos, audio, subtitles, and covers from Bilibili using bilibili-api. Use when working with Bilibili content f
Automate Instagram growth with a multi-agent workflow. Researches trending content across Instagram Explore, TikTok, Hac
Automate X/Twitter personal brand growth with a multi-agent workflow. Researches trending content across X, Hacker News,
Automate X/Twitter personal brand growth with a multi-agent workflow. Researches trending content across X, Hacker News,
Download online videos with quality and format controls using yt-dlp for reliable local saves.
Developer skill for running Hummingbot and Gateway from source, building wheel and Docker images, and testing against Hu
Browser automation via Playwright CLI. Open pages, interact with elements, take screenshots, and more. Ideal for coding
Download videos from Xiaohongshu (小红书) pages. Use when the user wants to save or download a video from a xiaohongshu.com
Generate AI music videos from any MCP client. Turn text prompts into cinematic music videos with multiple styles and mod
Create AI-powered Spotify playlists and discover music via Playlistable MCP. Use when the user wants to generate playlis
High-performance image processing with libvips. Use for resizing, converting, watermarking, thumbnails, and batch image
Local Qwen3-TTS speech synthesis on Apple Silicon via MLX. Use for offline narration, audiobooks, video voiceovers, and
Remove visible Gemini AI watermarks from images via reverse alpha blending. Use for cleaning Gemini-generated images, re
Create Hummingbot-branded PDF slides from markdown with Mermaid diagram support. Use for presentations, decks, and techn
Generates detailed AI-powered audit reports on website, SEO, ads, social media, reviews, tech stack, and competitors to
Plan, draft, and revise complete books with chapter architecture, voice consistency, and finish-ready revision workflows