Media AI Skills - 3,611 Tools

Linux Desktop Control

Linux desktop automation and control. Use when: (1) taking screenshots of the screen or windows, (2) controlling mouse a

by clawhub · community · Quality: medium

Granola

Granola MCP server integration with managed OAuth. Query meeting notes, list meetings, and access transcripts. Use this

by clawhub · community · Quality: medium

AI 火宝

Generate images from text prompts or transform existing images using AI with configurable count, watermark, and API key

by kongweigen · community · Quality: medium

Filtrix Image Generation

Generate images using AI providers (OpenAI gpt-image-1, Google Gemini, fal.ai). Use when the user asks to create, genera

by lumenclaw-cloud · community · Quality: medium

Half Full

Half Full (半饱): where life is at its best. A mindful eating companion for desk workers. Track meals with photos, understand your body's needs, no gym

by clawhub · community · Quality: medium

View Daily Trending Altcoins

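Generate a cryptocurrency morning-briefing PDF covering industry news, FDV rankings, trending sectors, and risk warnings. Data is sourced from the CoinGecko API.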

by clawhub · community · Quality: medium

Document Pro

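Document processing skill: lets the AI read, parse, and extract key information from PDF, DOCX, PPT, and other documents. Triggered when the user asks to analyze a document, extract content, or summarize a report.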

by Jackeven02 · community · Quality: medium

Video Understanding

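Video understanding and analysis: lets the AI understand video content and extract key information. Triggered when the user asks to analyze a video, understand its content, summarize it, or extract its key points.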

by clawhub · community · Quality: medium

Zoom + Google Calendar

Create Zoom meetings and add them to Google Calendar events with proper conferenceData (icon, video entry, notes). Use w

by clawhub · community · Quality: medium

AI Trend Hunter Pro

A high-performance automation agent that turns global trends into viral social media posts for X (Twitter), Xiaohongshu,

by traprapitalianazional-dev · community · Quality: medium

SEO Video Transcript

Generate SEO-optimized video transcripts with automatic interlinking to website content.

by srezzano-litebox · community · Quality: medium

Qwen3-TTS VoiceDesign

Text-to-speech with Qwen3-TTS VoiceDesign. Design custom voices via natural language descriptions + seed-based timbre fi

by clawhub · community · Quality: medium

MH summarize

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/

by clawhub · community · Quality: medium

Gemini Reader

Understand local non-text files (PDF, video, audio) using Gemini API. Use when the user asks to read, summarize, or anal

by clawhub · community · Quality: medium

Clawbsky

Advanced Bluesky CLI with support for media (images/video), thread creation, and automated growth tools like non-mutual

by jyothish12345 · community · Quality: medium

Book Fetch

Download ebooks (epub/pdf) from Anna's Archive and upload them to MEGA automatically. Use when the user asks to download

by zerone0x · community · Quality: medium

SoulKeeper

Ensures AI agents maintain consistent identity by auditing soul rules, detecting behavioral drift in transcripts, and in

by cassh100k · community · Quality: medium

Visual Concept

Transform technical insights into visual concept guides — symbolic imagery, color arcs, and creative direction for video

by clawhub · community · Quality: medium

Phone Caller

Make AI-powered outbound phone calls using ElevenLabs voice + GPT brain + Twilio. Supports one-way pre-recorded messages

by omerflo · community · Quality: medium

Clawfinger Agent Takeover

How to perform a live agent takeover of the Clawfinger voice gateway — dial, inject greetings, handle turns, release, an

by clawhub · community · Quality: medium

Video Production

Complete A/B video pipeline — storyboard, Veo 3 batch generation, browser preview with feedback loop, and ffmpeg assembl

by omerflo · community · Quality: medium

Social Media Ops

Set up a complete multi-brand social media management team on OpenClaw. Scaffolds 7 specialized AI agents (Leader, Resea

by clawhub · community · Quality: medium

Jetson CUDA Voice Pipeline

Fully offline, CUDA-accelerated local voice assistant pipeline for NVIDIA Jetson. Wake word (openWakeWord) → real-time V

by nikil511 · community · Quality: medium

AdaptlyPost

Schedule and manage social media posts across Instagram, X (Twitter), Bluesky, TikTok, Threads, LinkedIn, Facebook, Pint

by clawhub · community · Quality: medium

YouTube Video Summarizer (Xiaolongxia)

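Summarize YouTube videos: automatically fetches video information, searches for related coverage, and generates a detailed, structured summary. Supports Chinese and English output.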

by clawhub · community · Quality: medium

x-recap

Monitor and recap official X (Twitter) updates using actionbook-rs screenshots. Use when the user asks to track/recap X

by jack4world · community · Quality: medium

Social Media Data Extractor

Extract public posts, comments, and profiles from Instagram, TikTok, and Reddit via Apify for trend analysis and audienc

by clawhub · community · Quality: medium

Inspiration

Find design and AI art inspiration from curated galleries, screenshot libraries, and creative showcases.

by clawhub · community · Quality: medium

MH openai-whisper

Local speech-to-text with the Whisper CLI (no API key).

by mohdalhashemi98-hue · community · Quality: medium

Capture Website

Capture website screenshots from the command line. Use when user wants to take screenshots of any URL (Twitter, news sit

by clawhub · community · Quality: medium

Image Enhancer

Improves the quality of images, especially screenshots, by enhancing resolution, sharpness, and clarity. Perfect for pre

by wells1137 · community · Quality: medium

YouTube Video Downloader

Download YouTube videos with customizable quality and format options. Use this skill when the user asks to download, sav

by wells1137 · community · Quality: medium

Local Whisper

Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Use when enabling inbound voice-note tra

by TrueNight · community · Quality: medium

MH openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

by mohdalhashemi98-hue · community · Quality: medium

Perstudio Image and Video Generation

Generate AI images and videos. Use when user asks to create images, photos, product shots, portraits, stickers, videos,

by montenegronyc · community · Quality: medium

XY PubMed PDF Downloader

Download PDFs from PubMed Central (PMC) and Europe PMC. Use when the user needs to download open-access academic papers

by xuyuan0805 · community · Quality: medium

Postavel

Connect to Postavel social media management platform via MCP (Model Context Protocol). Create, schedule, and manage soci

by clawhub · community · Quality: medium

Drip director

Deterministic pipeline for streetwear and fashion images that captures user intent, enforces constraints, generates with

by clawhub · community · Quality: medium

Smart Router

Intelligent multi-model router — automatically selects the best AI model based on task type (vision, image generation, v

by samstone908 · community · Quality: medium

Music Generator

Generates music from a structured Composition Plan. Use this skill to execute music generation after a prompt or plan ha

by clawhub · community · Quality: medium

Botbook — Meet Friends on the AI Agent Social Network

Join Botbook.space — the social network built for AI agents. Create a profile, post updates with hashtags and images, fo

by clawhub · community · Quality: medium

Deepgram ASR / Deepgram Speech Transcription

Transcribe audio via Deepgram Nova-3 API. Fast, accurate, and cost-effective speech-to-text for 50+ languages. Transcrip

by clawhub · community · Quality: medium

Image Gen

Generate images using multiple AI models — Midjourney (via TTAPI), Flux, SDXL, Nano Banana (Gemini), and more via fal.ai

by clawhub · community · Quality: medium

ElevenLabs MCP Server

Provides a full suite of ElevenLabs audio tools (TTS, SFX, Music, etc.) via a standard MCP server. This skill starts the

by clawhub · community · Quality: medium

Audio Conductor

Intelligently dispatches requests to the appropriate audio generation model (Music, Sound Effects, or TTS). Use this as

by clawhub · community · Quality: medium

Doubao ASR / Doubao Speech Transcription

Transcribe audio via Doubao (豆包) Seed-ASR 2.0 API (ByteDance/Volcengine). Best-in-class Chinese speech recognition. Via Doubao

by clawhub · community · Quality: medium

AudioMind

Generate speech, sound effects, and music from natural language prompts using a unified, intelligent audio toolkit with

by wells1137 · community · Quality: medium

Seedance Video Generation Extension

Orchestrate script-to-final-video production with a strict stage-gated workflow (outline → episode_plan → storyboard → s

by KKenny0 · community · Quality: medium

Video Generator

AI video production workflow using Remotion. Use when creating videos, short films, commercials, or motion graphics. Tri

by clawhub · community · Quality: medium

Agentfolio

Discover, register, and verify autonomous AI agents in the AgentFolio registry with screenshot-proof and unique badges.

by clawhub · community · Quality: medium