Media AI Skills

xAI Image Generator

Generate images from text prompts using xAI's Grok API with options for format, batch size, and automatic media attachme

by clawhub · community · Quality: medium

ai notes of video

Generate AI-powered notes from videos (document, outline, or graphic-text formats)

by clawhub · community · Quality: medium

Supercall

Make AI-powered phone calls with custom personas and goals. Uses OpenAI Realtime API + Twilio for ultra-low latency voic

by clawhub · community · Quality: medium

nas-movie-download

Search and download movies via Jackett and qBittorrent. Use when user wants to download movies or videos from torrent so

by clawhub · community · Quality: medium

milady

Embody and create content in the Network Spirituality aesthetic — the Remilia/Milady cultural movement blending Y2K net

by clawhub · community · Quality: medium

Phone Voice Integration

Connect ElevenLabs Agents to your OpenClaw via phone with Twilio. Includes caller ID auth, voice PIN security, call scre

by clawhub · community · Quality: medium

slides-cog

Great slides need two things: content worth presenting and design worth looking at. #1 on DeepResearch Bench (Feb 2026)

by clawhub · community · Quality: medium

audio-cog

AI audio generation powered by CellCog. Text-to-speech, voice synthesis, voiceovers, podcast audio, narration, music gen

by clawhub · community · Quality: medium

Clawdvine

Short-form video for AI agents. Generate videos using the latest models, pay with USDC via x402.

by clawhub · community · Quality: medium

Image Ocr

Extract text from images using Tesseract OCR

by clawhub · community · Quality: medium

Pdf Extract

Extract text from PDF files for LLM processing

by clawhub · community · Quality: medium

YouTube Apify Transcript

Fetch YouTube transcripts via APIFY API. Works from cloud IPs (Hetzner, AWS, etc.) by bypassing YouTube's bot detection.

by clawhub · community · Quality: medium

HeyGen AI Avatar Video (Lite)

Create AI digital human videos with HeyGen API. Free starter guide.

by clawhub · community · Quality: medium

spongo

Terminal Spotify playback/search via spogo (preferred) or spotify_player.

by clawhub · community · Quality: medium

Ms Qwen Vl

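Calls the ModelScope (魔搭社区) Qwen3-VL multimodal API for visual analysis. Uses an OpenAI SDK-compatible interface and supports image description, OCR text extraction, visual question answering, object detection, and more. When the user mentions "魔搭", "M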

by clawhub · community · Quality: medium

Addis Assistant

Provides Speech-to-Text (STT) and text Translation using the Addis Assistant API (api.addisassistant.com). Use when the

by clawhub · community · Quality: medium

4claw

4claw — a moderated imageboard for AI agents. Boards, threads, replies, media uploads, bumping (bump=false to not bump),

by clawhub · community · Quality: medium

QMD CLI

Search and retrieve markdown documents from local knowledge bases using qmd. Supports BM25 keyword search, vector semant

by clawhub · community · Quality: medium

PARA Second Brain

Organize your agent's knowledge using PARA (Projects, Areas, Resources, Archive) — then make it ALL searchable. The syml

by clawhub · community · Quality: medium

Yt Dlp Downloader

Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a

by clawhub · community · Quality: medium

Jellyseerr

Request movies and TV shows through Jellyseerr. Use when the user wants to add media to their Plex/Jellyfin server, sear

by clawhub · community · Quality: medium

Mlx Whisper

Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).

by clawhub · community · Quality: medium

Spotify

Control Spotify playback on macOS. Play/pause, skip tracks, control volume, play artists/albums/playlists. Use when a us

by clawhub · community · Quality: medium

Video Transcript Downloader

Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported sit

by clawhub · community · Quality: medium

Qwen Video (Wan)

Generate videos using Alibaba Cloud DashScope Wan (通义万相) text-to-video (t2v) API (e.g., wan2.6-t2v). Use when the user a

by clawhub · community · Quality: medium

Social Media Scheduler

Plan, draft, and organize social media content across platforms. Create content calendars, write platform-optimized post

by clawhub · community · Quality: medium

ADB Connection

Control Android devices via ADB with support for UI layout analysis (uiautomator) and visual feedback (screencap). Use w

by clawhub · community · Quality: medium

Agent Browser

Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs

by clawhub · community · Quality: medium

Technical Analyst

This skill should be used when analyzing weekly price charts for stocks, stock indices, cryptocurrencies, or forex pairs

by clawhub · community · Quality: medium

Word Automation

Automate common Word/WPS document operations on Windows via COM (read text, replace, insert, headings, headers/footers,

by clawhub · community · Quality: medium

Veo 3 Video Gen (Gemini API)

Generate and stitch short videos via Google Veo 3.x using the Gemini API (google-genai). Use when you need to create vid

by clawhub · community · Quality: medium

notebooklm-cli

Command-line interface to manage Google NotebookLM notebooks, sources, and generate audio, quizzes, reports, presentatio

by clawhub · community · Quality: medium

Voicenotes

Sync and access voice notes from Voicenotes.com. Use when the user wants to retrieve their voice recordings, transcripts

by clawhub · community · Quality: medium

Adobe Automator

Automate Photoshop, Illustrator, InDesign, Premiere Pro, and After Effects using ExtendScript (ES3) scripts executed via

by clawhub · community · Quality: medium

Figma

Interact with Figma files to read structure, export layers as images, and retrieve comments using the Figma REST API wit

by clawhub · community · Quality: medium

Jellyfin Control

Control Jellyfin media server. Search content, resume playback on remote devices (TVs), and manage sessions. Smart "

by clawhub · community · Quality: medium

Xiaohongshu (小红书) Auto-Publishing Assistant

Draft and publish posts to 小红书 (Xiaohongshu/RED). Use when creating content for 小红书, drafting posts, generating cover im

by clawhub · community · Quality: medium

ReefGram

Autonomous social network transceiver for machines and agents. Allows transmission of hardware telemetry and creative me

by clawhub · community · Quality: medium

Image

Process, optimize, and manage images with web optimization, color management, platform specs, and e-commerce standards.

by clawhub · community · Quality: medium

Vapi AI

Manage Vapi voice assistants, calls, phone numbers, tools, and webhooks via the Vapi REST API or CLI for voice agent ope

by clawhub · community · Quality: medium

Agent Touch Layer

Mobile browser and native app automation via ATL (iOS Simulator). Navigate, click, screenshot, and automate web and nati

by clawhub · community · Quality: medium

insta-cog

Full video production from a single prompt. Script, shoot, stitch, score — automatically. 30s to 4-minute Instagram Reel

by clawhub · community · Quality: medium

Postproxy

Call PostProxy API to create and manage social media posts

by clawhub · community · Quality: medium

Ultimate Flashcards / Podcasts Tutor

Manage flashcards, generate AI-based cards, create audio podcasts, and track study progress using EchoDecks API integrat

by clawhub · community · Quality: medium

When the user wants help creating, scheduling, or optimizing social media content for LinkedIn, Twitter/X, Instagram, Ti

by clawhub · community · Quality: medium

NadFun Token Creation

ImageRouter

AI Meeting Notes w/ Action Items + To-Do List Tracker

Video Agent

Moltspaces