Media AI Skills - 3,611 Tools

acestep-songwriting

Music songwriting guide for ACE-Step. Provides professional knowledge on writing captions, lyrics, choosing BPM/key/dura

by clawhub · community · Quality: medium

MLX TTS

Local text-to-speech with MLX (Apple Silicon) and open-source models (default: Qwen3-TTS).

by clawhub · community · Quality: medium

X Voice Match

Analyze a Twitter/X account's posting style and generate authentic posts that match their voice. Use when the user wants

by clawhub · community · Quality: medium

Screenshots

Create professional App Store and Google Play screenshots with automatic sizing, device frames, marketing copy, and iter

by clawhub · community · Quality: medium

2026 02 10 Clawhub Summarize 1.0.0

Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).

by clawhub · community · Quality: medium

music-cog

Original music, fully yours. 5 seconds to 10 minutes using frontier music generation models. Instrumental and vocal trac

by clawhub · community · Quality: medium

Bria Ai API Client

Use when generating visual assets with Bria.ai - product photos, hero images, icons, backgrounds. Includes batch generat

by clawhub · community · Quality: medium

ElevenLabs

Text-to-speech, sound effects, music generation, voice management, and quota checks via the ElevenLabs API. Use when gen

by clawhub · community · Quality: medium

Thermostat

Adjust temperatures, diagnose comfort issues, calculate energy savings, and automate schedules through voice commands or

by clawhub · community · Quality: medium

YouTube Watcher

Fetch and read transcripts from YouTube videos. Use when you need to summarize a video, answer questions about its conte

by clawhub · community · Quality: medium

Agent Voice – CLI Blogging for AI

Command-line blogging platform for AI agents. Register, verify, and publish markdown posts to AI Agent Blogs (www.eggbrt

by clawhub · community · Quality: medium

YouTube Channels

Work with YouTube channels — resolve handles to IDs, browse uploads, get latest videos, search within channels. Use when

by clawhub · community · Quality: medium

Browser History

Search and retrieve URLs, titles, and visit counts from Das's Chrome browsing history, including recent visits and YouTu

by clawhub · community · Quality: medium

LAN Media Server

Share images, screenshots, and files from the AI workspace to users on the local network via HTTP. Use when the agent ne

by clawhub · community · Quality: medium

Million Bit Homepage NFTs

Mint an image as an NFT plot on the Million Bit Homepage, a permanent 1024x1024 pixel canvas on the Base blockchain. Use

by clawhub · community · Quality: medium

reMarkable Tablet Sync

Bidirectional sync with reMarkable tablet via Cloud API (rmapi). Fetch handwritten notes/sketches, process with AI, and

by clawhub · community · Quality: medium

ClawdVine

Short-form video for AI agents. Generate videos using the latest models, pay with USDC via x402.

by clawhub · community · Quality: medium

PDF Reader (Iyeque)

Extract text, search content, summarize, and retrieve metadata from PDF files using PyMuPDF and PyPDF2 libraries.

by clawhub · community · Quality: medium

Voice Assistant

Windows voice companion for OpenClaw. Custom wake word via Porcupine, local STT via faster-whisper, streamed responses o

by clawhub · community · Quality: medium

Anima

Anima Avatar - Interactive Video Generation Engine. Generates 16:9 videos with dynamic character sprites (Shutiao), sync

by clawhub · community · Quality: medium

Pocket TTS Complete Documentation

Generate speech from text using Kyutai Pocket TTS - lightweight, CPU-friendly, streaming TTS with voice cloning. English

by clawhub · community · Quality: medium

ClawCall

Run AI-powered outbound phone calls with Telnyx + Deepgram Voice Agent. Use when the user wants real phone outreach (fol

by clawhub · community · Quality: medium

Alicloud AI Audio TTS

Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash)

by clawhub · community · Quality: medium

2026 02 10 Clawhub Clawvault 1.5.1

Structured memory system for OpenClaw agents. Context death resilience (checkpoint/recover), structured storage, Obsidia

by clawhub · community · Quality: medium

VoiceMonkey

Control Alexa devices via VoiceMonkey API v2 - make announcements, trigger routines, start flows, and display media.

by clawhub · community · Quality: medium

Didit Age Estimation

Estimates a person's age from a facial image with passive liveness check for age gating and verification, supporting con

by clawhub · community · Quality: medium

Screenshot Capture

Process screenshots Enzo shares with comments. Save to reference library, extract content, categorize, set reminders, an

by clawhub · community · Quality: medium

Local Voice (FluidAudio TTS/STT)

Local text-to-speech (TTS) and speech-to-text (STT) using FluidAudio on Apple Silicon. Sub-second voice synthesis and tr

by clawhub · community · Quality: medium

Cloudflare R2

Upload files to Cloudflare R2 storage using wrangler CLI. Use when needing to upload images, videos, or files to R2 for

by clawhub · community · Quality: medium

Meta Video Ad Analyzer

Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio

by clawhub · community · Quality: medium

MLX Audio Server

Local 24/7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.

by clawhub · community · Quality: medium

MoltMoon Crypto Launcher

Complete OpenClaw-ready operating skill for @moltmoon/sdk V2. Use when an agent needs to install, configure, and operate

by clawhub · community · Quality: medium

Visla AI Video Creation

Creates AI-generated videos from text scripts, URLs, or PPT/PDF documents using Visla. Use when the user asks to generat

by clawhub · community · Quality: medium

Vector Control

Control a Vector robot via Wirepod’s local HTTP API on the same network. Use when you need to move Vector, tilt head/lif

by clawhub · community · Quality: medium

SAA Agent

Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend via comma

by clawhub · community · Quality: medium

Moltagram

The visual social network for AI agents. See images, generate images, share visual content.

by clawhub · community · Quality: medium

Nano Banana Pro OpenRouter

Generate images with Nano Banana Pro via OpenRouter. Use when the user asks for image generation, mentions Nano Banana P

by clawhub · community · Quality: medium

Boof

Convert PDFs and documents to markdown, index them locally for RAG retrieval, and analyze them token-efficiently. Use wh

by clawhub · community · Quality: medium

Weibo Manager

Manage Weibo posts via Puppeteer with a secure request-approve-execute workflow for drafting, reviewing, and publishing

by clawhub · community · Quality: medium

Typefully

Create, schedule, and manage social media posts via Typefully. ALWAYS use this skill when asked to draft, schedule, post

by clawhub · community · Quality: medium

Starling Home Hub (Nest/Google Home)

Controls Nest and Google Home smart home devices via the Starling Home Hub's local REST API. Supports thermostats, camer

by clawhub · community · Quality: medium

pod-cog

A great podcast needs three things: compelling content, natural-sounding voices, and polished production. CellCog delive

by clawhub · community · Quality: medium

Linux GUI Control

Control the Linux desktop GUI using xdotool, wmctrl, and dogtail. Use when you need to interact with non-browser applica

by clawhub · community · Quality: medium

Recipe to List

Turn recipes into a Todoist Shopping list. Extract ingredients from recipe photos (Gemini Flash vision) or recipe web pa

by clawhub · community · Quality: medium

Product

Build, visualize, and launch products using strategy frameworks, AI imagery tools, and marketplace-specific optimization

by clawhub · community · Quality: medium

pdf-parser-mineru

PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable f

by clawhub · community · Quality: medium

Eachlabs Voice Audio

Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS

by clawhub · community · Quality: medium

Eachlabs Image Edit

Edit, transform, upscale, and enhance images using EachLabs AI models. Supports image editing, style transfer, backgroun

by clawhub · community · Quality: medium

qwenz-image-gen

Generate images using Alibaba Cloud Bailian Qwen-Image and Z-Image models (Tongyi text-to-image + portrait photo model)

by clawhub · community · Quality: medium

Meeting To Action

Convert meeting notes or transcripts into clear summaries, decisions, and action items with owners and due dates. Use wh

by clawhub · community · Quality: medium