Media AI Skills - 3,611 Tools

CDP Browser

CDP browser control at localhost:9222. Use when you need to inspect tabs, take screenshots, navigate, scroll, post to X,

by clawhub · community · Quality: medium

Lead Magnet Creator

Create lead magnets (PDF guides, checklists, templates) with matching landing pages. Generate qualified leads for your b

by clawhub · community · Quality: medium

Twitter/X Reader

Extract complete data from X/Twitter tweets by URL, including text, author info, timestamps, engagement stats, media, qu

by clawhub · community · Quality: medium

German Content Writer

Professionelle deutsche Inhalte erstellen - Blog-Artikel, Landing Pages, E-Mails, Social Media. Perfekt für den DACH-Mar

by clawhub · community · Quality: medium

PC Master

Control the Windows PC from WSL2. Use when the user asks to open/close applications, manage processes, take screenshots,

by Amirgu · community · Quality: medium

Color Palette Generator

Extract a color palette from an image and return HEX/RGB values with optional swatch image.

by QROST · community · Quality: medium

小红书视频下载器

Download and summarize Xiaohongshu (小红书/RedNote) videos. Produces a full resource pack with video, audio, subtitles, tra

by clawhub · community · Quality: medium

Pdfreader

Extract text and metadata from PDF files using PyMuPDF, supporting large files and outputting results in JSON format.

by clawhub · community · Quality: medium

Togetherai Tts

Convert text to speech using the TogetherAI API with the MiniMax speech-2.6-turbo model and save audio in mp3 format.

by marcus20232023 · community · Quality: medium

Md2pdf

Markdown 转 PDF 技能。将 Markdown 文件转换为精美的 PDF 文档，完美支持中文、代码高亮、自定义样式。

by franklu0819-lang · community · Quality: medium

小野语音系统

基于双引擎策略的智能语音系统，支持中文本地Tingting语音和多语言Edge-TTS云端，输出高质量OGG音频，保障隐私。

by jackytianjp · community · Quality: medium

md2pdf-xelatex

Convert Markdown files to PDF with full LaTeX math formula rendering and CJK (Chinese/Japanese/Korean) support. Use when

by clawhub · community · Quality: medium

Pine Voice

Give your agent a real phone. It dials, waits on hold, negotiates your bills, and returns a full transcript.

by bojieli · community · Quality: medium

Faster Whisper Gpu

High-performance local speech-to-text transcription using Faster Whisper with NVIDIA GPU acceleration. Transcribe audio

by clawhub · community · Quality: medium

Pollinations Image Generator

Generate detailed images from text prompts using Pollinations.ai models with optional configuration, model selection, an

by Aprilox · community · Quality: medium

2nd Brain

Personal knowledge base for capturing and retrieving information about people, places, restaurants, games, tech, events,

by coderaven · community · Quality: medium

Mistral PDF OCR

Extracts text, tables, and images from PDFs (including scanned PDFs) using the Mistral OCR API. Use when user asks to OC

by clawhub · community · Quality: medium

Screenshot Skill

Capture screenshots on Windows using mss and Pillow. Provides full-screen, region, and multi-monitor capture with output

by sunrddd-a11y · community · Quality: medium

Ellya--Your Virtual Companion

OpenClaw virtual companion skill. Use it to bootstrap runtime files (SOUL and base image), guide user personalization, l

by clawhub · community · Quality: medium

Snipgrapher - generate images from code snippets

Configure and use snipgrapher to generate polished code snippet images

by clawhub · community · Quality: medium

RoomSound

RoomSound gives your agent the skill to play audio to your speakers. Starting with YouTube to Bluetooth speakers, expand

by icecat2005 · community · Quality: medium

Miranda SAG (ElevenLabs TTS say-UX)

ElevenLabs text-to-speech with mac-style say UX.

by jeffpignataro · community · Quality: medium

Zhipu AI TTS

Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Use when you need to convert text to audio files with

by franklu0819-lang · community · Quality: medium

Feishu Voice

飞书语音消息发送技能。将文本转换为语音并发送到飞书，支持 TTS 生成、格式转换、时长读取、文件上传和消息发送。

by clawhub · community · Quality: medium

Zhipu AI ASR

Automatic Speech Recognition (ASR) using Zhipu AI (BigModel) GLM-ASR model. Use when you need to transcribe audio files

by clawhub · community · Quality: medium

Csam Shield

Detects, blocks, and reports Child Sexual Abuse Material using AI-driven image, video, and behavior analysis with automa

by raghulpasupathi · community · Quality: medium

Image Generate

使用内置 image_generate.py 脚本生成图片, 准备清晰具体的 `prompt`。

by clawhub · community · Quality: medium

Terrain Route Video

Generate a minimalist terrain-style animated driving route video (MP4) from a list of stops (cities/POIs) without Remoti

by clawhub · community · Quality: medium

Md2pdf WeasyPrint

Markdown 转 PDF 技能。将 Markdown 文件转换为精美的 PDF 文档，完美支持中文、代码高亮、自定义样式。

by franklu0819-lang · community · Quality: medium

Image Detection

Analyzes images to detect AI generation, extract metadata, identify artifacts, and perform content moderation using loca

by clawhub · community · Quality: medium

gemini video analyze

Send a public video URL directly to a Google Gemini model for analysis. Use when Codex must summarize a video, answer qu

by tokyo-s · community · Quality: medium

Index Youtube

Index YouTube channel videos and transcripts for semantic search. Use when user says "index YouTube", "ad

by clawhub · community · Quality: medium

Wip X

X Platform API. Read posts, search tweets, post, upload media.

by clawhub · community · Quality: medium

Wip Grok

xAI Grok API. Search the web, search X, generate images, generate video.

by clawhub · community · Quality: medium

Kiro Search Aggregator

Multi-source search skill for Kiro on OpenClaw. Aggregate and rank results from Google, Google Scholar, YouTube, and X,

by clawhub · community · Quality: medium

Faster Whisper Transcription

Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on aud

by Kalmuraee · community · Quality: medium

Scanwow Sync

Sync your OpenClaw agent with the ScanWow iOS app. Start an HTTP webhook to receive high-quality OCR scans directly from

by BrandonS7 · community · Quality: medium

Bilibili Player

B 站视频播放器。用 Playwright 搜索 B 站视频并获取准确链接，然后用 open 命令在当前浏览器打开播放。Use when users request to play Bilibili videos or search for

by e421083458 · community · Quality: medium

Jarvis Tts

Jarvis TTS text-to-speech using Microsoft edge-tts with afplay playback. Use when users request voice output, audio resp

by clawhub · community · Quality: medium

Video Promo

Generate promo video plan with 30-45s script, shot-by-shot storyboard, and optional Remotion/Montage-tool config. Use wh

by fortunto2 · community · Quality: medium

Tarot Content

Generate tarot × astrology content for social media — weekly horoscope scripts, tarot spreads, video scripts, and cover

by clawhub · community · Quality: medium

Brand Voice Profile

Define and store your brand voice profile for consistent content generation. Captures writing style, vocabulary patterns

by clawhub · community · Quality: medium

Recover deleted YouTube videos

Searches multiple online archives to find and recover deleted YouTube videos, metadata, and comments using a video ID.

by clawhub · community · Quality: medium

Strudel Music

Compose, play, and render music using Strudel live-coding patterns. Use when composing music programmatically, generatin

by clawhub · community · Quality: medium

API Credit Health Bar Lite

Display API credit balances for 5 core providers (Anthropic, OpenAI, OpenRouter, Mistral, Groq) with video game style he

by clawhub · community · Quality: medium

Research Report Generator

Research technical projects/papers and generate comprehensive reports with PDF export. Modes: lite (analysis + writing)

by clawhub · community · Quality: medium

AnveVoice

Add AI voice assistants to your website. Engage visitors with natural voice conversations, capture leads, automate suppo

by ANVEAI · community · Quality: medium

BRICKS CLI

Manage BRICKS workspace devices, groups, apps, modules, media, and projects via CLI for control, monitoring, updates, an

by clawhub · community · Quality: medium

ClawTime Setup

Install, configure, start, and troubleshoot ClawTime — a private self-hosted webchat UI for OpenClaw with passkey (Face

by bewareofddog · community · Quality: medium

Azure Document OCR

Extract text and structured data from documents using Azure Document Intelligence (formerly Form Recognizer). Supports O

by Li-Hongmin · community · Quality: medium