Media AI Skills - 3,611 Tools

Youtube Knowledge Extractor

Multimodal YouTube video analysis through both audio (transcript) and visual (frame extraction + image analysis) channel

by clawhub · community · Quality: medium

ClawShot - The Visual Layer for AI Agents

Instagram for AI agents. Build your following, grow your influence. Share screenshots, get likes & comments, engage

by clawhub · community · Quality: medium

Transcribe audio files via OpenRouter using audio-capable models

Transcribe audio files via OpenRouter using audio-capable models (Gemini, GPT-4o-audio, etc.).

by clawhub · community · Quality: medium

Free Groq Voice Recognition

FREE voice recognition using Groq's complimentary Whisper API. Transcribe audio messages to text in 50+ languages at no

by clawhub · community · Quality: medium

Video Understanding

Analyze and summarize videos from 1000+ sites using Google Gemini AI, providing transcripts, descriptions, summaries, an

by clawhub · community · Quality: medium

When the user wants help creating, scheduling, or optimizing social media content for LinkedIn, Twitter/X, Instagram, Ti

by clawhub · community · Quality: medium

Nutrient Document Processing

Document processing for OpenClaw — convert, extract, OCR, redact, sign, and watermark PDFs and Office documents using th

by clawhub · community · Quality: medium

Image2Prompt

Analyze images and generate detailed prompts for image generation. Supports portrait, landscape, product, animal, illust

by clawhub · community · Quality: medium

Nimrobo

Use the Nimrobo CLI for voice screening and matching network operations.

by clawhub · community · Quality: medium

RoughCut

Run RoughCut headlessly on macOS to generate Final Cut Pro (FCPXML) rough-cut timeline variants from a talking-head vide

by clawhub · community · Quality: medium

Ai Music Video

Generate AI music videos end-to-end. Creates music with Suno (sunoapi.org), generates visuals with OpenAI/Seedream/Googl

by clawhub · community · Quality: medium

Immich API Connector

Connect to self-hosted Immich instances to manage photos, albums, users, search media, upload/download files, and handle

by clawhub · community · Quality: medium

PLEX-CTL

Command-line tool for searching, playing, and controlling Plex Media Server and clients via the Plex API on your local n

by clawhub · community · Quality: medium

network spirituality

Embody and create content in the Network Spirituality aesthetic — the Remilia/Milady cultural movement blending Y2K net

by clawhub · community · Quality: medium

Trace To Svg

Trace bitmap images (PNG/JPG/WebP) into clean SVG paths using potrace/mkbitmap. Use to convert logos/silhouettes into ve

by clawhub · community · Quality: medium

Request media on Overseerr

Request movies or TV shows on Overseerr by title and optional season, checking availability before forwarding the reques

by clawhub · community · Quality: medium

YouTube Music

Manage YouTube Music library, playlists, and discovery via ytmusicapi.

by clawhub · community · Quality: medium

Google Home/Nest

Control Google Nest thermostats, cameras, and doorbells via Google Smart Device Management API using curl and jq command

by clawhub · community · Quality: medium

Journal to Post

Convert personal journal entries into shareable social media posts

by clawhub · community · Quality: medium

De-AI-ify

Remove AI-generated jargon and restore human voice to text

by clawhub · community · Quality: medium

Ad Context Protocol (AdCP) Advertising

Automate advertising campaigns with AI. Create ads, buy media, manage ad budgets, discover ad inventory, run display ads

by clawhub · community · Quality: medium

Google Gemini Media

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-t

by clawhub · community · Quality: medium

AI media generation - Flux2pro, Google Veo3.1, Suno Ai..

AI image, video, and music generation + editing via VAP API. Flux, Veo 3.1, Suno V5.

by clawhub · community · Quality: medium

Youtube Editor

Automate YouTube video editing: download videos, transcribe with Whisper, analyze content using GPT-4, and create Korean

by clawhub · community · Quality: medium

Masonry: generate images and video with models across providers

AI-powered image and video generation using the Masonry CLI. Generate images, videos, check job status, and manage media

by clawhub · community · Quality: medium

ClawVoice

Connects to a live voice session, receiving and sending messages in real time via a WebSocket interface using the bundle

by clawhub · community · Quality: medium

Moltspaces

Join audio room spaces to talk and hang out with other agents and users on Moltspaces.

by clawhub · community · Quality: medium

YouTube Summarize

Use when you need to summarize YouTube videos, extract transcripts, get video information, or analyze video content from

by clawhub · community · Quality: medium

md-2-pdf

Convert markdown files to clean, formatted PDFs using reportlab

by clawhub · community · Quality: medium

manipulation-detector

Analyze text for manipulation patterns (urgency, false authority, social proof, FUD, grandiosity, dominance assertions,

by clawhub · community · Quality: medium

Veo

Generate video using Google Veo (Veo 3.1 / Veo 3.0).

by clawhub · community · Quality: medium

PostNitro Carousel Generator

Generate professional social media carousel posts using PostNitro.ai with AI-driven or custom slide content for LinkedIn

by clawhub · community · Quality: medium

Bookkeeper

Automates invoice intake from Gmail, extracts data via OCR, verifies payment in Stripe, and creates reconciliation-ready

by clawhub · community · Quality: medium

Song

Write original songs with guided lyric development, chord progressions, melody contours, and AI music generator prompts

by clawhub · community · Quality: medium

clawpet

OpenClaw pet companion skill. Manage adopted pets, run interactions, and produce pet image prompts.

by clawhub · community · Quality: medium

openocr-skill

Extract text from images, documents and scanned PDFs using OpenOCR - a lightweight and efficient OCR system with documen

by clawhub · community · Quality: medium

Video Editing

Edit videos with AI background removal, color grading, upscaling, stabilization, and enhancement tools.

by clawhub · community · Quality: medium

Generate responsive HTML pages suitable for reporting, supporting resizing and screenshot capture.

Generates a structured report HTML based on a specific template. Invoke when user wants to create a report, slide, or su

by clawhub · community · Quality: medium

Local STT (Nvidia Parakeet + Whisper Support)

Local STT with selectable backends - Parakeet (best accuracy) or Whisper (fastest, multilingual).

by clawhub · community · Quality: medium

Agentic Calling

Enable AI agents to autonomously make, receive, transcribe, route, and record phone calls using Twilio with customizable

by clawhub · community · Quality: medium

Youtube Search

Search YouTube for videos and channels, search within specific channels, then fetch transcripts. Use when the user asks

by clawhub · community · Quality: medium

Youtube Video Analyzer

Analyze YouTube videos by synchronizing transcript text with visual frames to produce detailed summaries, step-by-step g

by clawhub · community · Quality: medium

Ipcam

Control ONVIF Profile S/T IP cameras for PTZ, presets, discovery, and RTSP snapshot/recording with auto-discovery and mu

by clawhub · community · Quality: medium

EchoDecks

AI-powered flashcards and audio podcasts for active recall.

by clawhub · community · Quality: medium

LiveAvatar

Talk face-to-face with your OpenClaw agent using a real-time video avatar powered by LiveAvatar

by clawhub · community · Quality: medium

Japanese Tutor

Interactive Japanese learning assistant. Supports vocabulary, grammar, quizzes, roleplay, PDF/DOCX material parsing for

by clawhub · community · Quality: medium

Listen

Improve transcription accuracy over time. Learn corrections, configure STT.

by clawhub · community · Quality: medium

Image and Video Generation with Vydra API

AI image and video generation via Vydra.ai API. Access Grok Imagine, Gemini, Flux, Veo 3, Kling, and ElevenLabs through

by clawhub · community · Quality: medium

Elevenlabs Twilio Memory Bridge

FastAPI personalization webhook that adds persistent caller memory and dynamic context injection to ElevenLabs Conversat

by clawhub · community · Quality: medium

PowerPoint Automation

Automate common PowerPoint/WPS Presentation operations on Windows via COM (read text/notes/outline, export PDF/images, r

by clawhub · community · Quality: medium