Media AI Skills - 3,611 Tools

Moark Image Generation

Generate high-quality images from text descriptions.

by fchange · community · Quality: medium

Cloudinary

Cloudinary — manage images/videos, upload, transform, and search assets via REST API

by aiwithabidi · community · Quality: medium

cutmv Video Tool

Perform video/audio cutting, format conversion, compression, frame/audio extraction, watermarking, and subtitle addition

by QiaoTuCodes · community · Quality: medium

cutmv

Video processing tool using FFmpeg for cutting, format conversion, compression, frame/audio extraction, watermarking, an

by QiaoTuCodes · community · Quality: medium

ComfyUI Painter

本地 ComfyUI 画图工作流 + CivitAI 集成。通过 API 控制本地 ComfyUI 生成图片（文生图/图生视频），支持 CivitAI 模型搜索/下载/更新检查/自动调参。Use when: 用户说画图、生成图片、gener

by zeron-G · community · Quality: medium

YouTube Apify Transcript

Download YouTube video transcripts using Apify API. Downloads captions in any language (English, Turkish, auto-generated

by TevfikGulep · community · Quality: medium

Download-video-tiktok

Télécharge automatiquement la dernière vidéo (ou les N dernières) d'un compte TikTok public via yt-dlp. Utilise ce skill

by stoxca · community · Quality: medium

Udio

Generate AI music with Udio via API wrappers or browser automation, with prompt engineering and song extensions.

by ivangdavila · community · Quality: medium

Clawsy

Control and sync with your Mac via the Clawsy app for file transfer, screenshots, camera access, clipboard sync, and tas

by iret77 · community · Quality: medium

Suno

Generate AI music with Suno via API or browser, with prompt engineering and song extensions.

by ivangdavila · community · Quality: medium

Browser Audio Capture

Capture audio from browser meetings (Zoom, Meet, Teams, etc.) and stream to any AI agent. Zero API keys, works with any

by jarvis563 · community · Quality: medium

Video Proof

Record video proof of implemented features after coding tasks complete. Use when a coding agent finishes work and needs

by rikisann · community · Quality: medium

Video Proof

Record video proof of implemented features after coding tasks complete. Use when a coding agent finishes work and needs

by rikisann · community · Quality: medium

SunoMaker

Automated Suno AI Music Generation - Create professional songs without manual intervention. Headless browser automation

by Vitja1988 · community · Quality: medium

PC Control

Remote Windows desktop control from WSL/Linux via screenshot + mouse/keyboard simulation. Use when: user asks to control

by zeron-G · community · Quality: medium

Social Media Suite

Automate social media posting to Instagram and YouTube. Schedule and publish images, videos, and content automatically.

by Vitja1988 · community · Quality: medium

Billy — Mission Control Visual QA

Run Mission Control visual QA on SAPCONET over SSH using Puppeteer screenshots and basic DOM checks.

by Highlander89 · community · Quality: medium

Mission Control Visual QA

Run Mission Control visual QA on SAPCONET over SSH using Puppeteer screenshots and basic DOM checks.

by Highlander89 · community · Quality: medium

Ad Creative Analysis

Analyze ad creatives (images and videos) extracted from competitor research. Use when given a directory of ad images, vi

by baitoxkevin · community · Quality: medium

Ad Designer

Generate marketing ad images using Nano Banana Pro (Gemini 3 Pro Image). Accepts campaign-planner creative briefs, reads

by baitoxkevin · community · Quality: medium

deprecated ignore

Connects voice transcripts and agent responses through hotbutter.ai hosted relay for remote voice interaction with openc

by michael-stajer · community · Quality: medium

joplin

Manage Joplin notes via CLI: create, read, edit, delete notes and notebooks; handle todos; sync with WebDAV; export note

by davek-dev · community · Quality: medium

Nano Banana 2 — AI Image Generation (Gemini 3.1 Flash Image, Google, Evolink)

Nano Banana 2 — AI image generation powered by Google Gemini 3.1 Flash. Fast, versatile text-to-image and image editing

by EvoLinkAI · community · Quality: medium

Pdf Ocr Tool

Intelligent PDF and image to Markdown converter using Ollama GLM-OCR with smart content detection (text/table/figure)

by TsukiSama9292 · community · Quality: medium

ClawSpotify

Control Spotify playback: play, pause, resume, skip, previous, restart, search, queue, set volume, shuffle, repeat, and

by clawhub · community · Quality: medium

Quiver Ai

AI-native SVG vector graphics generation tool. Use when generating SVG graphics from text prompts or converting images t

by clawhub · community · Quality: medium

Youtube Search Extractor

YouTube搜索结果视频链接提取器 - 可以搜索特定关键词并提取视频链接

by clawhub · community · Quality: medium

AIML Сontent Moderation

Content moderation and safety checks. Instantly classify text or images as safe or unsafe using AI guardrails.

by clawhub · community · Quality: medium

YouTube Master

Get YouTube video info, statistics, descriptions, thumbnails, and optionally transcripts. Uses YouTube Data API (free) f

by TevfikGulep · community · Quality: medium

YouTube Master

Get YouTube video info, statistics, descriptions, thumbnails, and optionally transcripts. Uses YouTube Data API (free) f

by TevfikGulep · community · Quality: medium

YouTube Master

Get YouTube video info, statistics, descriptions, thumbnails, and optionally transcripts. Uses YouTube Data API (free) f

by TevfikGulep · community · Quality: medium

Social Poster

Post to social media via VibePost API. Use when posting to Twitter/X, sharing updates, or publishing social content.

by JPaulGrayson · community · Quality: medium

Downloader tiktok videos

Automatically downloads the latest video (or the N most recent) from a public TikTok account using yt-dlp. Use this skil

by stoxca · community · Quality: medium

Directoryahu

End-to-end pipeline for creating faceless Islamic story TikTok videos. Orchestrates multiple specialized agents: story r

by mohamedzeidan2021 · community · Quality: medium

Multisource Intel Radar

Build and run a high-signal information radar for C-end founders and operators across YouTube, X/Twitter, Reddit, WeChat

by Rogerrrr18 · community · Quality: medium

Audio Speaker Tools

Speaker separation, voice comparison, and audio processing tools. Use when working with multi-speaker audio, voice cloni

by cmfinlan · community · Quality: medium

Social Media Engine

Automated social media manager — plan, write, schedule, and analyze content across X/Twitter, LinkedIn, Instagram, TikTo

by Batsirai · community · Quality: medium

Spotify Controller

Control Spotify playback and devices from an AI agent using spotify.py and the official Spotify Web API. Use when users

by egemenyerdelen · community · Quality: medium

Agent Browser

Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, cli

by bodietron · community · Quality: medium

Elevenlabs Pro

ElevenLabs advanced TTS for converting text to speech, listing voices, and managing credits

by mrnsmh · community · Quality: medium

Agent Knowledge Capture

Unified knowledge capture and retrieval for URLs, video/article/paper extracts, social posts, and agent research outputs

by ianderrington · community · Quality: medium

Video Editing Skill

Video editing skill for OpenClaw, Claude Code, and Codex — trim, jump cut, caption (Hormozi/standard/minimal), text over

by 6missedcalls · community · Quality: medium · 1 stars

Claw Art

Use when generating AI art or need to craft high-quality image prompts. Elite AI artist specializing in hyper-detailed,

by asimons81 · community · Quality: medium

screenshot

📸 macOS screenshot CLI for AI agents

by kxzk · community · Quality: medium · 5 stars

youtube-transcript

YouTube Transcript Skill v2.0 - Download, Translate to Vietnamese, Summarize. Perfect for learning English from YouTube

by nguyenngothuong · community · Quality: medium · 7 stars

browser

Control a Chrome session via Stagehand to browse, act, extract, and screenshot on demand inside the Factory CLI.

by factory-ben · community · Quality: medium · 3 stars

image-upload

Claude Skill that upload image to free online service, returns URL and markdown for the image.

by iamzifei · community · Quality: medium · 3 stars

youtube-transcripter

Extract a YouTube video transcript from a URL and summarize it into important content, learnings, and suggestions. Use w

by TylonHH · community · Quality: medium · 2 stars

security-audit

Claude Code skill for automated security audits – analyzes codebases, classifies findings (OWASP/CWE), generates epics a

by McGo · community · Quality: medium · 4 stars

granola-cli

Granola meeting notes CLI (list/search/show/export + transcript fetch).

by aaronvanston · community · Quality: medium · 2 stars