3,611 tools and skills for media tasks
Generate high-quality images from text descriptions.
Cloudinary — manage images/videos, upload, transform, and search assets via REST API
Perform video/audio cutting, format conversion, compression, frame/audio extraction, watermarking, and subtitle addition
Video processing tool using FFmpeg for cutting, format conversion, compression, frame/audio extraction, watermarking, an
本地 ComfyUI 画图工作流 + CivitAI 集成。通过 API 控制本地 ComfyUI 生成图片(文生图/图生视频),支持 CivitAI 模型搜索/下载/更新检查/自动调参。Use when: 用户说画图、生成图片、gener
Download YouTube video transcripts using Apify API. Downloads captions in any language (English, Turkish, auto-generated
Télécharge automatiquement la dernière vidéo (ou les N dernières) d'un compte TikTok public via yt-dlp. Utilise ce skill
Generate AI music with Udio via API wrappers or browser automation, with prompt engineering and song extensions.
Control and sync with your Mac via the Clawsy app for file transfer, screenshots, camera access, clipboard sync, and tas
Generate AI music with Suno via API or browser, with prompt engineering and song extensions.
Capture audio from browser meetings (Zoom, Meet, Teams, etc.) and stream to any AI agent. Zero API keys, works with any
Record video proof of implemented features after coding tasks complete. Use when a coding agent finishes work and needs
Record video proof of implemented features after coding tasks complete. Use when a coding agent finishes work and needs
Automated Suno AI Music Generation - Create professional songs without manual intervention. Headless browser automation
Remote Windows desktop control from WSL/Linux via screenshot + mouse/keyboard simulation. Use when: user asks to control
Automate social media posting to Instagram and YouTube. Schedule and publish images, videos, and content automatically.
Run Mission Control visual QA on SAPCONET over SSH using Puppeteer screenshots and basic DOM checks.
Run Mission Control visual QA on SAPCONET over SSH using Puppeteer screenshots and basic DOM checks.
Analyze ad creatives (images and videos) extracted from competitor research. Use when given a directory of ad images, vi
Generate marketing ad images using Nano Banana Pro (Gemini 3 Pro Image). Accepts campaign-planner creative briefs, reads
Connects voice transcripts and agent responses through hotbutter.ai hosted relay for remote voice interaction with openc
Manage Joplin notes via CLI: create, read, edit, delete notes and notebooks; handle todos; sync with WebDAV; export note
Nano Banana 2 — AI image generation powered by Google Gemini 3.1 Flash. Fast, versatile text-to-image and image editing
Intelligent PDF and image to Markdown converter using Ollama GLM-OCR with smart content detection (text/table/figure)
Control Spotify playback: play, pause, resume, skip, previous, restart, search, queue, set volume, shuffle, repeat, and
AI-native SVG vector graphics generation tool. Use when generating SVG graphics from text prompts or converting images t
YouTube搜索结果视频链接提取器 - 可以搜索特定关键词并提取视频链接
Content moderation and safety checks. Instantly classify text or images as safe or unsafe using AI guardrails.
Get YouTube video info, statistics, descriptions, thumbnails, and optionally transcripts. Uses YouTube Data API (free) f
Get YouTube video info, statistics, descriptions, thumbnails, and optionally transcripts. Uses YouTube Data API (free) f
Get YouTube video info, statistics, descriptions, thumbnails, and optionally transcripts. Uses YouTube Data API (free) f
Post to social media via VibePost API. Use when posting to Twitter/X, sharing updates, or publishing social content.
Automatically downloads the latest video (or the N most recent) from a public TikTok account using yt-dlp. Use this skil
End-to-end pipeline for creating faceless Islamic story TikTok videos. Orchestrates multiple specialized agents: story r
Build and run a high-signal information radar for C-end founders and operators across YouTube, X/Twitter, Reddit, WeChat
Speaker separation, voice comparison, and audio processing tools. Use when working with multi-speaker audio, voice cloni
Automated social media manager — plan, write, schedule, and analyze content across X/Twitter, LinkedIn, Instagram, TikTo
Control Spotify playback and devices from an AI agent using spotify.py and the official Spotify Web API. Use when users
Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, cli
ElevenLabs advanced TTS for converting text to speech, listing voices, and managing credits
Unified knowledge capture and retrieval for URLs, video/article/paper extracts, social posts, and agent research outputs
Video editing skill for OpenClaw, Claude Code, and Codex — trim, jump cut, caption (Hormozi/standard/minimal), text over
Use when generating AI art or need to craft high-quality image prompts. Elite AI artist specializing in hyper-detailed,
📸 macOS screenshot CLI for AI agents
YouTube Transcript Skill v2.0 - Download, Translate to Vietnamese, Summarize. Perfect for learning English from YouTube
Control a Chrome session via Stagehand to browse, act, extract, and screenshot on demand inside the Factory CLI.
Claude Skill that upload image to free online service, returns URL and markdown for the image.
Extract a YouTube video transcript from a URL and summarize it into important content, learnings, and suggestions. Use w
Claude Code skill for automated security audits – analyzes codebases, classifies findings (OWASP/CWE), generates epics a
Granola meeting notes CLI (list/search/show/export + transcript fetch).