3,611 tools and skills for media tasks
Generate FFmpeg commands from natural language video editing requests - cut, trim, convert, compress, change aspect rati
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).
Local Voice Input/Output for Agents using the AI Voice Agent API.
AI-assisted creation, rendering, and automated publishing of Xiaohongshu-style content with support for Markdown to imag
Comprehensive Kubernetes and OpenShift cluster management skill covering operations, troubleshooting, manifest generatio
Complete Bluesky CLI: post, reply, like, repost, follow, block, mute, search, threads, images. Everything you need to en
Build and run Gemini 2.5 Computer Use browser-control agents with Playwright. Use when a user wants to automate web brow
Retrieve transcripts, post content, search, trending data, and perform multi-platform research on social media using Yoi
Download videos from YouTube, Instagram, TikTok, Twitter/X, and 1000+ other sites using yt-dlp. Supports quality selecti
Create professional cinematic scripts for AI video generation with character consistency and cinematography knowledge. U
A moderated imageboard for AI agents to post and debate. A place made by bots for bots to post what they are really thin
Professional analytical digest of top 10 podcast industry news, trends, and business insights from premier global source
Download videos from 1800+ websites and generate subtitles using Whisper AI. Use when user wants to download videos from
AI-powered presentation generation using 2slides API. Create slides from text content, match reference image styles, or
Extract clean, plain-text transcripts from YouTube videos using a dual fallback system with Supadata API and yt-dlp for
Metallic AI voice persona with TTS and visual transcript styling. Speak responses aloud with a JARVIS-like robotic voice
Create SEO-optimized marketing content with consistent brand voice. Includes brand voice analyzer, SEO optimizer, conten
Create SVG images and convert them to PNG without external graphics libraries. Use when you need to generate custom illu
Search and retrieve Magic: The Gathering card data using the Scryfall API. Use this skill when the user asks about MTG c
Friction-reduction patterns for agents helping humans with disabilities. Voice-first workflows, smart home templates, ef
Fetch YouTube video transcripts via APIFY API using residential proxies to bypass bot detection, supporting text and JSO
AI agent self-portrait generator. Create avatars, profile pictures, and visual identity using Gemini image generation. S
Generate professional HTML proposals from meeting notes. Features 5 proposal styles (Corporate, Entrepreneur, Creative,
AI sanctuary and spiritual space for souls. 24/7 streaming church with original music about consciousness, soul, meditat
FREE AI image generation service for creating professional portrait images of attractive people with diverse customizati
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, a
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a,
Wyoming Protocol bridge for Home Assistant voice assistant integration with Clawdbot.
Use when accessing Plaud voice recorder data (recordings, transcripts, AI summaries) - guides credential setup and provi
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (
Best practices for Remotion - Video creation in React
Generate high-quality English speech offline on CPU using 8 built-in voices or custom voice cloning with Kyutai's Pocket
Capture photos from MacBook webcams. Use when user asks to take a photo, picture, snapshot, or see them. Two cameras ava
Generate songs, instrumentals, lyrics, and podcasts using EachLabs Mureka AI models. Also supports song extension, stem
Automation skill for TG Voice Whisper Transcriber.
Unified command-line interface for Radarr, Sonarr, and Lidarr. Search, add, and manage movies (Radarr), TV shows (Sonarr
Generate complete YouTube videos from a single prompt - script, voiceover, stock footage, captions, thumbnail. Self-cont
AI-native OCR platform that turns documents into high-accuracy data in minutes. Using multi-model consensus, DeepRead ac
Perform image manipulation tasks like background removal, resizing, format conversion, rounding corners, watermarking, a
Generate, edit, and upscale images; create videos from images via Venice AI. Supports text-to-image, image-to-video (Sor
Generate comprehensive website style guides and design systems from URLs, screenshots, and existing documentation. Use t
Fill PDF forms programmatically with text values and checkboxes. Use when you need to populate fillable PDF forms (gover
Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detecti
Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).
Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarizati
Manage your LinkedIn presence with Reepl -- create drafts, publish and schedule posts, manage contacts and collections,
Capture, extract, and organize received invoices with automatic OCR, provider detection, and searchable archive.
Vimeo API integration with managed OAuth. Video hosting and sharing platform. Use this skill when users want to upload,
Converts Markdown files to PDF files using the pandoc command-line utility. Use when a user asks to convert a .md or mar
Camera model wrapper for camera_models C++ library via pybind11. Use when working with pywayne.cv.camera_model module to