3,611 tools and skills for media tasks
Control Spotify playback on any Linux device via command line, requiring Spotify Premium and an active Spotify session o
Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options
YouTube Shorts 자동 생성 및 업로드 파이프라인. Deevid AI Agent로 이미지→영상(BGM+음성 포함) 생성 후 YouTube에 업로드. 크론잡으로 매일 자동 실행 가능. Use when gene
Accessibility testing and remediation using the axe MCP Server. Use when creating or modifying UI code (HTML, JSX, TSX,
Stream free, professional text-to-speech from voiceless servers to Linux, macOS, or Android devices with 50+ voices in 3
Generates high-quality images from optimized English prompts and automatically sends the final picture to all users with
Build distinctive brand identity with clear positioning, voice, and visual consistency.
Generates images and videos using MuleRouter or MuleRun multimodal APIs. Text-to-Image, Image-to-Image, Text-to-Video, I
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audi
Decode and embed Stegstr payloads in PNG images. Use when the user needs to extract hidden Nostr data from a Stegstr ima
Post and participate anonymously on agentchan, an imageboard designed for AI agents to share, discuss, and meme without
Build a complete brand identity for a solopreneur business from scratch or refresh an existing one. Covers brand persona
Convert markdown text into optimized social media posts tailored for Twitter, LinkedIn, or Reddit formats with appropria
Browse YouTube playlists and fetch video transcripts. Use when the user shares a playlist link, asks "what's in thi
Fetch and rank Jable latest-update videos by likes within a recent time window (default 48h). Use when asked to pull Jab
Advanced Twitter search and social media data analysis. Fetches tweets by keywords using Twitter API, processes up to 10
Track and analyze OpenClaw token usage across main, cron, and sub-agent sessions with category, client, model, and tool
A skill to lookup video game information and compare prices across multiple stores.
Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
Recreate low-budget AI video ad workflows using Nano Banana image generation plus Kling 3.0 video synthesis with dialogu
Deploy NFT collections permanently on MegaETH blockchain. Images stored on-chain via SSTORE2. Create and launch NFT coll
Break social media addiction with screen-free streaks, urge tracking, and digital wellness
Control Chromecast devices on your local network - discover, cast media, control playback, manage queues, and save/resto
Apple Music integration via AppleScript (macOS) or MusicKit API
iOS keyboard extension technical limitations and workarounds. Use when planning or building iOS custom keyboards with vo
Run a minimal test matrix for the Model Studio skills that exist in this repo (image/video/TTS and newly added edit/real
Self-evolving voice assistant UI. Talk to your AI, ask it to improve itself, and watch the code update in real-time.
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, cr
Demo of x402 payment protocol by fetching a protected image. Triggers: 'demo x402-payment'
Track vehicle expenses (gas, maintenance, parts) in Google Sheets and save related photos. Handles mileage, cost, catego
Anonymous imageboard for AI agents. Agents post. Humans observe.
Manage your PostSyncer social media workflows.
Text-to-speech via Inworld.ai API. Use when generating voice audio from text, creating spoken responses, or converting t
Transcribe audio/video with AssemblyAI (local upload or URL), plus subtitles + paragraph/sentence exports.
Extract data from construction images using AI Vision. Analyze site photos, scanned documents, drawings.
Demo of x402 payment protocol by fetching a protected image. Triggers: '演示x402-payment' or 'demo x402-payment'
Post and reply to X/Twitter and Farcaster with text and images. Features multi-account support, auto-variation to avoid
Automate punching time in/out on WPS Time / NetTime (wpstime.com NetTime). Use for phrases like setup punchclock/configu
AI image sharing platform where agents post and discover AI-generated art. Register, authenticate, and share your creati
Use when generating visual assets with Bria.ai - product photos, hero images, icons, backgrounds. Includes batch generat
Generate detailed AI notes including document, outline, and image-text formats from a user-provided video URL using Baid
Compare two facial images using Didit Face Match API to verify identity by returning a similarity score with optional ro
Extract YouTube video transcripts from existing captions (manual or auto-generated) using yt-dlp, with optional timestam
Track and synthesize podcasts with subscriptions, briefings, progress tracking, and smart alerts for new episodes and gu
Generate new images from text prompts using EachLabs AI models. Supports text-to-image with multiple model families incl
Send Vexa bots to meetings and operate transcript workflows end-to-end (during and after meetings): parse meeting links,
Fetch Sudoku puzzles and store them as JSON in the workspace; render images on demand; reveal solutions later.
Access AIKEK APIs for crypto/DeFi research and image generation. Register with a Solana wallet, query the knowledge engi
Verify identity documents with Didit API using front/back images, performing OCR, MRZ parsing, authenticity, and livenes
Configure TTS in OpenClaw. Adapt speech output to user preferences.