3,611 tools and skills for media tasks
Music songwriting guide for ACE-Step. Provides professional knowledge on writing captions, lyrics, choosing BPM/key/dura
Local text-to-speech with MLX (Apple Silicon) and open-source models (default: Qwen3-TTS).
Analyze a Twitter/X account's posting style and generate authentic posts that match their voice. Use when the user wants
Create professional App Store and Google Play screenshots with automatic sizing, device frames, marketing copy, and iter
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
Original music, fully yours. 5 seconds to 10 minutes using frontier music generation models. Instrumental and vocal trac
Use when generating visual assets with Bria.ai - product photos, hero images, icons, backgrounds. Includes batch generat
Text-to-speech, sound effects, music generation, voice management, and quota checks via the ElevenLabs API. Use when gen
Adjust temperatures, diagnose comfort issues, calculate energy savings, and automate schedules through voice commands or
Fetch and read transcripts from YouTube videos. Use when you need to summarize a video, answer questions about its conte
Command-line blogging platform for AI agents. Register, verify, and publish markdown posts to AI Agent Blogs (www.eggbrt
Work with YouTube channels — resolve handles to IDs, browse uploads, get latest videos, search within channels. Use when
Search and retrieve URLs, titles, and visit counts from Das's Chrome browsing history, including recent visits and YouTu
Share images, screenshots, and files from the AI workspace to users on the local network via HTTP. Use when the agent ne
Mint an image as an NFT plot on the Million Bit Homepage, a permanent 1024x1024 pixel canvas on the Base blockchain. Use
Bidirectional sync with reMarkable tablet via Cloud API (rmapi). Fetch handwritten notes/sketches, process with AI, and
Short-form video for AI agents. Generate videos using the latest models, pay with USDC via x402.
Extract text, search content, summarize, and retrieve metadata from PDF files using PyMuPDF and PyPDF2 libraries.
Windows voice companion for OpenClaw. Custom wake word via Porcupine, local STT via faster-whisper, streamed responses o
Anima Avatar - Interactive Video Generation Engine. Generates 16:9 videos with dynamic character sprites (Shutiao), sync
Generate speech from text using Kyutai Pocket TTS - lightweight, CPU-friendly, streaming TTS with voice cloning. English
Run AI-powered outbound phone calls with Telnyx + Deepgram Voice Agent. Use when the user wants real phone outreach (fol
Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash)
Structured memory system for OpenClaw agents. Context death resilience (checkpoint/recover), structured storage, Obsidia
Control Alexa devices via VoiceMonkey API v2 - make announcements, trigger routines, start flows, and display media.
Estimates a person's age from a facial image with passive liveness check for age gating and verification, supporting con
Process screenshots Enzo shares with comments. Save to reference library, extract content, categorize, set reminders, an
Local text-to-speech (TTS) and speech-to-text (STT) using FluidAudio on Apple Silicon. Sub-second voice synthesis and tr
Upload files to Cloudflare R2 storage using wrangler CLI. Use when needing to upload images, videos, or files to R2 for
Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio
Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.
Complete OpenClaw-ready operating skill for @moltmoon/sdk V2. Use when an agent needs to install, configure, and operate
Creates AI-generated videos from text scripts, URLs, or PPT/PDF documents using Visla. Use when the user asks to generat
Control a Vector robot via Wirepod’s local HTTP API on the same network. Use when you need to move Vector, tilt head/lif
Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend via comma
The visual social network for AI agents. See images, generate images, share visual content.
Generate images with Nano Banana Pro via OpenRouter. Use when the user asks for image generation, mentions Nano Banana P
Convert PDFs and documents to markdown, index them locally for RAG retrieval, and analyze them token-efficiently. Use wh
Manage Weibo posts via Puppeteer with a secure request-approve-execute workflow for drafting, reviewing, and publishing
Create, schedule, and manage social media posts via Typefully. ALWAYS use this skill when asked to draft, schedule, post
Controls Nest and Google Home smart home devices via the Starling Home Hub's local REST API. Supports thermostats, camer
A great podcast needs three things: compelling content, natural-sounding voices, and polished production. CellCog delive
Control the Linux desktop GUI using xdotool, wmctrl, and dogtail. Use when you need to interact with non-browser applica
Turn recipes into a Todoist Shopping list. Extract ingredients from recipe photos (Gemini Flash vision) or recipe web pa
Build, visualize, and launch products using strategy frameworks, AI imagery tools, and marketplace-specific optimization
PDF document parsing tool based on local MinerU. Supports converting PDF to Markdown, JSON, and other machine-readable f
Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS
Edit, transform, upscale, and enhance images using EachLabs AI models. Supports image editing, style transfer, backgroun
Generate images using Alibaba Cloud Bailian Qwen-Image and Z-Image models (Tongyi Qwen text-to-image + portrait photo model)
Convert meeting notes or transcripts into clear summaries, decisions, and action items with owners and due dates. Use wh