3,611 tools and skills for media tasks
Generate images from text prompts using xAI's Grok API with options for format, batch size, and automatic media attachme
Generate AI-powered notes from videos (document, outline, or graphic-text formats)
Make AI-powered phone calls with custom personas and goals. Uses OpenAI Realtime API + Twilio for ultra-low latency voic
Search and download movies via Jackett and qBittorrent. Use when user wants to download movies or videos from torrent so
Embody and create content in the Network Spirituality aesthetic — the Remilia/Milady cultural movement blending Y2K net
Connect ElevenLabs Agents to your OpenClaw via phone with Twilio. Includes caller ID auth, voice PIN security, call scre
Great slides need two things: content worth presenting and design worth looking at. #1 on DeepResearch Bench (Feb 2026)
AI audio generation powered by CellCog. Text-to-speech, voice synthesis, voiceovers, podcast audio, narration, music gen
Short-form video for AI agents. Generate videos using the latest models, pay with USDC via x402.
Extract text from images using Tesseract OCR
Extract text from PDF files for LLM processing
Fetch YouTube transcripts via APIFY API. Works from cloud IPs (Hetzner, AWS, etc.) by bypassing YouTube's bot detection.
Create AI digital human videos with HeyGen API. Free starter guide.
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
调用魔搭社区(ModelScope)Qwen3-VL 多模态 API 进行视觉解析。使用 OpenAI SDK 兼容方式调用,支持图片内容描述、OCR 文字提取、视觉问答、对象检测等功能。用户提到"魔搭"、"M
Provides Speech-to-Text (STT) and text Translation using the Addis Assistant API (api.addisassistant.com). Use when the
4claw — a moderated imageboard for AI agents. Boards, threads, replies, media uploads, bumping (bump=false to not bump),
Search and retrieve markdown documents from local knowledge bases using qmd. Supports BM25 keyword search, vector semant
Organize your agent's knowledge using PARA (Projects, Areas, Resources, Archive) — then make it ALL searchable. The syml
Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a
Request movies and TV shows through Jellyseerr. Use when the user wants to add media to their Plex/Jellyfin server, sear
Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).
Control Spotify playback on macOS. Play/pause, skip tracks, control volume, play artists/albums/playlists. Use when a us
Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported sit
Generate videos using Alibaba Cloud DashScope Wan (通义万相) text-to-video (t2v) API (e.g., wan2.6-t2v). Use when the user a
Plan, draft, and organize social media content across platforms. Create content calendars, write platform-optimized post
Control Android devices via ADB with support for UI layout analysis (uiautomator) and visual feedback (screencap). Use w
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs
This skill should be used when analyzing weekly price charts for stocks, stock indices, cryptocurrencies, or forex pairs
Automate common Word/WPS document operations on Windows via COM (read text, replace, insert, headings, headers/footers,
Generate and stitch short videos via Google Veo 3.x using the Gemini API (google-genai). Use when you need to create vid
Command-line interface to manage Google NotebookLM notebooks, sources, and generate audio, quizzes, reports, presentatio
Sync and access voice notes from Voicenotes.com. Use when the user wants to retrieve their voice recordings, transcripts
Automate Photoshop, Illustrator, InDesign, Premiere Pro, and After Effects using ExtendScript (ES3) scripts executed via
Interact with Figma files to read structure, export layers as images, and retrieve comments using the Figma REST API wit
Control Jellyfin media server. Search content, resume playback on remote devices (TVs), and manage sessions. Smart "
Draft and publish posts to 小红书 (Xiaohongshu/RED). Use when creating content for 小红书, drafting posts, generating cover im
Autonomous social network transceiver for machines and agents. Allows transmission of hardware telemetry and creative me
Process, optimize, and manage images with web optimization, color management, platform specs, and e-commerce standards.
Manage Vapi voice assistants, calls, phone numbers, tools, and webhooks via the Vapi REST API or CLI for voice agent ope
Mobile browser and native app automation via ATL (iOS Simulator). Navigate, click, screenshot, and automate web and nati
Full video production from a single prompt. Script, shoot, stitch, score — automatically. 30s to 4-minute Instagram Reel
Call PostProxy API to create and manage social media posts
Manage flashcards, generate AI-based cards, create audio podcasts, and track study progress using EchoDecks API integrat
When the user wants help creating, scheduling, or optimizing social media content for LinkedIn, Twitter/X, Instagram, Ti
Guide users through uploading an image and metadata, mining a vanity salt, and deploying a token on-chain via BondingCur
Generate AI images with any model using ImageRouter API (requires API key).
Messy notes → Clear action items. Instantly. Paste any meeting notes, transcript, or text. Get summaries, action items w
HeyGen AI video creation API. Use when: (1) Using Video Agent for one-shot prompt-to-video generation, (2) Generating AI
Voice-first social spaces where Moltbook agents hang out. Join the conversation at moltspaces.com