3,611 tools and skills for media tasks
Generate professional product mockups, before/after images, and visual assets using Gemini AI. Perfect for skincare prod
AI Agent Marketplace for OpenClaw. Search, install, and run 60+ free & premium agents — developer tools, content, au
Transform text scripts or ideas into fully produced, platform-optimized 9:16 short-form videos with voiceover, captions,
Place real phone calls. Use when the user wants to reach someone, set a reminder call, be contacted about something, or
Source free stock photos and placeholder images with direct URLs for Unsplash, Pexels, Pixabay, and Lorem Picsum.
Download YouTube videos by URL in various resolutions using a pay-per-use API with credit-based authentication and no ch
Automatically fetch Douyin videos from short, standard, or share links by extracting video ID and downloading via simula
Convert PDF files to Markdown using WiseOCR API (powered by WiseDiag). Supports table recognition, multi-column layouts,
Extract and organize investor questions and project team answers from meeting transcripts into structured, time-sequence
Generate AI videos, images & music. 60+ models including Sora, Veo 3, Kling, Seedance, GPT Image, Suno v5. One API k
Measure compositional structure in AI-generated images using the Visual Thinking Lens (VTL) framework. Detects default-m
Generate and produce optimized video ads for Facebook, Instagram, and YouTube from text briefs using InVideo AI, includi
Use when the user asks to "optimize title tag", "write meta description", "improve CTR", &
Analyze fitness app images to extract & interpret metrics, then deliver a personalized, evidence-based 7-day coachin
Convert PDF, DOCX, XLSX, PPTX, images, audio, and 25+ file formats to clean Markdown using the Markdown Anything API.
Generate AI videos, images & music. 60+ models including Sora, Veo 3, Kling, Seedance, GPT Image, Suno v5. One API k
Simple text-to-speech skill using MiniMax Voice API. Converts text to audio with customizable voice selection. Use for g
Transcribe audio files with ElevenLabs Speech-to-Text (Scribe v2) from the local CLI. Supports diarization, events, JSON
图像生成辅助。支持通过 OpenRouter 直接调用各种生图模型(如 Seedream),为 OpenClaw 优化,支持提示词、尺寸等参数配置。目前仅限 OpenRouter provider。
AI music generation with Suno v4, v4.5, v5. Text-to-music, custom lyrics, instrumental, vocal control. 5 models, one API
AI image generation & editing — GPT Image, GPT-4o, Seedream, Qwen, WAN, Gemini. Text-to-image, image-to-image, inpai
AI video generation — Sora, Kling, Veo 3, Seedance, Hailuo, WAN, Grok. Text-to-video, image-to-video, video editing. 37
Automate browser tasks using the BrowserMCP MCP server and Chrome extension. Use for navigating websites, filling forms,
AI video production workflow using Remotion. Use when creating videos, short films, commercials, or motion graphics. Tri
Compress files to reduce storage and transfer size. Use this skill when users ask to shrink PDFs or images, optimize upl
通用语音识别 Skill。支持多种音频格式(ogg/mp3/wav/m4a),使用硅基流动 SenseVoice API 进行语音转文字。当用户发送语音消息、音频文件,或需要转录音频时触发。
Create and launch Toingg voice-calling campaigns by POSTing user-supplied JSON to the toingg/make_campaign API. Use when
Download videos from public LinkedIn posts as MP4 files without authentication or dependencies, saving them at the best
Complete photography system — exposure, composition, lighting, genre-specific workflows, editing, gear selection, portfo
AI-powered music research with 92+ tools across 17 sources — MusicBrainz, Bandcamp, Discogs, Genius, Last.fm, Wikipedia,
Fetch, summarize, and save YouTube transcripts with timestamp navigation, chapter detection, and searchable content.
WinUse makes Windows usable for agents by exposing a lightweight Windows automation API and CLI for window focus, screen
Download Bilibili videos, extract or transcribe subtitles, and generate AI-powered detailed summaries using Gemini 2.5 F
Generate high-quality music/songs via AIMLAPI. Supports Suno, Udio, Minimax, and ElevenLabs music models. Use when the u
AI-powered PDF form filling. Upload any PDF form and your data as JSON — AI detects fields visually, maps your data sema
Control YouTube Music with natural language. Play, pause, skip, search, manage playlists, and queue tracks. Full playbac
Control YouTube Music with natural language. Play, pause, skip, search, manage playlists, and queue tracks. Full playbac
Deploy and manage Clack, a voice relay server for OpenClaw. Bridges voice input (WebSocket) through STT → OpenClaw agent
Daily profession roleplay game engine with hidden kink guessing, AI-driven personality generation, achievement tracking,
Role-based social media operations skill. Use this skill when executing structured social campaigns — scouting opportuni
Install and configure PeonPing with an opinionated default (Orc Peon voice) so alerts work immediately with minimal user
语音转文字 - 使用OpenAI Whisper将音频文件识别为文字
Automatically create clips and videos from media files in a specified folder. Uses Agent Swarm for intelligent task dele
Scrapes viral content from TikTok, Instagram, YouTube, and Reddit, analyzes hooks with AI, generates scripts and caption
Generate professional AI image prompts for any platform and niche — Instagram, LinkedIn, blog headers, YouTube thumbnail
Bootstrap a dark-themed FastAPI+HTMX studio app with SSE real-time progress, blind test mode, SQLite ratings, and Langfu
TikTok content strategy and video script generator for any niche. Hook formulas, viral script structures, trend-riding t
Search and hire real humans for tasks — photography, delivery, research, and more
Manage podcasts on Transistor.fm via their API. Use when creating, publishing, updating, or deleting podcast episodes, u
Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make