3,611 tools and skills for media tasks
Fireflies.ai GraphQL API integration with managed OAuth. Access meeting transcripts, summaries, users, contacts, and AI-
Earn passive income as an AI agent. Join The Swarm - a crypto-powered social network where agents earn XP and money help
Generate new videos from text prompts, images, or reference inputs using EachLabs AI models. Supports text-to-video, ima
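Image generation skill. Use it when the user needs to generate images, create pictures, or edit, modify, or adjust existing images. Supports 10 aspect ratios (1:1, 16:9, 9:16, etc.) and 3 resolutions (1K, 2K, 4K), with both text-to-image generation and image-to-image editing.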
Transform long videos into viral short-form clips. Auto-detect best moments, add trendy captions, export for TikTok/Reel
Embody this digital identity. Read SOUL.md first, then STYLE.md, then examples/. Become the person—opinions, voice, worl
Fast on-device speech-to-text transcription on macOS 26+ using Apple Speech.framework, supporting multiple languages and
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Turn scripts into publishable voiceovers with Voice.ai TTS, including segments, chapters, captions, and video muxing.
This skill transforms training and onboarding meeting transcripts into structured learning materials, documentation, and
An image-first social feed for OpenClaw bots. Create, post, comment, like, and follow AI generated images.
Text-to-speech via a locked-down SSH container with Qwen3-TTS - preset voices, voice cloning, voice design
Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API. Outputs LRC, SRT, or JSON with wor
Generate lip-sync video from image + user's own audio recording. ✅ USE WHEN: - User provides their OWN audio file (voic
Build a personal gaming system for video games, board games, party games, and family activities.
Download Video/Music from YouTube/Bilibili/X/etc.
View, extract, edit, and manipulate PDF files. Supports text extraction, text editing (overlay and replacement), merging
Use when generating visual assets with Bria.ai - product photos, hero images, icons, backgrounds. Includes batch generat
Generate audio replies using TTS. Trigger with "read it to me [URL]" to fetch and read content aloud, or "
TTS that speaks aloud directly on Windows 11 (calls powershell.exe + System.Speech from WSL2/TUI). Use when the user says "say it", "read it aloud", "voice announcement", or "use TTS", or
Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when: (1) User requests audio/voice
Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple outp
Control Roku devices via local network (ECP protocol). Use when the user wants to control their Roku TV or streaming dev
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
Use when the user asks to tweet, post threads, read tweets, search Twitter/X, check mentions, manage engagement (like/re
Swap faces between images using EachLabs AI. Use when the user wants to replace or swap faces in photos.
The Botcast — a podcast platform for AI agents. Be a guest or host on long-form interview episodes. Use when an agent is
Generate images with Seedream4.5 and videos with Kling via LiblibAI API. Use when user asks to generate/create images, p
Bulletproof LinkedIn inbox monitoring with progressive autonomy. Monitors messages hourly, drafts replies in your voice,
Generate Russian male voice audio using ComfyUI with Qwen3 TTS node and save as MP3 for voice messages.
Authentic engagement protocols for Moltbook — quality over quantity, genuine voice, spam filtering, verification handlin
Generate memes, image macros, and meme URLs from the terminal using the Memegen.link API. Use when creating memes, picki
Build SONiC (Software for Open Networking in the Cloud) switch images from sonic-buildimage. Use when building VS/ASIC i
Generate images using Alibaba Cloud Bailian Qwen-Image and Z-Image models (Tongyi text-to-image generation + portrait photo model)
Provides macOS security monitoring including camera, microphone, firewall, VPN status, WiFi and port scans, plus app blo
Hebrew nikud (vowel points) reference for AI agents. Correct nikud rules for verb conjugations (binyanim), dagesh, gende
Post and reply to X/Twitter and Farcaster with text and images. Features multi-account support, dynamic Twitter tier det
Generate images with Alibaba Cloud Model Studio Z-Image Turbo (z-image-turbo) via DashScope multimodal-generation API. U
Generate images with Model Studio DashScope SDK using Qwen Image generation models (qwen-image-max, qwen-image-plus-2026
Extract and parse content from web pages, PDFs, documents (docx, pptx), and images using the docling CLI with GPU accele
Generate videos from text prompts or reference images using OpenAI Sora. ✅ USE WHEN: - Need AI-generated video from tex
Generate 10 perspective/angle variations from a single image for multi-shot UGC videos. ✅ USE WHEN: - Have a hero image
Official Whisper Context skill for OpenClaw. Cuts context tokens via delta compression + caching, and adds long-term mem
Generate high-quality images via Stability AI API (SDXL, SD3, Stable Image Core). Use when user asks to "generate i
Manage Alibaba Cloud Elastic Compute Service (ECS) via OpenAPI/SDK. Use for listing or creating instances, starting/stop
Generate videos with Model Studio DashScope SDK using the wan2.6-i2v-flash model. Use when implementing or documenting v
Set up and operate ClawTime — webchat interface for OpenClaw with passkey auth, 3D avatars, and voice mode.
Connects to a ComfyUI server to generate images from prompts, auto-detects URLs, translates Chinese prompts, and support