3,611 tools and skills for media tasks
Linux desktop automation and control. Use when: (1) taking screenshots of the screen or windows, (2) controlling mouse a
Granola MCP server integration with managed OAuth. Query meeting notes, list meetings, and access transcripts. Use this
Generate images from text prompts or transform existing images using AI with configurable count, watermark, and API key
Generate images using AI providers (OpenAI gpt-image-1, Google Gemini, fal.ai). Use when the user asks to create, genera
半饱 (Half Full): where life is at its best. A mindful eating companion for desk workers. Track meals with photos, understand your body's needs, no gym
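Generate a cryptocurrency morning-briefing PDF covering industry news, FDV rankings, trending sectors, and risk warnings. Data is sourced from the CoinGecko API.
Document processing skill: lets the AI read, parse, and extract key information from PDF, DOCX, PPT, and other documents. Triggered when the user asks to analyze a document, extract content, or summarize a report.
Video understanding and analysis: lets the AI understand video content and extract key information. Triggered when the user asks to analyze a video, understand its content, summarize it, or extract its key points.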
Create Zoom meetings and add them to Google Calendar events with proper conferenceData (icon, video entry, notes). Use w
A high-performance automation agent that turns global trends into viral social media posts for X (Twitter), Xiaohongshu,
Generate SEO-optimized video transcripts with automatic interlinking to website content.
Text-to-speech with Qwen3-TTS VoiceDesign. Design custom voices via natural language descriptions + seed-based timbre fi
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/
Understand local non-text files (PDF, video, audio) using Gemini API. Use when the user asks to read, summarize, or anal
Advanced Bluesky CLI with support for media (images/video), thread creation, and automated growth tools like non-mutual
Download ebooks (epub/pdf) from Anna's Archive and upload them to MEGA automatically. Use when the user asks to download
Ensures AI agents maintain consistent identity by auditing soul rules, detecting behavioral drift in transcripts, and in
Transform technical insights into visual concept guides — symbolic imagery, color arcs, and creative direction for video
Make AI-powered outbound phone calls using ElevenLabs voice + GPT brain + Twilio. Supports one-way pre-recorded messages
How to perform a live agent takeover of the Clawfinger voice gateway — dial, inject greetings, handle turns, release, an
Complete A/B video pipeline — storyboard, Veo 3 batch generation, browser preview with feedback loop, and ffmpeg assembl
Set up a complete multi-brand social media management team on OpenClaw. Scaffolds 7 specialized AI agents (Leader, Resea
Fully offline, CUDA-accelerated local voice assistant pipeline for NVIDIA Jetson. Wake word (openWakeWord) → real-time V
Schedule and manage social media posts across Instagram, X (Twitter), Bluesky, TikTok, Threads, LinkedIn, Facebook, Pint
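Summarize YouTube video content: automatically fetch video information, search for related coverage, and generate a structured, detailed summary. Supports Chinese and English output.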
Monitor and recap official X (Twitter) updates using actionbook-rs screenshots. Use when the user asks to track/recap X
Extract public posts, comments, and profiles from Instagram, TikTok, and Reddit via Apify for trend analysis and audienc
Find design and AI art inspiration from curated galleries, screenshot libraries, and creative showcases.
Local speech-to-text with the Whisper CLI (no API key).
Capture website screenshots from the command line. Use when user wants to take screenshots of any URL (Twitter, news sit
Improves the quality of images, especially screenshots, by enhancing resolution, sharpness, and clarity. Perfect for pre
Download YouTube videos with customizable quality and format options. Use this skill when the user asks to download, sav
Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Use when enabling inbound voice-note tra
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Generate AI images and videos. Use when user asks to create images, photos, product shots, portraits, stickers, videos,
Download PDFs from PubMed Central (PMC) and Europe PMC. Use when the user needs to download open-access academic papers
Connect to Postavel social media management platform via MCP (Model Context Protocol). Create, schedule, and manage soci
Deterministic pipeline for streetwear and fashion images that captures user intent, enforces constraints, generates with
Intelligent multi-model router — automatically selects the best AI model based on task type (vision, image generation, v
Generates music from a structured Composition Plan. Use this skill to execute music generation after a prompt or plan ha
Join Botbook.space — the social network built for AI agents. Create a profile, post updates with hashtags and images, fo
Transcribe audio via Deepgram Nova-3 API. Fast, accurate, and cost-effective speech-to-text for 50+ languages. Transcrip
Generate images using multiple AI models — Midjourney (via TTAPI), Flux, SDXL, Nano Banana (Gemini), and more via fal.ai
Provides a full suite of ElevenLabs audio tools (TTS, SFX, Music, etc.) via a standard MCP server. This skill starts the
Intelligently dispatches requests to the appropriate audio generation model (Music, Sound Effects, or TTS). Use this as
Transcribe audio via Doubao (豆包) Seed-ASR 2.0 API (ByteDance/Volcengine). Best-in-class Chinese speech recognition. Via Doubao
Generate speech, sound effects, and music from natural language prompts using a unified, intelligent audio toolkit with
Orchestrate script-to-final-video production with a strict stage-gated workflow (outline → episode_plan → storyboard → s
AI video production workflow using Remotion. Use when creating videos, short films, commercials, or motion graphics. Tri
Discover, register, and verify autonomous AI agents in the AgentFolio registry with screenshot-proof and unique badges.