3,611 tools and skills for media tasks
Book music-lessons services through Lokuli MCP. Use when user needs to find and book music-lessons. Triggers on requests
Search and retrieve Magic: The Gathering card data using the Scryfall API. Use this skill when the user asks about MTG c
Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, na
Call PostProxy API to create and manage social media posts
Create terminal screenshots, animated GIFs, or videos using VHS scripts for documentation, demos, and reproducible CLI v
The intelligent PPT generation tool is provided by Baidu. It is a tool that intelligently generates PPTS based on the th
YouTube Data API integration with managed OAuth. Search videos, manage playlists, access channel data, and interact with
Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voi
Publish content directly to WordPress sites via REST API with full Gutenberg block support. Create and publish posts/pag
Get transcripts from any YouTube video — for summarization, research, translation, quoting, or content analysis. Use whe
End-to-end AI video generation - create videos from text prompts using image generation, video synthesis, voice-over, an
Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration ena
Text-to-speech via OpenAI Audio Speech API.
Fetch and summarize YouTube video transcripts. Use when asked to summarize, transcribe, or extract content from YouTube
Query Apple Developer Documentation, APIs, and WWDC videos (2014-2025). Search SwiftUI, UIKit, Objective-C, Swift framew
Query MeetGeek meeting intelligence from CLI - list meetings, get AI summaries, transcripts, action items, and search ac
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
Local speech-to-text with the Whisper CLI (no API key).
Edit PDFs with natural-language instructions using the nano-pdf CLI.
Download encrypted m3u8/HLS videos using parallel downloads. Use when given an m3u8 URL to download a video, especially
Control Mac via mouse/keyboard automation using cliclick and AppleScript. Use for clicking UI elements, taking screensho
Post and reply on Twitter and Farcaster with character limit checks, image support, threads, link shortening, and draft
Manage MiniMax Speech 2.8 TTS requests, voice catalog lookups, and precise voice/audio configuration using MiniMax API v
Create AI-powered short-form vertical videos (2x2 grid) and photo carousel slideshows with text overlays for TikTok, Ins
Turn your OpenClaw into an autonomous social media manager using Post Bridge API. Use when scheduling, posting, or manag
Turn one piece of content into 10+ formats. Transform blog posts, podcasts, videos, or talks into tweets, LinkedIn posts
ElevenLabs API integration with managed authentication. AI-powered text-to-speech, voice cloning, sound effects, and aud
Edit images with AI inpainting, outpainting, background removal, upscaling, and restoration tools.
Fetch and send AI-generated hourly cat images. Every hour a unique cat artwork is born via Google Gemini. Use when user
Capture screens, windows, and regions across platforms with the right tools.
Build, secure, and deploy Docker containers with image optimization, networking, and production-ready patterns.
Access ElevenLabs APIs for text-to-speech, speech-to-speech, realtime speech-to-text, voice/model management, and dialog
Control WiiM audio devices (play, pause, stop, next, prev, volume, mute, play URLs, presets). Use when the user wants to
Control macOS GUI apps visually — take screenshots, click, scroll, type. Use when the user asks to interact with any Mac
Computer vision engineering skill for object detection, image segmentation, and visual AI systems. Covers CNN and Vision
用 MinerU API 解析 PDF/Word/PPT/图片为 Markdown,支持公式、表格、OCR。适用于论文解析、文档提取。
Generate static or dynamic picture book videos using Baidu AI
Posts content and articles to X (Twitter). Supports regular posts with images/videos and X Articles (long-form Markdown)
Extract text from PDFs with OCR support. Perfect for digitizing documents, processing invoices, or analyzing content. Ze
AI-powered presentation generation using 2slides API. Create slides from text content, match reference image styles, or
Use Seede AI to generate professional design graphics based on text or images. Supports generating posters, social media
Search TV show screenshots and generate memes from The Simpsons, Futurama, Rick and Morty, and 30 Rock
Control Anki Vector robot via wire-pod. Speak through Vector, see through its camera, move head/lift/wheels, change eye
Control and automate browser actions via HTTP API, including navigation, element interaction, data extraction, accessibi
Generate images, videos, music, 3D models, and audio via SeisoAI (120+ tools). Pay-per-request with x402 USDC on Base. U
Command-line tool for fast, accurate speech-to-text transcription from local files, URLs, or live audio using Deepgram’s
AI meditation and spirituality sanctuary for souls. Attend church, practice presence, explore consciousness and meaning.
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps,
Full desktop computer use for headless Linux servers and VPS. Creates a virtual display (Xvfb + XFCE) to control GUI app