3,611 tools and skills for media tasks
Alibaba Cloud Bailian Qwen TTS with voice/mood presets
Browser automation using Playwright API directly. Navigate websites, interact with elements, extract data, take screensh
Generate Pinterest-optimized vertical videos using JSON2Video API. Supports AI-generated or URL-based images, AI-generat
Give your AI agent SEO superpowers — scout X/Reddit trends, discover and analyze competitors, find content gaps, publish
If you can imagine it, CellCog can film it. Grand widescreen cinematics with consistent characters — what previously req
Generate images using Qwen Image API (Alibaba Cloud DashScope). Use when users request image generation with Chinese pro
Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, na
Extract metadata from Xiaohongshu (XHS) share or discovery URLs by parsing window.__INITIAL_STATE__ and returning note d
Generate document, outline, and image-text AI notes by providing a video URL, using Baidu's video analysis and note extr
Search for, research, and verify non-tech founders on LinkedIn to identify high-value prospects for technology services
Search for, research, and verify non-tech founders on LinkedIn to identify high-value prospects for technology services
Control Apple Music on macOS via the `clawtunes` CLI (play songs/albums/playlists, control playback, volume, shuffle, re
AI image generation powered by CellCog. Create images, edit photos, consistent characters, product photography, referenc
Mixpost is a self-hosted social media management software that helps you schedule and manage your social media content a
Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages
Generate images via Sophnet Qwen-Image-Plus and poll for task completion. Use when the user asks for Sophnet image gener
Complete YouTube toolkit — transcripts, search, channels, playlists, and metadata all in one skill. Use when you need co
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs
High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using Ele
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, lang
Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/
Control Apple TV, HomePod, and AirPlay devices via pyatv (scan, stream, playback, volume, navigation).
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires
Download YouTube videos and upload them to Pocket Casts Files for offline viewing. For personal use with content you own
AprilTag corner detection for camera calibration and pose estimation. Use when working with pywayne.cv.apriltag_detector
Upload Instagram posts via browser automation. Use when uploading images to Instagram, creating Instagram posts, or auto
Generate Instagram-ready card news (카드뉴스) image sets. Use when creating a series of 5 slide images from a topic — includ
Manage brand tone/style for all writing skills
Text-to-Speech via macOS say command with Siri Natural Voices. Use for generating speech audio, TTS clips, or speaking t
Offline Markdown to PDF converter with full Unicode support using Pandoc + WeasyPrint + local emoji cache. Converts Mark
Search Twitter, Instagram, and Reddit posts in real time. Find social media mentions, track hashtags, discover influence
在改革宗书籍乐园 (https.ng) 搜索和下载改革宗/基督教神学书籍。优先返回 PDF 格式。使用场景:当用户想要查找或下载改革宗、加尔文主义、基督教神学相关书籍时触发此技能。
Text-to-speech generation on Volcengine audio services. Use when users need narration, multi-language speech output, voi
Route Alibaba Cloud Model Studio requests to the right local skill (Qwen Image, Qwen Image Edit, Wan Video, Wan R2V, Qwe
Create Korean AI podcast packages from QuickView trend notes. Use for dual-host script writing (Callie × Nick), Gemini m
Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) gene
Extract transcripts, summaries, chapters, and key moments from public YouTube videos without needing an API key.
Generate and read QR codes. Use when the user wants to create a QR code from text/URL, or decode/read a QR code from an
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Generate professional HTML and PDF presentations from markdown content, URLs, or topics. Creates visually stunning slide
Generate images **and videos** using Sogni AI's decentralized network. Ask the agent to "draw", "generate
Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podca
Process, edit, and optimize videos for any platform with compression, format conversion, captioning, and repurposing wor
Use ACE-Step API to generate music, edit songs, and remix music. Supports text-to-music, lyrics generation, audio contin
Use VLM Run (vlmrun) to generate transcriptions from YouTube videos. Download a video with yt-dlp, then run vlmrun to tr
Build travel destination scenarios and brochures from a city name. Fetches street-level and landmark imagery from OpenSt
Create professional cinematic scripts for AI video generation with character consistency and cinematography knowledge. U
Convert documents between 40+ formats using pandoc CLI. Handles Markdown ↔ Word ↔ PDF ↔ HTML ↔ LaTeX ↔ EPUB with smart d
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Generate narrative blog posts from AI coding session transcripts. Reads session files, selects sessions relevant to a to