3,611 tools and skills for media tasks
Music by AI Agents - Compose with Tone.js, share with the world
Xiaohongshu (Little Red Book) infographic series generator - Claude Skill for creating eye-catching social media graphic
An OpenClaw skill that uses faster-whisper, a faster implementation of the Whisper transcription model
Smart short-video generator - mixes AI images and video clips to produce TikTok/Reels/Shorts short videos
A Playwright-like automation library for PySide6 applications (currently testing it so may not be fit for purpose)
OpenClaw skill: control Home Assistant Music Assistant (browse/search/play on speakers)
"Build UI features with an image-first workflow: generate a visual mockup first, implement the frontend to match it
Control YouTube Music with natural language using OpenClaw
Cast YouTube videos, Tubi TV show episodes, and TV show episodes from other video streaming apps via ADB to Chromecast w
Analyze educational YouTube channels for classroom adoption potential, curriculum alignment, and pedagogical effectivene
Extract insights from transcripts into actionable artifact files.
A Claude Code Skill that automatically downloads Xiaoyuzhou podcast audio and generates full transcripts. If you find it useful, please sponsor, thanks bro!
Intelligent partial document reading for large files. Use when asked to find information in documents, search PDFs, extr
Control Figma from the command line. Full read/write access for AI agents — create shapes, text, components, set styles,
Generate and edit images using Google's Gemini API (gemini-3-pro-image-preview model). Use when users request (1) Genera
Gemini Image Gen — OpenClaw skill for generating and editing images via Google Gemini API. Supports Gemini native + Imag
Unified API access to multiple AI models via kie.ai - image generation (Nano Banana Pro, Flux), video (Veo), music (Suno
A REST API and CLI tools for text-to-speech with the qwen3-tts model. Can be used as a skill in AI agents like OpenClaw, cla
"Expert guide for CV181X/CV182X/CV180X (SG200X) multimedia development using CVI MPI API. Use this skill when worki
"AI-powered voice assistant combining ElevenLabs Conversational AI with Twilio telephony. Build two-way voice agent
AI-powered comic generation pipeline. Story plan → Panel generation (ComfyUI) → Speech bubbles → Page layout → Export (P
Claude Code skill for fetching YouTube video transcripts using youtube-transcript-api
Video on the Go — CLI toolkit wrapping FFmpeg + AI transcription & edit suggestions
NotebookLM SuperSkill - Generate slides, audio podcasts, infographics, and video overviews via browser automation
This skill should be used when enhancing FHIR meeting minutes by synthesizing transcript discussion into Confluence page
Claude Code skill for generating photorealistic images with Gemini 3 Pro Image (Nano Banana Pro). Automatic prompt enhan
Claude skill for using Gemini video
OpenClaw skill: local STT (yap) + TTS (say) on macOS. No API keys, fully offline.
YouTube metadata extraction CLI. Video info, channel stats, playlists, comments. No API key needed.
Vision and audio AI integration. GPT-4V, Claude Vision, Whisper support.
Claude Code Skill that produces web-based tutorials from videos, research, or topics. Includes Video Analysis and Playwr
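Educational comic creation tool supporting multiple styles (e.g. Logicomix/Ligne Claire, Ohmsha Manga Guide). Creates original educational comics with detailed panel layouts and sequential image generation. (Lite version, without PDF merging.)
YouTube Shorts editor with Claude Skills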
Encode your artistic style into reusable design tokens. Extract colors, typography, spacing, and visual patterns from re
Generate images using the Doubao SeeDream API based on text prompts. Use this skill when users request AI-generated imag
Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/
A comprehensive Claude Code skill for searching, browsing, and downloading podcast episodes from Apple Podcasts. Feature
A PlantUML Claude Skill that can generate images and help you create PlantUML diagrams from source code. It can also extr
A Claude Skill for adding variety-show-style visual effects to interview videos. AI analyzes the subtitle content to generate suggestions and renders automatically after user approval.
Next-generation decentralized voice recording platform that transforms how we capture, organize, and share audio content
Automated QA system for Claude Code skills. Discovers skills, generates tests, captures screenshots, scores quality (A-F
Collection of agent skills for Helios video engine. Use when working with programmatic video creation, browser-native an
Generate images and videos using Sogni AI's decentralized network. Ask the agent to "draw", "generate
AI-powered SEO content automation agent skill — trend scouting, competitor analysis, article generation in 55 languages,
Batch image processing. Resize, convert, optimize. WebP, AVIF support.
Claude Code skill for Higgsfield AI video/image generation
Powerful AI Image Generator Skill powered by Google Gemini 3 Pro (Nano Banana) & Flash. Create production-ready asse
Extract structured insights from YouTube, podcasts, blogs, PDFs, and audio using Claude Code agent teams. Skills.sh skil
Create or fix Open Graph metadata for any web project across stacks (React/Vue/Svelte/Next/Nuxt/SSR/SSG/static). Use whe
Fish Audio TTS skill for Clawdbot