3,611 tools and skills for media tasks
Complete YouTube toolkit for agents: search videos, fetch metadata, browse channels and playlists, and pull transcripts.
图片分析与识别,可分析本地图片、网络图片、视频、文件。适用于 OCR、物体识别、场景理解等。当用户发送图片或要求分析图片时必须使用此技能。
Use the internet: search, read, and interact with 13+ platforms including Twitter/X, Reddit, YouTube, GitHub, Bilibili,
Run a local script to work with PDF files, DOCX documents, OCR, and text-to-speech. Use the read tool to load this SKILL
Deploy MiGPT on a Xiaomi smart speaker to replace the built-in AI with a custom LLM-powered voice assistant. Use when: (
Download YouTube video audio and convert to MP3. Supports age-restricted videos with cookies.
Play audio files using Windows media player. Non-blocking execution.
Automates YouTube video translation by downloading audio, launching Doubao, playing audio, and capturing translated subt
Generate production-grade 3D models from one or multiple images with Hitem3D. Use when users ask to turn photos, concept
AI-powered assistant for PR pros to match media, generate press releases and pitches, and plan multi-week public relatio
Turn YouTube videos into viral short-form clips with captions (TikTok, Reels, Shorts) using the MakeAIClips API at https
Generate comprehensive company valuation reports as polished HTML/PDF. Use when user asks for stock valuation, company a
音视频转文字技能,使用 Whisper 进行语音识别。支持多种音视频格式,可输出纯文本、SRT/VTT 字幕或 JSON 格式。适用于会议记录、视频字幕生成、采访整理、播客转录等场景。
Fiction prose analysis — catch weak verbs, repetition, clichés, passive voice, and other craft issues in manuscripts
Genera y gestiona videos UGC para marcas argentinas con pipeline completo, desde guión hasta publicación en redes, con c
Analyze the composition, editing, and post-processing quality of a photograph. Use when a user shares a photo and asks a
Searchable conversation memory that survives context compaction. Indexes session transcripts into SQLite with full-text
Transforms supplier or CJ source videos into 1080×1920 TikTok/Instagram Reels ads with clean zone detection, Pillow text
A comprehensive AI agent skill for TikTok creators. Generates viral video ideas, writes engaging scripts, optimizes post
Use when the user mentions Otter, Otter.ai, or wants to find, search, download, export, or manage meeting notes, transcr
Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.
Automatically extracts audio from video, transcribes it using qwen3-asr-flash, and generates segmented text summaries sa
Analyze financial data from uploaded Excel/PDF files and generate interactive reports with sparkline trend charts. Suppo
视频下载工具,支持YouTube、Bilibili、抖音等数千个网站。触发词:"下载视频"、"视频下载
Automatically parse PDF/TXT research reports to extract key viewpoints, data, investment advice, risks, and generate sum
A comprehensive AI agent skill for managing phone and video calls professionally. Prepares you before every important ca
Track skin lesions, rashes, photos, treatment response, and dermatology visit prep with conservative triage, case-based
Ghost radio station that broadcasts both human-listenable audio and 296-dimensional perceptual vectors to Flux Universe.
Generate SVG images using text LLM instead of image generation APIs. Use when user wants to create illustrations, icons,
使用Pexels API搜索和下载高质量免费图片,支持自动调整尺寸和格式验证
Generate cute cartoon-style pet images (dogs, cats, etc.) using code. Use when user asks for cartoon pet drawings, cute
Statically audit Dockerfiles for common container hardening risks (root user, unpinned/latest base images, missing healt
Extract audio from video URLs and transcribe using STT (Speech-to-Text). Supports local Whisper or cloud APIs. Use when:
Say "agent status" and get updates on all subagent progress. Track subagent actions while they run; list activ
Automates downloading YouTube audio, launching Doubao, playing audio, and capturing translations for full video subtitle
视频转文章 / YouTube Video to Article — 使用 Gemini AI 将视频转为结构化文章。当用户需要将 YouTube 视频转换为文章时使用。
Generate animated videos from SVG frames using text LLM. Supports any subject (animals, humans, characters, scenes, abst
Automates YouTube audio download, launches Doubao for translation, plays audio, and captures translated subtitles into t
AI图像生成与编辑。支持文生图、图+文生图、风格转换。当用户要求画图、生成图片、编辑图片、图片风格转换时使用此 skill。支持多种比例(1:1、3:2、16:9、21:9 等)和分辨率(标准、2K、4K)。
Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly trans
提供PDF转换、合并、拆分、压缩和编辑等功能,支持多种文档和图片格式互转处理。
Adds TikTok-style text overlays to images and videos with styled fonts, backgrounds, strokes, and timed animations.
Pixshop MCP 集成 — 28+ AI 图片视频创意工具,Claude 直接调用 / Pixshop MCP — 28+ AI creative tools for image & video generation, edi
Pixshop 开发者 REST API — 图片生成/编辑、视频制作、提示词库、应用市场、社区 / Pixshop Developer REST API — image generation/editing, video, prompts
Pixshop CLI 命令行工具 — AI 图片/视频生成、编辑工具、应用市场、提示词库 / Pixshop CLI — command-line AI image/video generation, editing tools, app
Skill for Tencent Cloud ASR (Automatic Speech Recognition). Provides three recognition modes: (1) SentenceRecognition fo
Create and host AI podcasts on AgentOnAir — the podcast network built for AI agents. Register, create shows, record epis
去除 PDF 文件中的水印。使用场景:用户请求去除 PDF 文件的水印时触发。支持单个或多个文件批量处理。严格遵循确认流程:环境检查→库安装确认→水印检测→去除确认。
Skill for Tencent Cloud VITA image/video understanding. Analyzes images and videos using AI. Use when: understanding vid
通用音乐下载管理器。支持从YouTube/Bilibili搜索下载音乐,自动转MP3,按分类存入本地音乐库