3,611 tools and skills for media tasks
Generate professional PDF invoices from natural language or structured data. Use when the user asks to create an invoice
B站创作助手。视频标题优化、标签推荐、简介模板、投稿策略、UP主运营、弹幕互动。Bilibili video creator assistant. B站运营、视频SEO、粉丝增长、投币收藏、充电计划。Use when creating co
支持双引擎的PDF OCR识别技能,可从影印版PDF文件和图片文件中提取文字内容
Generate Bilibili video titles, descriptions, tags, scripts, and content strategy recommendations for video creators.
Four days. Fifty thousand voices. The longest-running country music festival in the world. Nashville is going to teach y
A voice-learning writing assistant that helps you communicate in your own style — not generic AI prose. Learns how you w
Smart web content fetcher - articles and videos from WeChat, Feishu, Bilibili, Zhihu, Toutiao, YouTube, etc. Triggers: '
Generate images using a local SGLang-Diffusion server (FLUX, Qwen-Image, etc.)
Give your AI agent eyes to see the entire internet. 7500+ GitHub stars. Search and read 14 platforms: Twitter/X, Reddit,
Generate and translate video subtitles using WhisperX and LLM translation. Use when processing video files to create .sr
通用视频下载工具,支持 YouTube、B站、抖音等主流平台。使用 yt-dlp 下载视频,自动选择分辨率、合并音视频、清理文件名。
图片生成技能,当用户需要生成图片、视觉信息图、创建图像、编辑/修改/调整已有图片时使用此技能。基于中国的API易代理站(https://apiyi.com/)的NanoBanana2模型的图片生成服务,无需访问外网。支持14种宽高比的图片比
使用 MinerU API 解析 PDF 文件并根据内容自动重命名输出目录。当用户要求解析 PDF 文件时触发此技能。
Intelligent workplace inspection system with guided setup, configurable inspection tasks, AI-powered image analysis, and
Download videos, images, and audio without watermarks from 999+ platforms (TikTok, YouTube, Instagram, Twitter, Bilibili
解析医疗险条款文档并提取32个结构化字段,含14个核心字段和18个增强字段,支持PDF/DOCX/TXT格式,输出标准JSON
通过自然语言调用火山引擎API生成AI视频或查询视频生成任务状态和结果。
Use PoYo AI's Nano Banana 2 image generation and editing models through the `https://api.poyo.ai/api/generate/submit` en
Automatically back up images to stardots.io cloud storage with secure MD5 authentication and configurable credentials.
Video translation / dubbing skill. Translate user-provided video (file or URL) and return preview_url. 适用于视频翻译、视频配音、字幕翻译
AI face swap service - Use verging.ai AI face swap directly from command line. Supports local video files and images, re
调用 OCR.space 免费 API 识别图片中的文字
全自动"重生爽文"短视频流水线。给定题材,依次完成: AI生成小说 → TTS语音合成 → FFmpeg竖屏视频合成。 触发关键词:生成爽文、生成小说视频、重生爽文流水线、 novel pipeline、tts 合成视频
Designs product option structures and live personalization preview flows for custom gift stores (e.g. engraved necklaces
Use this skill when users need to extract text from images, PDFs, or documents. Supports URLs and local files. Returns s
Automatically converts received voice messages to text via an external ASR service, supporting multiple audio formats an
Automatically upload images to Stardots.io cloud storage, manage files, and obtain secure access links using API authent
Professional AI video generation with cinematic prompt optimization, auto-detection of optimal generation backends (Comf
MinerU document parsing CLI with layout.json post-processing and S3 integration. Parse PDF/Word/PPT/images to structured
将PDF文件的每一页转换为图片文件;支持自定义图片格式(PNG/JPG)和分辨率;适用于文档处理、图片化存档等场景
Use PoYo AI's Sora 2 video generation models through the `https://api.poyo.ai/api/generate/submit` endpoint. Use when a
图片尺寸调整和压缩工具技能。用于按指定像素宽高、比例或最大尺寸限制调整图片大小,并支持智能压缩到指定文件大小。适用于需要批量处理图片、生成特定尺寸缩略图、压缩图片以满足文件大小限制等场景。
Manage your entire social media from AI — post, schedule, and analyze across Facebook, Instagram, TikTok, YouTube, Linke
Download videos, images, and audio without watermarks from 999+ platforms (TikTok, YouTube, Instagram, Twitter, Bilibili
Async AI image generation (text-to-image and image-to-image). Submit a job to get a task_id, then poll status to get an
Generate images with FLUX models (Black Forest Labs) via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA with
使用科大讯飞 API 将音频/视频转换为文字。支持本地音频文件转录、YouTube 视频下载并转文字。适用于会议记录、视频字幕、语音笔记等场景。当用户需要语音转文字、音频转录、YouTube 视频转文字时触发。
Use when the user wants to generate product detail images or carousel/main images for e-commerce platforms like Taobao.
提供使用摄像头拍照, 录制视频或直接生成gif的能力。何时触发: 需要拍照时, 需要观察一段时间当前视野时, 需要关注某件事情的进展时.
Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers i
Convert text to podcast audio using Tencent Cloud TTS. Supports both short and long text processing, generates up to 30-
Convert multiple HTML page elements into separate high-resolution images with customizable settings and automatic file n
Parse PDF, DOC, DOCX, and image files to Markdown or JSON using UniDoc API with sync or async mode and automatic status
Control Android devices via adbclaw CLI — tap, swipe, type, screenshot, UI inspection, and app management. Use when: (1)
Assist macOS users in preparing, converting, exporting, and troubleshooting Word, PDF, Markdown, PowerPoint, and Excel f
智能简历解析系统,支持PDF/Word/图片格式简历的结构化信息提取、岗位匹配度分析、优化建议生成。完全本地运行,无需外部API。使用场景:(1) 解析上传的简历文件提取核心信息,(2) 输入岗位JD计算简历匹配度,(3) 生成简历优化建议
Use PoYo AI Flux 2 through the `https://api.poyo.ai/api/generate/submit` endpoint. Use when a user wants to generate or
Use PoYo AI Grok Imagine Video for short text-to-video and image-to-video generation with motion-style controls through
支持一键将视频批量上传至抖音、快手、视频号、B站、YouTube 和 TikTok,具备凭证管理和失败自动重试功能。
Use PoYo AI Hailuo 02 for prompt-optimized video generation and image-to-video workflows through the `https://api.poyo.a