3,611 tools and skills for media tasks
Use Vidau Open API to generate short videos with Veo3, Sora2, and other models, or query account credits. Register at su
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without
OpenClaw AIGC models (video + image) with parameterized per-model scripts and shared request runners. Use to generate Op
部署和使用军舰的 macOS Automator 自动化服务集合。包含 5 个实用工作流:PDF转JPG、PNG重命名并转JPG、图像拼接、解压RAR、顺序命名图像文件。一键安装所有服务到 ~/Library/Services/ 目录。使用
Generates images in Google Flow (labs.google/fx) through browser UI automation. Supports Nano Banana 2 and Nano Banana P
视频生成技能,使用 MiniMax 视频生成 API 创建视频
Use when the user wants to use agent-data-cli as a local information center for news, social media, financial data, RSS
Draft short, high-clarity replies for comments, objections, and pre-sale questions. Use when responding to public commen
AI-powered avatar accessory synthesis — automatically analyzes art style, lighting, and angle to seamlessly add hats and
Generate, schedule, and publish posts to X and LinkedIn in your voice using AI. Browse viral content, manage preferences
Create AI-generated video content (scripts, images, videos) and publish to social platforms via the Content Engine MCP s
增值税发票识别技能:自动识别 PDF(单页/多页)或各种常见图片格式(PNG/JPG等)的发票,调用百度云增值税发票 OCR API 提取关键信息,输出结构化 Excel 报告。适用于以下场景: 用户上传发票文件并要求识别、提取、转换信息时
自动分析视频内容,提取关键帧进行AI视觉分析。支持 Windows、macOS 和 Linux。首次使用自动从GitHub下载对应平台的工具脚本。
Convert meeting notes or transcripts into a clean Kanban board with owners, due dates, blockers, and next actions.
通过火山引擎 TTS 合成多音色语音,转换为 Opus 格式后,使用飞书 API 自动上传并发送语音消息。
Index local photos, videos, and creative assets into a searchable manifest with tags, dates, shoot info, and reuse ideas
Use when interacting with the Tuqu Dream Weaver photo or billing APIs for image generation, preset application, prompt e
Converts input images or prompt to 3D models using Hyper3D Rodin Gen-2 API. Use this skill when users want to generate 3
Generate images using NanaBanana 2 for daily needs or switch to NanaBanana Pro for high-quality, realistic, or final dra
Search 699pic enterprise photo/video assets, check whether an asset was already downloaded, inspect download records, an
AI图片编辑工具,支持自然语言驱动的换装、换背景、换脸、风格转换(动漫/粘土/油画等)、美颜修图。当用户需要AI图片处理、人像编辑、背景替换、风格迁移、服装更换、脸部融合时使用此skill。支持用户通过描述性prompt(如"把衣
Call the coze-js-api Douyin transcription endpoint and return transcript-ready results from Douyin URLs or share-text. U
使用火山引擎豆包模型生成图片。通过火山引擎豆包图片生成 API 创建图片。支持自定义提示词、尺寸、模型等参数。使用方式:生图:一只可爱的小猫。
Create product demo videos with voiceover, text overlays, and real browser interactions. Fully automated, zero cost. Use
当用户需要查询基金、策略、公告、财经资讯,做资产配置、组合诊断、风险回测、现金流分析,或生成图表、PDF 时,优先使用本 Skill 获取真实数据与可执行能力。
Fix garbled text in PDF/SVG vector graphics for final editing in AI. Detect, replace and repair garbled text in vector g
AI background removal service - Remove background from images using verging.ai AI technology. Supports local images and
AI图片生成与编辑工具,使用Sih.AI API进行自然语言驱动的图片处理。支持换装、换背景、换脸、风格转换(动漫/粘土/油画等)、美颜修图等功能。当用户需要通过自然语言描述来编辑图片(如"把衣服换成bikini"、&q
newtranx CLI for translate MP4 videos, Used for directly translating video files on the terminal. When you want to trans
AI short drama generation - account management, script writing, video production. Integrated X2C billing for commercial
Use Lux3D to generate 3D models from 2D images. Trigger conditions: when user asks to generate 3D model from image, imag
识别图片中的K12算式(加减乘除、竖式计算、分数、方程等),返回结构化文本结果。 支持手写体和印刷体,可拒绝非算式图片。 触发条件:用户要求识别算式、数学题、计算题图片,或上传数学题图片时调用。 关键词:算式识别、数学题、OCR、竖式计算、
MOSI Studio 指令式音色生成(moss-voice-generator): 用自然语言描述想要的音色风格,无需指定预设 voice_id, 模型根据描述实时生成对应的声音。 触发词:指令式语音、按描述生成声音、自定义音色、描述一个
全自动教学视频制作技能。根据课程主题自动生成教学视频,包含文案编写、TTS配音、画面设计、Remotion代码开发、视频导出。触发场景:用户要求制作教学视频、课程视频、讲解视频、教育内容时使用。支持竖屏(1080x1920)和横屏(1920
多平台视频/图文内容发布技能集合。支持账号管理、登录状态维护、一键多平台发布。 当用户要求发布内容到抖音、小红书、微信视频号、Threads、Instagram,或管理发布账号时触发。
使用慧穗云发票识别 API,通过上传发票影像文件(图片、PDF、OFD、ZIP)自动识别发票信息。
This skill provides audio sleep aid recommendations and guidance for users experiencing insomnia or sleep-related issues
Turn app screenshots into structured UX, copywriting, and conversion audits with issue severity and recommended fixes.
支持查询、绑定及切换火山引擎 TTS 机器人音色,设置默认音色并生成测试音频,配置自动保存生效。
Use this skill whenever a user uploads a large image and wants to see interesting details, highlights, or close-ups crop
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Generate photorealistic rendering scripts for PyMOL and UCSF ChimeraX to create publication-quality molecular visualizat
Use the official MinerU (mineru.net) parsing API to convert a URL (HTML pages like WeChat articles, or direct PDF/Office
Conduct FTO patent searches and infringement risk analyses by querying granted and pending patents based on technical de
Virtual gene knockout simulation using foundation models to predict transcriptional changes
Record experimental procedures and observations via voice commands during lab work. Real-time transcription for structur
AI-powered invoice scanning and data extraction from images and PDFs. Use when: (1) user sends an invoice image/PDF to s
使用极速数据 VIN 识别 API,对车辆挡风玻璃或行驶证上的车架号图片进行识别,返回 VIN 及品牌、厂家信息。
Automatically convert X (Twitter) posts into highly engaging viral videos using Gemini scriptwriting and HeyGen AI Avata
Generate music via Suno with the local browser-backed flow. Use when the user wants Suno songs, instrumental tracks, lyr