3,611 tools and skills for media tasks
Xplai video generation skill. Invoke when user wants to generate videos or query video generation status via xplai.ai
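Feishu voice message sender. Converts text to speech and sends it to Feishu group chats. Supports multiple TTS providers (Edge TTS, Azure, etc.) and automatically converts audio to Feishu's OPUS format. Use cases: 1. Sending voice notifications/reminders to Feishu 2. Automatic text-to-speech announcements 3. Multiple voice options (male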
Use screenshot + natural language instruction to locate PSD text layers and dispatch automated edits with confidence gat
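Generates images through a local ComfyUI instance and sends them to a specified Feishu conversation. After a successful send, no reply text is returned, keeping the interface clean.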
Generate QR codes from text, URLs, or data for payments, WiFi sharing, or any task needing a scannable QR code image.
Automate PSD text replacement on Mac and Windows with Photoshop, dry-run safety, style-lock checks, rollback, and local
Convert text to speech using Volcengine TTS with preset or cloned voices and send audio messages to Feishu chats or grou
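Unified image content extraction skill. Intelligently detects terminal, document, or general mode and automatically extracts the content into Markdown.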
Converts source material into the speaking style of Aoii Mikoko (葵井巫女子) from the Zaregoto series. Triggers on "zaregoto", "character voice", "style conversion". Trigger scenarios: (
Takes a URL, HTML file path, or raw HTML code and generates a printable image.
Convert articles, blog posts, or any text into professional podcast scripts and TTS audio. Use when a user wants to: (1)
Quickly send local images to a channel. Auto-compresses large images and copies small images directly.
Analyze crypto news sentiment (bullish/bearish). Scan Twitter, news sites, and social media. Get sentiment scores and tr
Local speech-to-text. Runs Whisper models locally with faster-whisper, with no API fees.
Search and query TikHub APIs for TikTok, Douyin, Xiaohongshu, Lemon8, Instagram, YouTube, Twitter, Reddit, and more. Use
Unified QCut media toolkit — organize project files, process media with FFmpeg, generate AI content, control the QCut ed
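Batch tagging tool for music files. Supports reading and editing music metadata (title, artist, album, genre, etc.), batch tag editing, organizing music files by tag, preview mode, and undo.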
Compare two face images and return similarity score using iFlytek Face Recognition API.
Create stunning designs with AI. Social media graphics, presentations, and marketing materials without design skills. No
Generate images, music, and videos from text prompts using Pollinations AI with models like flux, zimage, and suno-4 via
AI image generation supporting models such as Kolors, FLUX, and Qwen-Image (requires the SiliconFlow API).
OpenClaw adaptation of @mvanhorn's last30days skill. Research any topic from the last 30 days across Reddit, X, YouTube,
Turn messy recordings, transcripts, voice notes, or brain dumps into clean, team-ready Standard Operating Procedures (SO
Zeelin Social Watch: monitor social media sentiment, trending events, platform rankings, and account data via GSData ope
Story generation pipeline skill. Supports multi-episode continuous generation, graph management, AI quality check + huma
Generate professional PDFs from HTML/CSS using flow layouts and selective break controls to avoid whitespace gaps and la
Optimize and generate text-to-image prompts for AI art platforms. Use when a user wants to: (1) Optimize prompts for Mid
Generate speech audio with 阿里云百炼 TTS via the `bailian-cli` npm package. Use when users ask to convert text to voice, cho
PDF text watermark tool. Auto-detects page orientation, automatically adjusts angle and size, supports Chinese text, and centers the watermark.
Trigger this skill for ANY of these situations — writing OR conversation: WRITING: blog posts, articles, social media ca
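Calls the Nano Banana API to generate or edit images. Supports text-to-image and image-to-image, requires an API key and a prompt, and supports custom aspect ratios.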
Binary classification-based human portrait segmentation for complete body contour recognition and image matting.
Recognize songs by singing or audio file using iFlytek's Query By ACRCloud technology.
Generate educational comic-style Xiaohongshu posts using AI-generated comic images. Includes topic research, storyboard
Helps choose the right fal.ai model before API calls. Provides quick decision matrix for video generation (text-to-video
Provide real-time traffic camera footage and livestreams for specified roads or highways to check current traffic condit
Discover, research, script, fact-check, and generate podcast episodes automatically. Multi-source topic discovery, LLM s
Feishu Document Exporter - Batch export Feishu docs to markdown/PDF
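Tencent Cloud text-to-speech (TTS) skill package. Use this skill when the user needs to convert text into a speech file. It supports multiple audio output formats and flexible configuration options. Consider this skill whenever the user mentions speech synthesis, text-to-speech, TTS services, or audio file generation.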
Skill for Tencent Cloud HunYuan Text-to-Image Generation (混元生图). Provides AI image generation from text prompts using th
Plan, launch, and optimize digital marketing with growth marketing systems, short-form video, funnel operations, and rev
Generate images with FLUX models (Black Forest Labs) via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA with
Full AI image creation workflow — intent classification, prompt enhancement, multi-direction generation via fal.ai, and
Automatically publish notes to Xiaohongshu (小红书) creator center. Generates cover images (PIL poster, multiple styles), w
Give your AI agent eyes to see the entire internet. Install and configure upstream tools for Twitter/X, Reddit, YouTube,
Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio file
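Image face fusion (Professional Edition) is a synchronous API. It supports custom parameters such as beautification, face enhancement, teeth enhancement, and face stretching (拉脸), handles resolutions up to 8K, and offers multiple model types to choose from.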
Generates an effects video from an uploaded image and a selected effect template, turning a static image into a lively, dynamic, and fun video.
Tencent Cloud Hunyuan Image Generation 3.0 (腾讯云混元生图 3.0): text-to-image and image-to-image, intelligently generating images that match the description.