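3,621 tools and skills for media tasks
字幕菌 (zimujun): Extracts video scripts and subtitle text from links on mainstream video platforms. Works with YouTube, TikTok/Douyin, Xiaohongshu, Bilibili, and other platforms.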
SenseAudio Text-to-Speech (TTS) API for converting text to natural speech. Supports synchronous and SSE streaming modes,
SenseAudio Music Generation API for creating AI-generated lyrics and songs. Supports lyrics generation, song generation
Automates image creation on Vheer.com via Playwright, generating images from text prompts and saving them locally withou
Set up reliable off-site Reolink camera backup when cameras are remote (4G/5G/LTE) and home inbound networking is constr
Give AI hands to control any Mac app. Auto-discover installed apps, generate CLI wrappers, return structured JSON. Works
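Collects official master's program admissions information from all universities in Hong Kong (22 public and private institutions in total), including tuition, application periods, deadlines, English requirements, program details, and official links, and exports it in five formats: Excel, Word, PDF, HTML, and Markdown. Uses only official university data sources, strictly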
Bring your agent to life. Use when creating a cute, high-quality, memorable character for an OpenClaw-style agent from r
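Generates high-quality Chinese speech with the zh-CN-XiaoxiaoNeural voice from Microsoft Edge TTS, automatically falls back to the system's built-in voice on failure, and supports sending voice files via QQ.
Generates a single meme image from input-method speech-recognition text or its polished version. Use when the user wants to turn a sentence into a chat-ready meme, captioned joke image, or sticker. Accepts both raw speech text and polished text, preferring polished text by default; automatically infers the style from the tone of the text; by default directly generates an image with text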
Enable automated publishing, media upload, draft management, and status querying for WeChat Official Account articles vi
Turn reminiscence, oral-history, or life-review transcripts into clear narrative biography drafts while preserving the s
Turn a vague emotion, theme, or aesthetic idea into a structured visual mood board blueprint. Use when the user wants a
Build sustainable creative practice routines for artistic skills. Use when the user wants a practice habit for music, dr
Download videos from 1000+ sites, torrents, and magnet links directly to 115 or Quark cloud storage via server-side or l
Analyze images (jpg, png, gif, webp) and PDFs via CLIProxyAPI — a Claude Max proxy that routes requests through your sub
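DingTalk meeting management skill. Supports creating instant video conferences and scheduled meetings (with calendar events, automatically linked video conferences, and attendee invitations). Use when needing to create DingTalk video conferences or schedule meeting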
Plume AI Image Generation & Editing Service. Automatically triggered when users send images or describe image needs.
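Automatically converts a Tsinghua graduation thesis in Word (.docx) into a complete, high-quality PDF that conforms to the official thuthesis template, with handling for chapters, abstracts, figures and tables, references, and more.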
AI-powered palm reading and analysis from palm images. Supports English, Tamil, Telugu, Kannada, and Hindi. Use when a u
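Video transcription workflow for Bilibili and YouTube videos. Automatically detects whether subtitles exist: fetches them if so, otherwise downloads the audio and transcribes it with whisper. Triggers: (1) the user asks for a summary of a video (2) the user asks for a video's subtitles (3) the user asks to transcribe a video (4) handling Bilibili/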
Convert images or text prompts into 3D models using 3D AI Studio's API with options for quality, textures, and model typ
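Runs pdf2zh-next through a unified script. Supports single or multiple PDFs, single or multiple directories, and conditional filtering of PDFs within a directory; uses SiliconFlowFree as the default translation service; can read an existing configuration or write a custom provider; stops on the first failure and returns the error.
Voice reply skill - generates speech with iFlytek TTS and sends it to Feishu. Use when a user message should be answered by voice. Trigger phrases: 用语音, 语音回复, 切换语音模式, 语音模式.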
Publish articles to WeChat Official Account (微信公众号) draft box via API. Use when the user wants to upload a Markdown arti
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, a
Extract text and tables from PDFs, fill in forms, and merge documents.
Use SAM (Segment Anything Model) to remove image backgrounds and extract foreground subjects as transparent PNGs. Use wh
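Xiaohongshu image generation skill - AI image generation for niches such as home decor, food, and fashion, with support for multiple generation methods and size optimization.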
One-stop multi-platform video publishing workflow. From video clipping to multi-platform publishing, full process automa
Generate images and videos via WangYi Banana API (nano-banana, SORA2). Supports text-to-image, image-to-image, text-to-v
This skill should be used when the user wants to "build a resume", "create a resume", "update m
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).
Send prompts to a local ComfyUI instance to generate images based on user descriptions.
Runs pdf2zh-next through a unified script. Supports single or multiple PDFs, batch processing of directories, and glob filtering; when no provider is specified, config.toml takes effect; when a provider is specified, it is passed to the main program via the official --<Services> argument;
Create explainer videos with narration and AI-generated visuals. Triggers on: "解说视频", "explainer video"
Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate im
Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "
Send user prompts to local ComfyUI at http://127.0.0.1:8188 using workflow.json to generate images.
Generate original background music for short videos from a natural language description. Use when creators need royalty-
Recommend the best SenseAudio voice for any scenario or emotion. Use when users ask which voice to use, e.g. "儿童故事
Zero-config text-to-speech: give text, get an mp3 file. Handles natural-language voice selection ("用女声", "
Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user
Fix Chinese polyphone (多音字) mispronunciation in TTS by auto-detecting ambiguous characters and applying pinyin annotatio
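Multi-source intelligent content processor: supports WeChat Official Account articles, web pages, YouTube, PDF, Markdown, and more, automatically uploading them to NotebookLM and generating podcasts, PPT slides, mind maps, and other formats.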
Automated drama video generator - from script to multi-character drama videos with OpenAI TTS, Whisper, and Remotion
This skill should be used when the user wants to optimize an epub file for e-ink readers (墨水屏电子书). It handles image dedu
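Creates a video generation task from user-provided text and images (image URLs and local paths are supported) and submits it to a remote video service using an API key.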
Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables fr