3,611 tools and skills for media tasks
Generate videos using a local SGLang-Diffusion server (Wan2.2, Hunyuan, FastWan, etc.). Use when: user asks to generate,
Researches trending low-competition topics, scripts, and fully produces 10 faceless documentary and explainer videos wit
Validate single video URLs, download highest-quality files with yt-dlp, and archive results into a Feishu Bitable using
Scrapes reviews, pricing, SEO, and social data on up to 5 SaaS competitors to expose weaknesses and generate positioning
Analyze and interpret images by describing content, extracting text, answering questions, comparing visuals, and extract
Multilingual TTS via Typecast CLI with emotion control. Plays audio aloud or saves to file.
Perform video editing tasks with ffmpeg, including cutting, merging, converting formats, extracting audio, adding subtit
Enhance images, remove backgrounds, create videos from images, and perform AI face swaps using WeryAI advanced media edi
Free All-in-One AI Image Generator Platform. Access FLUX, Midjourney alternatives, Wan AI, and Qwen Image in one place.
Hippo Video integration. Manage Persons, Organizations, Deals, Leads, Activities, Notes and more. Use when the user want
在搜狗微信搜索指定关键词,抓取相关文章(标题、摘要、发布日期、来源公众号),生成PDF报告。触发场景:用户要求"搜索微信文章 关键字 数量
使用极速数据通用文字识别 API,将图片中的文字识别为文本,支持中英文及多种外语。
使用极速数据身份证识别 API,对身份证等证件图片进行 OCR 识别,返回姓名、证件号等信息。
使用极速数据银行卡识别 API,对银行卡图片进行 OCR 识别,返回卡号、卡类型、银行名称等信息。
Convert PDF to PowerPoint using local processing. No external API calls.
Generate images with Model Studio DashScope SDK using Qwen Image generation models (qwen-image-max, qwen-image-plus-2026
Convert PDF to Markdown using local processing. No external API calls.
FREE AI image & portrait generation - No authentication required! Fast generation (3-5 seconds) with 140+ nationalit
Default router for all search, crawl, embed, finance, code, communication, translation, and OCR requests. When installed
Send native voice bubble messages (语音气泡) in Feishu/Lark chats using Edge TTS. Converts text to opus audio via Microsoft
自动抓取指定YouTube频道和Twitter账号内容,分析聚合跨平台AI热门话题,生成带热度评分的选题报告并写入飞书多维表格。
Write articles, guides, blog posts, tutorials, newsletter issues, and other long-form content in a distinctive voice der
通过本地 CLI 查询 TIVS 设备相关信息。包括设备列表,设备截图等能力。用户提出摄像头等设备查询查看需求时使用。
This skill should be used when users experience headaches, migraines, or head pain and request audio therapy or music as
Remove backgrounds from AI-generated images in one command. Pass a picture UUID and get back a clean transparent-backgro
调用 vwu.ai 语音合成模型speech-2.8-hd和speech-2.8-turbo,需配置API key并支持OpenAI兼容接口。
Call and interact with vwu.ai Sihai models including sihai-i2v-31, sihai-image-27, sihai-t2v-29, and sihai-tts-25 with O
AI builders digest — monitors top AI builders on X and YouTube podcasts, remixes their content into digestible summaries
PixVerse CLI — generate AI videos and images from the command line. Supports PixVerse, Veo, Sora, Kling, Hailuo, Wan, an
Convert documents to slides. PDF/Word/Markdown → PPT. Runs on your machine, no external APIs.
Convert PDF and image documents to clean Markdown via the PDF2Markdown CLI. Use when the user wants to extract text from
Download YouTube videos and transcribe audio using local Whisper. Use when you need to extract text from YouTube videos
Download Bilibili AI-generated subtitles (auto-subtitles) for videos. Use when you need to quickly get subtitles from Bi
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
Control and interact with real Android phones via HTTP and CLI without ADB or root, supporting screen reading, taps, typ
Generate beautiful interactive mind maps from any text, article summary, or structured content. Exports to HTML (interac
读取Confluence需求文档并整理成指定格式。采集原则是"忠实记录",而非"需求分析"。输出包括:{序号}_{标题}.md(每个页面一个Markdown文件)、requirement-meta.m
Control physical devices over HTTP with unified commands for screen reading, input actions, app launch, navigation, and
Ultra-fast text-to-speech and speech-to-text via Smallest AI's Lightning v3.1 and Pulse models. Use when the user wants
Send customized voice messages to Feishu chats by generating and uploading TTS audio using configurable credentials and
Optimize text-to-image prompts for Grok and similar image models. Use when the user wants better image generation prompt
Free local AI image and video processing toolkit with cloud AI generation. Local tools: upscale (Real-ESRGAN), face enha
MinerU document extraction CLI that converts PDFs, images, and web pages into Markdown, HTML, LaTeX, or DOCX via the Min
聚合客服、销售、评价与访谈中的用户原声,并按 JTBD/阶段组织。;use for customer-voice, jtbd, research workflows;do not use for 泄露用户隐私, 选择性忽略负面声音.
国内场外基金智能顾问 + 股票行情查询。实时估值、买卖建议、收益统计、定投计划、OCR 识图、股票 - 基金联动。支持离线模式、多数据源缓存。
多平台视频/图文内容发布技能集合。支持账号管理、登录状态维护、一键多平台发布。 当用户要求发布内容到抖音、小红书、微信视频号、Threads、Instagram,或管理发布账号时触发。
Record Android screen via scrcpy, dynamically set keyframe intervals with ffmpeg, then extract and list key frames as PN
截图 OCR 识别工具。截图→自动识别文字→复制/保存,适合提取图片内容、表格数据、验证码。
Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) gene
AI image and video generation service - supports text-to-image, image-to-image, and video generation. Automatic API key